Justin Cheong

Software & Data Engineer · Business Analytics @ NUS

Full-stack engineer and analyst focused on building the technical infrastructure required for systematic research. My experience ranges from architecting RAG-based search engines and data pipelines to developing regime-adaptive backtesting frameworks for equity strategies.

SchoolNational University of Singapore
MajorBusiness Analytics
GPA4.88 / 5.00
GraduatingDec 2027

Experience

Where I've worked

Software Engineer Intern
Garda Capital Partners
CurrentJune 2026 – Present
  • Architecting high-throughput time-series analytics pipelines using PostgreSQL and TimescaleDB to aggregate and process intraday DTCC Swap Data Repository (SDR) ticks
  • Implementing dynamic time-bucketing and continuous aggregates for swap volume profiling, eliminating expensive data backfills and accelerating downstream DV01 risk reporting
  • Migrating the desk's swap analytics layer from legacy C# jobs and Oracle to PostgreSQL, building C# data-access components and normalized schemas that join vendor SDR feeds with internal yield-curve DV01 analytics
  • Building Python report pipelines that synthesize daily sell-side research into AI-generated market briefings delivered to the Fixed Income Relative Value trading desk
PostgreSQLTimescaleDBPythonC#
Software Engineer Intern
Algebris Investments
Dec 2025 – June 2026
  • Engineered a production-grade multi-agent LLM pipeline on Databricks to process daily sell-side research into consolidated macro briefings, utilized by 50+ traders daily and saving 2-3 hours of manual synthesis per reader
  • Architected a tiered Bronze-Silver-Gold Delta Lake infrastructure applying SCD Type-2 patterns, ensuring ACID-compliant versioning and full historical traceability of data outputs
  • Implemented end-to-end LLMOps using MLflow for complete observability and integrated the Instructor library with Pydantic to enforce typed schema outputs, eliminating runtime parsing errors
  • Reduced AI search index rebuild time by 96% (26 min to under 1 min) by parallelizing the full ingestion pipeline — including Mistral OCR, Azure Document Intelligence, and OpenAI embeddings — using Python ThreadPoolExecutor with per-stage worker pools and a thread-safe sliding-window rate limiter
  • Architected the downstream RAG retrieval engine using Azure AI Search, implementing a hybrid lexical-semantic fusion pipeline with a Cross-Encoder reranker to optimize search relevance by 25% (NDCG@10) and reduce P95 latency by 30%
DatabricksDelta LakeMLflowAzure AI SearchRAGPython
Analyst
NUS Student Investment Fund
CurrentJan 2026 – Present
  • Design and backtest systematic and factor-based equity strategies for the student-managed portfolio
  • Help maintain research and data infrastructure for screening, backtesting and performance attribution
Systematic strategiesBacktestingDashboarding
Data Engineer Intern
Monee
Aug 2025 – Nov 2025
  • Optimized distributed Spark Structured Streaming pipelines to normalize multi-source datasets, reducing end-to-end data latency by 60% for critical risk monitoring
  • Managed and deployed workflows using Airflow, automating complex maker-checker reconciliation and improving data availability by 30%
Spark Structured StreamingPySparkSparkSQLAirflowETL pipelines
Teaching Assistant
National University of Singapore (NUS)
Aug 2025 – Dec 2025
  • Achieved an overall effectiveness rating of 4.7/5.0, surpassing the School of Computing faculty average (4.3) by ~9%
  • Improved soft outcomes with a 4.6/5.0 score for enhancing students' industry readiness and team effectiveness
TeachingCommunication
Software Engineer Intern
Finexis Advisory
May 2025 – Aug 2025
  • Architected an AI-powered workflow (Python, Google STT, OpenAI) to extract signals from unstructured data, reducing manual processing time by 85% and enabling same-day data synthesis
PythonGoogle STTOpenAI
Consulting Intern
EY
May 2024 – Aug 2024
  • Performed statistical analyses in R to support technology and transformation initiatives for financial services clients
  • Translated quantitative findings into recommendations for process optimisation and risk management
RStatistical analysisConsulting

Projects

Things I've built

Macro RAG Analytics System
Dual-path RAG over 100k research artifacts
Active

Architected a dual-path Retrieval-Augmented Generation (RAG) system utilizing FastAPI and PostgreSQL to index and query over 100k semi-structured financial research artifacts. Engineered a low-latency hybrid search engine combining HNSW dense vector indexing and BM25 full-text search (tsvector), orchestrated via Reciprocal Rank Fusion to maximize semantic and keyword recall. Designed an autonomous LLM tool-routing layer using Claude's tool-use API to dynamically execute complex SQL aggregations or semantic retrieval paths, accelerating intraday macro-economic analysis.

PythonFastAPIPostgreSQLpgvectorDocker
DrawMyRoute
AI Geospatial Engine
Completed

AI-powered routing engine that converts text, prompts and images into road-matched GPS routes using RAG (GPT-4o/Sentence-Transformers), iterative scaling on SVG paths and self-hosted OSRM with custom profiles to handle 100+ concurrent low-latency requests.

PythonFastAPINext.jsOSRMOpenAI
FoodHeroes
Distributed Surplus Marketplace
Completed

A two-sided community marketplace connecting vendors with surplus food to consumers. Engineered with Vue.js and Firebase, featuring a real-time order lifecycle (Pending → Pickup → Completed), automated inventory synchronization via Cloud Functions, and a 98% sprint velocity over two high-intensity Agile cycles.

Vue.jsFirebase AuthFirestoreCloud FunctionsTailwindCSS
Regime-Adaptive Equity Engine
Volatility-aware systematic trading
Completed

A volatility-aware systematic trading system that dynamically switches between Trend-Following (Calm) and Mean-Reversion (Panic) modules. Engineered with a smoothed-VIX regime detector, achieving a 1.26 In-Sample Sharpe Ratio and 3.5% Max Drawdown through ATR-adjusted risk management and 200SMA trend filters.

PythonPandasyfinanceMatplotlibMonte Carlo Simulation

Education & Awards

Academic record

National University of Singapore
B.Sc. Business Analytics, School of Computing · Graduating Dec 2027
GPA 4.88 / 5.00
🏅 Awards
Dean's List (AY25/26)
NUS School of Computing · AY 2025/26 · Awarded to top 5% of undergraduates at NUS
Certificate →
Top Student – Applications Systems Development
NUS School of Computing · AY 2025/26 · Top 1 in Web Development
Certificate →
Dean's List (AY24/25)
NUS School of Computing · AY 2024/25 · Awarded to top 5% of undergraduates at NUS
Certificate →
Merit Scholarship
NUS · 2024–2027 · Full-tuition award based on academic excellence and leadership
📚 Relevant Coursework
Data Structures & Algorithms (A+)Applications Systems Development (A+)Systematic Trading Strategies (A+)Data Mining & Optimization (A+)Systems Thinking (A+)Accountancy & FSA (A+)Programming Methodology (A)Database Management (A)Linear Algebra (A)Calculus (A)

Skills

Technical skills

Languages & Databases
PythonSQL (PostgreSQL)JavaRJavaScriptHTML/CSSNeo4j
Backend & Concurrency
FastAPIAlembicREST APIsAsyncioMultiprocessingThreading
Data Engineering
PySparkSparkSQLpandasData WarehousingBig Data Analytics
DevOps & Systems
LinuxWSLAzure DevOps (CI/CD)DockerGitDatabricksAirflow

Accountability

What I'm working towards

Live stats from my personal tracker

60
Days to Exchange
28 Aug 2026
103
Days to Lisbon Marathon
10 Oct 2026
🏃 Marathon Training
Week 8 of 22 · Marathon Base SG · Target sub-4:00
27%
adherence
41 sessions completed154 planned · 🏁 Lisbon 10 Oct 2026
🧠 Interview Prep
LeetCode · 62/150 attempted
High confidence
48/150
Medium confidence
4/150
Low / needs revisit
10/150
Whitebox Problems
Solved
2/44
CS Theory
Solid / Mastered
8/31
💼 Internship Pipeline
1
Applied
0
OA
0
Phone
0
Onsite
0
Offer

Contact

Get in touch

Open to software engineering and data roles. The fastest way to reach me is email.