Xijing Wang (Thomas)

SDE · MLE · Research Engineer

Carnegie Mellon University, School of Computer Science

I'm a graduate student at Carnegie Mellon University (School of Computer Science), pursuing an M.S. in Automated Science.

I received my B.S. in Computer Science (emphasis in Algorithms) from Santa Clara University, where I worked with Dr. Lang Chen and was advised by Dr. Nicholas Q. Tran.

My interests lie in AI Agents, ML Systems, and full-stack intelligent applications.

Looking for opportunities in AI Agents, AI Infrastructure, and MLSys. Please reach out at thomasw3@andrew.cmu.edu.

Updates

Last updated: Feb 2026
Upcoming

Incoming Software Development Engineer Intern at Apple.

Summer 2026

Open Source

Open source contributor to mlc-ai/mlc-llm — on-device LLM deployment framework. Used in Headache Note iOS app with quantized LLaMA 3.

Jan — Feb 2026

Open Source

Open source contributor to xthomaswang/OpenOT2 — lab automation framework for the OT-2 liquid handling robot.

Jan — Feb 2026

Publication

Poster at GenAI4Health @NeurIPS 2025 — The Second Workshop on GenAI for Health: Potential, Trust, and Policy Compliance. San Diego, CA.

Dec 2025

Research

Research Engineer at CMU. Built an AI-powered biomedical literature analysis platform with a RAG pipeline and embedding-based semantic search. Fine-tuned the Qwen3 0.6B embedding model on multiple GPUs, achieving strong benchmark results on the PMC-Patients PPR task.

July — Dec 2025

Education

Started graduate studies at Carnegie Mellon University in the MSAS program, School of Computer Science. Dean's Scholarship recipient.

Aug 2025

Publication

Poster at CogSci 2025 — 47th Annual Meeting of the Cognitive Science Society. San Francisco, CA.

July 2025

Research

ML Research Intern at LCCN Lab, SCU. Fine-tuned CNNs and Vision Transformers to test neuroscience hypotheses. Built fMRI data processing and representation analysis pipelines.

2024

Experience

SDE Intern @ Apple

Summer 2026

Cupertino, CA

Incoming Software Development Engineer Intern.

Research Engineer (AI Agent) @ Carnegie Mellon University

July 2025 — Dec 2025

Pittsburgh, PA

Built an AI-powered biomedical literature analysis platform with Quick (<30 s) and Deep (≤3 min) modes, using a RAG pipeline with parallel retrieval from PubMed and medRxiv. Fine-tuned the Qwen3 0.6B embedding model on multiple GPUs, achieving strong benchmark results on the PMC-Patients PPR task. Poster at the NeurIPS 2025 Workshop on GenAI for Health.

ML Engineer Intern @ LCCN Lab, Santa Clara University

June 2024 — Sep 2024

Santa Clara, CA

Developed CNN models for neuroscience research. Preprocessed fMRI data for an HPC platform. Paper accepted at CogSci 2025; under review at Communications Biology.

Education

Carnegie Mellon University

Aug 2025 — May 2027

M.S. in Automated Science

School of Computer Science · Pittsburgh, PA

Dean's Scholarship

Santa Clara University

Sep 2021 — Mar 2025

B.S. in Computer Science, Minor in Economics

College of Arts and Sciences · Santa Clara, CA

REAL Program Scholar

Skills

Languages
Python, C++, Swift, JavaScript, Java, Go, Scala
AI / ML
PyTorch, TensorFlow, CoreML, LLM, MLSys, OpenCV, NumPy, Pandas
Mobile
SwiftUI, UIKit, React Native, Redux, Core Data
Backend
FastAPI, Node.js, Gin (Go), PostgreSQL, MySQL, SQLite, Firebase
Concepts
MVVM, RESTful APIs, CI/CD, HPC, Data Privacy

Projects

Clinical Copilot — Medical Literature Analyzer

Research

AI-powered literature analysis system using a RAG pipeline over PubMed and medRxiv. Fine-tuned the Qwen3 0.6B embedding model on multiple GPUs, achieving strong benchmark results on the PMC-Patients PPR task. Poster at the NeurIPS 2025 Workshop on GenAI for Health.

Python RAG Transformers Qwen3 Multi-GPU FastAPI

mlc-ai/mlc-llm — On-Device LLM Deployment

Open Source

Contributor to MLC-LLM, an open-source framework for deploying LLMs on-device across platforms (iOS, Android, Web). Used the framework to enable on-device inference with quantized LLaMA 3 models in the Headache Note iOS app.

MLSys LLM C++ Python iOS

OpenOT2 — Open-Source Lab Automation

Open Source

Open-source contribution to OpenOT2, a lab automation framework for the OT-2 liquid handling robot that enables reproducible scientific workflows through programmable protocols.

Python Robotics Automation Lab

Headache Note — Offline iOS AI Agent

Software

Fully offline iOS AI agent built by quantizing LLaMA 3 1.6B with MLC-LLM, optimized for iPhone. Swift-based health assistant with prompt engineering and SwiftData for private, on-device inference and personalized lifestyle recommendations.

Swift MLC-LLM MLSys PyTorch SwiftUI

Neurocomputational Basis of Face Recognition in ASD

Research

Research on face recognition changes in ASD using CNNs (including ResNet50) and Vision Transformers as computational frameworks. Built fMRI data processing pipelines and Pearson correlation analyses. Poster at CogSci 2025; paper under review at Communications Biology.

Python TensorFlow Deep Learning Neuroscience HPC

MasumiRanker — AI Agent Platform (Hackathon)

Software

AI agent discovery platform with natural language semantic search using Sentence Transformers and Faiss for efficient similarity matching. FastAPI + SQLAlchemy backend with user ratings, recommendation logging, and SHA-256 data integrity.

Python FastAPI Faiss Transformers React

Food Recognition iOS App

Software

SwiftUI iOS app with MVVM architecture, integrating a TensorFlow-trained CoreML model for food classification with 85% accuracy across 100+ categories. Made inference 4x faster and reduced memory usage by 40%.

Swift SwiftUI TensorFlow CoreML Python

EmojiAndEmotion — Health Tracking App

Software

Full-stack iOS app with React Native, Redux, and SQLite. Cut data query latency by 75% (200 ms to 50 ms). Integrated Apple HealthKit for real-time HRV metrics with 98% accuracy. 90% test coverage.

React Native Redux SQLite HealthKit

Social Networking Website

Software

Full-stack web server built on the Go Gin framework, with a RESTful API, a JSON-based database for efficient data management, and a scalable commenting system.

Go Gin JavaScript HTML/CSS
