cv

Basics

Name Yike (Eric) Tan
Label Software Engineer
Email yiketn@gmail.com
Phone +1-412-583-0350
Url http://likegiver.github.io
Summary Graduate student at Carnegie Mellon University specializing in Artificial Intelligence Engineering - Information Security with expertise in AI systems, full-stack development, and scalable software solutions.

Work

  • 2025.05 - 2025.07
    Software Engineer Intern
    Instance Creator LLC
    Developed advanced automation systems and semantic matching platforms for streamlining application workflows.
    • Spearheaded a vector-based semantic matching system using Qdrant, boosting matching accuracy to over 80%.
    • Developed a multi-agent automation platform with AutoGen and gpt-4o to orchestrate complex application workflows, reducing manual effort by 85%.
    • Designed and implemented a scalable microservices backend using async FastAPI to support over 1,000 concurrent users; optimized performance by introducing a Redis caching layer, reducing external API latency and call frequency by 80%.
  • 2024.07 - 2024.08
    Software Engineer Intern
    Epoching AI
    Enhanced AI model performance and deployed scalable watermark removal services.
    • Achieved a 10x inference speedup and 30% accuracy boost for a watermark removal service using NVIDIA TensorRT.
    • Developed a data generation and augmentation pipeline, improving model robustness against semi-transparent watermarks.
    • Containerized the service using Docker and deployed it on Cloud ECS, implementing Prometheus for real-time monitoring and ensuring high availability to handle 500+ concurrent requests.
  • 2023.09 - 2024.03
    Software Engineer Intern
    Ytell Network Technology Co., Ltd.
    Built high-performance multimodal systems and automated data processing pipelines.
    • Developed a high-performance multimodal RAG chatbot with FastAPI, increasing inference throughput by 5x using Flash Attention 2 and cutting retrieval errors by 40% through a hybrid search strategy.
    • Automated the processing of 1,000+ daily orders with a data pipeline using Apache Airflow and MongoDB, eliminating over 5 hours of weekly manual work for each store manager.

Education

  • 2023.08 - 2025.12

    Pittsburgh, PA

    Master of Science
    Carnegie Mellon University
    Artificial Intelligence Engineering - Information Security
    • LLM Systems
    • LLM Methods and Applications
    • AI Systems and Tool Chains
    • Computer Systems
  • 2020.08 - 2024.07

    Beijing, China

    Bachelor of Engineering
    University of International Business and Economics
    Data Science and Big Data Technology
    • Operation Systems
    • Data Structure and Algorithm
    • Natural Language Processing
    • Big Data Analysis

Projects

  • 2025.07 - 2025.08
    ImaginAItion: AI Literacy Game
    Engineered a real-time multiplayer AI literacy game with React (TypeScript) frontend and FastAPI backend for CMU HCI.
    • Leveraged Socket.IO over WebSockets to ensure low-latency (<150ms) state synchronization for 50+ concurrent players.
    • Implemented a stateful backend game loop to manage turn-based player progression, integrating the gpt-image-1 API for real-time, dynamic image generation.
    • Orchestrated the full-stack application with Docker Compose for reproducible local development and deployed the containerized services to Amazon ECS for a scalable production environment.
  • 2025.06 - 2025.08
    Silent Supporter: Scalable AI Multimodal Therapy Platform
    Built a scalable therapy platform on a Node.js backend for Tsinghua AIR with advanced AI capabilities.
    • Reduced real-time generation latency by 80% via WebSockets; utilized PostgreSQL for primary data storage and Redis for session caching to improve performance.
    • Achieved targeted music style transfer by fine-tuning the InspireMusic 1.5B model on a custom-built dataset, and developed a WebGL engine to dynamically visualize conversational emotion in real-time.
    • Designed a low-latency, event-driven architecture using RabbitMQ for asynchronous, non-blocking multimodal generation; separately optimized the core AI model's inference speed by 3x through ONNX Runtime graph optimization.
  • 2025.02 - 2025.03
    MovieRec – End-to-End Recommendation System
    Architected a full-stack recommendation platform with comprehensive MLOps workflow.
    • Integrated a Vue.js frontend with a backend based on Flask API and SpringBoot; Leveraged MySQL and Redis to handle the data persistence layer.
    • Trained Random Forest model using Scikit-learn, conducted offline evaluation with train-validation splits and online evaluation with telemetry data to assess model performance; managed A/B testing experiments with MLflow.
    • Established a containerized MLOps workflow, simulating a high-availability architecture with Minikube and implementing a Jenkins CI/CD pipeline with Prometheus/Grafana for automated deployment and real-time monitoring.
  • 2024.05 - 2024.06
    Self-LLM: Open-Source LLM Deployment Guide
    Co-authored and maintained a leading open-source guide simplifying LLM deployment with 23.2k GitHub stars.
    • Led to feature in the keynote presentation at Google I/O Connect China 2024.
    • As a core contributor, authored key tutorials on GLM4 deployment with vLLM, LangChain integration, and LoRA fine-tuning to simplify the learning curve for developers.

Skills

Programming Languages
Python
C
CUDA
C++
Java
JavaScript
SQL
R
Packages and Frameworks
PyTorch
TensorFlow
Triton
Jax
SparkML
NumPy
Pandas
React
Node.js
Django
Flask
Tools
Git
Docker
Kubernetes
Kafka
Neo4j
Amazon Web Service
Google Cloud Platform
Unix
LaTeX

Languages

English
Fluent
Chinese
Native