Education

Master of Application Informatics

2021 - 2024

University of Göttingen, Germany

Master Thesis:

Investigated ONNX as a unified middleware to bridge fragmented AI ecosystems (PyTorch, TensorFlow) by evaluating its portability and interoperability across the full AI development lifecycle. Built a custom Transformer Encoder from scratch in PyTorch to classify newly submitted tickets in the GWDG support system and predict the most suitable technical supporter. The data pipeline encompassed dialogue extraction and concatenation, stopword removal and stemming, statistical word-length analysis with Gaussian-fitted cutoff thresholds, word embedding, and multi-strategy dataset splitting. Hyperparameters — learning rate, batch size, number of attention heads, and number of encoder layers — were systematically explored over 300 epochs on the GWDG cluster using A100 GPUs, yielding an optimal configuration (batch size 1, lr 5e-7, 1 head, 4 layers). The model achieved 65% accuracy in recognizing ticket owners among 144 candidates from full conversations, and 22% accuracy when predicting candidates solely from the first submitted question. For portability, the model was exported in five formats — PyTorch state parameters, whole model, checkpoint, TorchScript-ONNX, and Dynamo-ONNX — and retrained across Python, C, C++, Rust, and JavaScript ONNX Runtime environments on both GPU and CPU. TorchScript-ONNX outperformed Dynamo-ONNX in Python but showed the reverse pattern in other languages; Rust exhibited over 2x inference latency compared to C/C++ and JavaScript due to FFI overhead. After reconfiguring the learning rate via ONNX Runtime's exposed API, retraining accuracy improved to 70% (owner recognition) and 25% (candidate prediction), surpassing the original PyTorch baseline. A distributed ONNX-based framework supporting (hyper-)federated learning across heterogeneous devices and programming languages was proposed as a direct outcome of the portability and interoperability findings.

NLP Transformer PyTorch ONNX ONNX Runtime TorchScript Dynamo A100 CUDA C/C++ Rust JavaScript Federated Learning Slurm

Research Intern

2023.10 - 2024.05

  • Implemented the ResNet model family in pure Golang without external AI framework dependencies, deployed compiled binaries on the GWDG cluster using Singularity containers with GPU support. Benchmarked model performance and investigated distributed learning strategies across multiple nodes.
  • Tested HPC benchmarks including IO500, HPL, HPCG, and STREAM to assess system performance. Implemented the MiniBUDE benchmark using OpenMP, Julia, CUDA, OpenACC, and OpenCL on both CPU and GPU architectures.
  • Managed clusters, performed performance analysis, and developed parallel computing solutions with MPI and CUDA. Designed and implemented a distributed learning system using Golang with MPI for inter-process communication.
Golang Gorgonia Singularity Docker HPC benchmark CUDA Linux MPI

Master of Physics

2016 - 2019

University of Göttingen, Germany

Master Thesis:

Simulated particle collisions at the LHC using MadGraph within the Standard Model and Higgs mechanism framework to explore parameter constraints for dark matter candidates predicted by the Inert Doublet Model.

Bachelor of Applied Informatics

2020 - 2021

University of Göttingen, Germany

Completed core computer science coursework and was admitted directly to the master's program.

Operating Systems Algorithms Data Structures Java C/C++

Bachelor of Physics

2009 - 2013

University of Shihezi, China

Bachelor Thesis:

Extended Maxwell's equations through a new formalization of electron spin to account for magnetic monopoles, offering an alternative explanation for the Lorentz force and the Hall effect.

Professional Experience

AI Engineering

2025.11 - 2026.03

Turing College

  • Built multiple end-to-end AI applications with customized tools and systematic prompt engineering, deployed on Kubernetes, managing the full lifecycle from development to production.
  • Integrated MCP to connect LLMs with external tools and APIs.
  • Built RAG pipelines with distributed Qdrant for hybrid search and context-aware retrieval.
  • Managed short- and long-term memory for user personalization, incorporating Human-in-the-Loop feedback mechanisms.
LLM LangGraph LangChain MCP RAG Qdrant Kubernetes OpenClaw

Competition Project: AgentX

2025.04 - 2025.05

GWDG

Built LynxNLI, an assistant agent that simplifies Linux operations in HPC environments through natural language. Served as core developer, owning the project from framework design to tool implementation.

LLM AI Agent Genertive AI LangGraph PydanticAI

Python Developer

2022.10 - 2023.09

GWDG

Built a content management system with integrated authentication, featuring a multi-layer security model for authentication and role-based permission control. Developed backend features for email-based notifications, keyword search, attribute filtering, timed tasks, and REST API endpoints with custom access rules. Owned testing and deployment, covering unit tests, integration tests, and production rollout.

Python Django Javascript Postgresql WSGI Linux RestAPI

Full-stack Web Developer

2022.02 - 2022.09

Eforsch

Built a full-stack platform for digitizing chemical and biological experiments, funded by the NBank Gründungsstipendium in Niedersachsen. Automated experiment workflows including calculations, report generation, and supply tracking. Managed the full project lifecycle as sole developer, from initial implementation through production deployment.

Vue Golang Django SQL Nginx Docker compose Cloud server

Robotics Engineer

2019.12 - 2020.05

Mianyang Lunqi Robotics Co., Ltd.

  • Deployed AI-based visual inspection systems for custom industrial automation applications.
  • Collaborated with university research groups to develop solutions for client-specific requirements.

Publications

Exploration for Distributed Learning Design with Golang
Silin Zhao
Technical ReportUniversität GöttingenApril 2024
Advisors: Sadegh Keshtkar and Julian Kunkel
Improving Portability and Interoperability of Deep-Learning-Workloads using ONNX
Silin Zhao
Journal ArticleApplied Intelligence (Springer Nature)2025
Under Review
Growing Domain-Specific LLMs Through Federated Split-Phase Learning: System Design and Analysis
Silin Zhao
Conference PaperFederated Learning Systems & Applications (FLICS)2026
Accepted

Certificates

2026 AI Engineering Professional Certificate Turing College
2025 Kubernetes Certified Application Developer (CKAD) Udemy
2025 Performance Analysis of AI and HPC Workloads GWDG
2024 Deep Learning Bootcamp: Building and Deploying AI Models GWDG
2022 Gründungsstipendium Nbank Niedersachsen
2021 Certificate of Attendance of GWDG Scientific Compute Cluster GWDG
2019 IBM Data Science Professional Certificate Coursera
2017 PIER Graduate Week 2017 Confirmation of Attendance DESY
2013 Teacher Qualification Certificate (High School Physics) Ministry of Education, PRC
2015 Patent: ZL2011 1 0004207.2 China National Intellectual Property Administration

Specialized Skills

Large Language Models

  • LLM deployment with Ollama for local inference across multiple environments
  • OpenClaw and Hermes agent deployment and orchestration in Kubernetes
  • MCP server implementation for local service exposure and multi-server management
  • AI agent skill management with modular tool integration and custom workflows
  • Aichat project: conversational AI interface with extensible plugin architecture
  • RAG pipelines with hybrid search and context-aware retrieval using Qdrant
  • API Integration: experience with leading commercial APIs and local API implementations

Programming & Tools

Python Golang Rust C/C++ JavaScript PyTorch Django Kubernetes Docker LangChain LangGraph ONNX Slurm MPI Emacs (10+ years)

Languages

English (C1) German (C1) Chinese (Native)