Lea (Mei Ling) Yeh

Sr. Software Engineer | Data Engineer | MLOps | Systems Architecture

Vienna, Austria, 1190
+4367761670726
Chinese, Taiwanese, English, German

Background


About

I hold a Master's degree in Computer Science and have 8+ years of experience spanning Data Engineering, Data Science, and Software Engineering. Most recently, as a Software/Solution Architect at c-sense GmbH, I design instrument control systems and collaborate with scientists and hardware engineers to integrate nanoscale measurement devices.

Alongside my current role, I continue to deepen my foundation at 42 Vienna, focusing on Linux kernel internals and DevOps — building the system-level expertise to transition into AI platform and MLOps engineering roles.

Work Experience

  • Senior Software Engineer, c-sense GmbH

    Aug, 2024 - Present

    c-sense develops nanoscale sensor technology and AFM/SPM instruments for scientific and industrial applications.

    • Architected a modular, layered software framework for SPM instrument control — deliberately separating hardware vendor, orchestration, and application layers for long-term extensibility and reuse across future instrument variants

    • Defined cross-layer interface contracts and async command routing patterns to enable parallel development by hardware and software teams, reducing integration risk during concurrent R&D cycles

    • Led the design of a hardware abstraction strategy over a LabVIEW-based proprietary API, keeping the software stack agnostic to vendor-specific instrument changes

    • Collaborated with hardware engineers and research scientists to translate physics measurement protocols into reliable, automated software workflows

    • Delivered automated measurement capabilities (spatial sampling, real-time safety management) that allow researchers to focus on experimental outcomes rather than instrument operation

  • Data Scientist, MediaTek

    Sep, 2022 - Sep, 2023 (1 year)

    Devoted to data-related initiatives spanning cost control and chip development in the AI & Big Data department of a leading global semiconductor company.

    • Implemented ML models to enhance mobile temperature control mechanisms, achieving temperature errors below 1°C

    • Collaborated with chip developers to reduce power consumption by 20%, resulting in a 14K Antutu score improvement in mobile performance

    • Implemented model monitoring solutions using AWS and Streamlit, ensuring high availability and performance consistency

  • Data Engineer, MediaTek

    Jul, 2019 - Sep, 2022 (3 years 2 months)

    Built data infrastructure and analytics platforms for one of the world's largest IC design companies.

    • Established automated data pipelines for structured and unstructured data; designed PB-level ETL processes using Airflow, NiFi, Dataflow, and BigQuery on GCP

    • Introduced Alibaba's Data Warehouse theory, enhancing data table reuse rates during the company's early digital transformation

    • Implemented real-time data quality monitoring and alerting, reducing the missing-data rate from 50% to <1% and saving 7.5 person-days of labor per month

    • Built model monitoring infrastructure using MLflow and Grafana dashboards to track ML model performance and data drift in production

    • Managed EDA License and Computing Farm costs via interactive BI Dashboard (Splunk + Grafana), aiding EO procurement decisions and reducing costs by 25%

  • Software Engineer, MediaTek

    Sep, 2016 - Jun, 2019 (2 years 9 months)

    • Developed debugging and analysis tools for Modem Logs

    • Developed an automated ICD DMS using Python and Jenkins to enforce documentation standards

    • Created a World Wide Field Trial Upload Tool using Vue.js with Electron

    • Developed a StackOverflow-like QA platform using AngularJS

Projects Experience

  • ft_transcendence — Full-Stack Platform with Observability Stack

    Oct, 2025 - Mar, 2026 (5 months)

    Contributed the DevOps module for a full-stack multiplayer web platform (42 Vienna capstone project). Responsible for designing and deploying the complete observability infrastructure covering metrics, logging, and alerting.

    • Deployed Prometheus for metrics collection and configured alerting rules for service health and performance thresholds

    • Built Grafana dashboards for real-time visibility into application and infrastructure metrics across the platform

    • Set up ELK stack (Elasticsearch, Logstash, Kibana) for centralized log aggregation and search across distributed services

    • Containerized the full observability stack with Docker Compose, enabling reproducible deployment and local development parity

  • ft_linux — Linux From Scratch

    Sep, 2025 - Present

    Built a fully bootable Linux system from scratch, covering every layer from cross-compilation toolchain to kernel configuration, filesystem hierarchy, init system, and bootloader. Follows LFS/BLFS/ALFS methodology with a custom automation layer.

    • Compiled a custom Linux kernel (4.x) with hand-selected driver and filesystem configuration

    • Built a two-phase cross-compilation toolchain (temporary + final) to produce a host-independent, self-contained Linux system

    • Automated the full build pipeline (14 stages) via an ALFS-style bootstrap script with environment isolation and error recovery

    • Designed partition layout, configured GRUB bootloader, SysV init, and udev for dynamic device management

  • Inception-of-Things — K3s Kubernetes Cluster

    Jun, 2025 - Oct, 2025 (4 months)

    Provisioned and configured a multi-node K3s (lightweight Kubernetes) cluster using Vagrant and VirtualBox, covering cluster networking, ingress, and GitOps-style deployment workflows.

    • Deployed a multi-node K3s cluster with Vagrant-provisioned VMs, including nested virtualization setup

    • Configured Kubernetes ingress, service routing, and workload deployment across cluster nodes

    • Applied GitOps principles with ArgoCD for continuous deployment in the bonus track

  • libftpp — C++ Architecture Library

    Oct, 2025 - Present

    A modern C++20 library built through structured exercises practicing SOLID principles, hexagonal architecture, and design patterns. Includes Architectural Decision Records (ADRs) documenting trade-off reasoning.

    • Implemented core data structures and utilities applying SOLID principles and hexagonal (ports & adapters) architecture

    • Documented architectural decisions via ADRs, treating design trade-offs as first-class engineering artifacts

    • Maintained CI pipeline with CMake, clang-format, and automated test suite

  • litetorch + litetune + sklite — AI Framework Toolkit

    Mar, 2025 - Jun, 2025 (3 months)

    A trio of educational ML infrastructure projects: litetorch (neural network framework built from scratch inspired by PyTorch), litetune (hyperparameter tuner inspired by Ray Tune), and sklite (ML preprocessing toolkit inspired by scikit-learn).

    • Implemented forward/backpropagation, autograd, and layer abstractions from scratch in litetorch

    • Built a hyperparameter search and experiment tracking system in litetune, mirroring Ray Tune's trial management model

    • Designed preprocessing pipelines and utility functions in sklite for educational clarity and extensibility

  • webserver — HTTP/1.1 Server in C++

    Jun, 2024 - Sep, 2024 (3 months)

    Implemented a standards-compliant HTTP/1.1 web server in C++ from scratch, handling concurrent connections, request parsing, and static/dynamic content serving.

    • Built non-blocking I/O event loop handling concurrent HTTP connections using poll/select

    • Implemented HTTP/1.1 request parsing, routing, and response generation

    • Supported CGI execution, static file serving, and configurable virtual hosts

  • minishell — Bash-compatible Shell

    Dec, 2023 - Mar, 2024 (3 months)

    Reimplemented core GNU Bash shell features including command parsing, process management, and built-in execution.

    • Developed a syntax analyzer using the shift-reduce algorithm for Bash-like grammar interpretation

    • Optimized subprocess management and pipeline execution across the full SDLC

    • Employed Docker to ensure consistent development environments across the team

  • CDNJS — Content Delivery Network for JavaScript

    Sep, 2015 - Sep, 2016 (1 year)

    Open-source CDN for JavaScript libraries used by over 3.5 million websites, serving 30+ billion requests per month.

    • Contributed to a platform serving over 30 billion requests per month across 3.5 million websites

Skills

  • Systems Architecture

    Event-driven architecture

    Layered system design

    SOLID principles

    Hardware abstraction

    Interface design

    Design Patterns

  • Data Engineering

    Databricks

    Big data pipeline

    Data Warehouse

    ETL

    Data Quality Monitoring

    Google Cloud Platform (GCP)

    Apache Airflow

    NiFi

    BigQuery

    Dataflow

  • MLOps & DevOps

    Kubernetes (K3s)

    Vagrant / VM provisioning

    MLflow

    Model monitoring

    Model deployment

    Prometheus

    Grafana

    ELK stack (Elasticsearch, Logstash, Kibana)

    Splunk

    CI/CD

    Docker

    GitHub Actions

    Git Flow

    Apache Airflow

  • Data Science

    Machine Learning

    Deep Learning

    Explainable AI

    Model Evaluation

    PyTorch

    Streamlit

  • Python

    OOP

    PyTorch

    Streamlit

    Pandas

    Pythonic

  • SQL

    MySQL

    BigQuery

  • C/C++

    Performance Optimization

    Multiprocessing

    Parallel Computing

    Linux kernel

  • Software Quality & Testing

    ISTQB CTFL v4.0

    Test design techniques

    Test planning

    Defect management

    SDLC quality assurance

  • Data Analysis

    BI Dashboard

    Splunk

    Plotly

    Data Visualization

Education

  • Computer Science — Software Architecture, Linux Kernel & DevOps, Ongoing Professional Development, 42 Vienna

    Sep, 2023 - Present

    Software Architecture — SOLID principles, hexagonal architecture, ADR-driven design (libftpp)

    Linux Kernel Internals — built Linux from scratch: toolchain, kernel compilation, bootloader, init (ft_linux)

    DevOps & Container Orchestration — K3s, Kubernetes, Vagrant, GitOps/ArgoCD (Inception-of-Things)

    System Programming — HTTP server, shell interpreter in C/C++ (webserver, minishell)

    AI Framework Internals — neural net framework, hyperparameter tuner, ML toolkit from scratch (litetorch, litetune, sklite)

  • Computer Science, Master of Data Mining, National Chiao Tung University

    Sep, 2014 - Jun, 2016

    Data Mining

    High-dimensional Clustering

  • Computer Science, Bachelor, Tatung University

    Sep, 2010 - Jun, 2014

    Computer Vision

    Algorithms

Certificates

Awards

  • IT Annual Award, MediaTek

    Awarded on: Dec 01, 2020

    Recognized for impact on data infrastructure and cost reduction initiatives.

Volunteer Work

  • Peer Tutor & Workshop Host, 42 Vienna

    Jul, 2024 - Oct, 2024

    Volunteered as a peer tutor at 42 Vienna, supporting fellow students in system programming and software engineering fundamentals. Designed and hosted a Git workshop for the student community.

    • Hosted a Git Essentials workshop — 'Something You Should Know Before Git Branch' — covering branching strategy, rebase, conflict resolution, and collaborative workflows (slides: https://docs.google.com/presentation/d/13InmNDRSfkeUnGWHNXWFiTr3QCAz4ecFL_wFz-NFdoI/edit?usp=sharing)

    • Provided peer tutoring in C/C++ system programming, shell scripting, and software architecture concepts

  • Volunteer Organizer, Taiwan in Data Science (TWiDS)

    Oct, 2023 - May, 2024

    Served as one of the organizers at Taiwan in Data Science (TWiDS), a volunteer organization dedicated to promoting data-related fields in Taiwan.

    • Promoted awareness and understanding of data science across Taiwan

    • Led preparations for workshops, podcasts, and conferences in 2024

Publications

  • Clustering using Radius-Weighted Means and Analytical Radius-Preserved Formula, NCTU

    Published on: Jun 01, 2016