×
Lea Yeh

Lea Yeh

Data Engineer & Scientist

Austria, Vienna, AT
+4367761670726
Chinese, Taiwanese, English, German

Background


About

About

I hold a Master’s degree in Computer Science and possess extensive expertise spanning the full data spectrum, including roles as a Data Engineer, Data Scientist, and Data Analyst.
I am committed to excellence in the data domain, with a focus on DevOps, MLOps, and data warehouse design.

Work Experience

Work Experience

  • Data ScientistMediaTek

    Sep, 2022 - Sep, 20231 year

    • Devoted data-related initiatives spanning cost control and chip development in the AI & Big Data department of a semiconductor advanced process company.
    • Implemented machine learning models to enhance mobile temperature control mechanisms, achieving temperature errors below 1°C.
    • Collaborated with chip developers to reduce power consumption by 20%, resulting in a 14K Antutu score improvement in mobile performance.
  • Data EngineerMediaTek

    Jul, 2019 - Sep, 20223 years 2 months

    [Data Pipeline]

    • Established automated data pipelines for structured and unstructured data, and designed PB-level data ETL processes.
    • Introduced Alibaba's Data Warehouse theory, enhancing data table reuse rates during the company's early digital transformation.
    • Implemented real-time data quality monitoring, reducing data missing rates from 50% to <1% and lowering monthly labor costs by 7.5 man-days.
    • Coordinated cloud analytics pipelines using NiFi, Airflow, Dataflow, and BigQuery.

    [Data Analysis]

    • Managed EDA License and Computing Farm costs, leading to an interactive BI Dashboard using Splunk, aiding EO procurement decisions and reducing costs by 25%.
  • Software EngineerMediaTek

    Sep, 2016 - Jun, 20192 years 9 months

    • Developed debugging and analysis tools for Modem Logs.
    • Created a World Wide Field Trial Upload Tool using Vue.js with Electron.
    • Developed an automated ICD DMS using Python and Jenkins to adhere to documentation standards.
    • Developed a StackOverflow-like QA platform using AngularJS.
Projects Experience

Projects Experience

  • CDNJS (Content Delivery Network for JavaScript)

    Sep, 2015 - Sep, 20161 year

    CDNJS is a free and open-source content delivery network for JavaScript libraries. It is used by over 3.5 million websites, serving over 30 billion requests per month.

    • Served over 30 billion requests per month.

    • Used by over 3.5 million websites.

  • minishell (Implemented a simple shell)

    Dec, 2023 - Mar, 20243 months

    minishell reimagines traditional shell functionalities by emulating sophisticated features of GNU Bash. This project delves into system programming and software architecture to provide a robust interpreter modeled on Bash standards.

    • Developed a syntax analizer with the shift-reduce algorithm for interpreting commands using Bash-like grammar.

    • Employed Docker to ensure consistent development environments across the team, reducing setup inconsistencies.

    • Contributed to all phases of the software development lifecycle, optimizing subprocess management for enhanced command execution.

Skills

Skills

  • Data Engineering

    Big data pipeline

    Data Warehouse

    Data Architectural

    ETL

    Data Quality Monitoring

    Google Cloud Platform (GCP)

  • Data Science

    Machine Learning

    Deep Learning

    High-dimensional Clustering

    Explainable AI

    Model Evaluation

    Model Deployment

  • Data Analysis

    BI Dashboard

    Plotly

    Splunk

    Data Visualization

  • Software Development

    System Design

    Design Patterns

    System Architecture

    Agile

  • Python

    OOP

    Pytorch

    Streamlit

    Pandas

    Pythonic

  • SQL

    MySQL

    BigQuery

  • DevOPS

    Git

    Docker

    CI/CD

    GitHub Actions

    Git Flow

  • C/C++

    Performance Optimization

    Multiprocessing

    Parallel Computing

Education

Education

  • Computer Science, Professional development, 42 Vienna

    Sep, 2023 - Sep, 2024

    System Programming

    C/C++

    DevOPS

  • Computer Science, Master of Data Mining, National Chiao Tung University

    Dec, 2014 - Dec, 2016

  • Computer Science, Bachelor, Tatung University

    Dec, 2010 - Dec, 2014

Awards

Awards

  • IT annual award , Mediatek

    Awarded on: Dec 01, 2020

    Recognized with the IT Annual Award at MediaTek.

Volunteer Work

Volunteer Work

  • Volunteer Organizer, Taiwan in Data Science (TWiDS)

    Oct, 2023 - Jun, 2024

    Serving as one of the organizers at Taiwan in Data Science (TWiDS), a volunteer organization dedicated to promoting data-related fields in Taiwan. Taking an active role in promoting awareness and cultivating a deeper understanding of data science across Taiwan. Spearheading the organization's preparations for workshops, podcasts, and conferences in 2024.

    • Promoting awareness and understanding of data science across Taiwan.

    • Leading preparations for significant events including workshops, podcasts, and conferences planned for 2024.

Publications

Publications

  • Clustering using Radius-Weighted Means and Analytical Radius-Preserved Formula, NCTU

    Published on: Jun 01, 2016