Jack Stehn

Backend AI Engineer | Real-time Systems & Data Infrastructure

San Francisco, California, US
(415) 787-7975

Background


About

Experienced Backend Engineer specializing in high-performance AI infrastructure. I bring a strong work ethic and deep technical ability, proven in independently designing and deploying scalable data pipelines and building robust APIs for complex inference workloads. Passionate about shipping production-grade AI systems collaboratively.

Work Experience

  • Data Scientist (ML, Data Engineering, MLOps), Caliber Public Schools

    Sep, 2024 - Present

    • Owned the full data lifecycle, modernizing GCP data infrastructure (BigQuery, GCS, Dagster, dbt) to deliver actionable insights.

    • Engineered People Team data pipeline, integrating rigorous validation & testing; reduced manual checks from months to seconds.

    • Developed/deployed production-grade predictive risk models for staff turnover, enabling proactive retention strategies.

    • Translated complex data findings into clear, actionable reports for diverse technical & non-technical leaders.

  • Data Scientist (ML, Data Engineering, MLOps), SetSail

    Aug, 2021 - Feb, 2023 (1 year 6 months)

    • Led an AWS data pipeline overhaul that reduced processing by 75% and scaled 4x (TBs) for LLM integration while ensuring data integrity.

    • Developed/deployed production ML models, delivering actionable insights; contributed to 33% faster ramp & 16% higher revenue.

    • Independently architected scalable data solutions (star schema, optimized DAGs, async ingestion) with robust testing & CI/CD.

    • Collaborated cross-functionally to translate complex data problems into actionable solutions for diverse stakeholders.

  • Data Science Research Team Lead, UC Berkeley School of Public Health

    Sep, 2020 - May, 2021 (8 months)

    • Led data science in mixed-methods studies, applying causal inference & qualitative analysis.

    • Developed novel data processing for unstructured/geospatial datasets; communicated findings via interactive dashboards.

Projects Experience

  • ResumeLLM (AI Agent for Resume Tailoring)

    - Present

    • Developed robust, agent-based system for precise, context-aware resume customization.

    • Designed and implemented internal APIs for seamless system integration and data flow.

    • Leveraged LangChain, LangGraph for sophisticated AI agent orchestration, multi-step reasoning.

    • Applied NLP, LLM techniques to parse job requirements, generate tailored content.

  • Dagster People Team Pipeline

    - Present

    • Architected scalable ETL pipeline using Dagster for HR data workflows into BigQuery.

    • Automated data consistency checks, reducing manual validation from months to seconds.

    • Enabled rich longitudinal analysis, predictive turnover models for talent planning.

Skills

  • Backend & Systems

    Python FastAPI

    Async Programming

    API Microservices

    System Design

    Performance Optimization

    Data Streaming

  • Data Engineering & Cloud

    GCP, AWS Cloud

    Docker Containers

    PostgreSQL DB

    Data Pipelines

    Data Validation

    CI/CD, Testing

  • MLOps, AI & Insights

    ML Inference

    Model Rollout

    Shadow Testing

    LLM Deployment

    Actionable Data

    Metrics & Logs

Education

  • Data Science, Bachelor of Arts, UC Berkeley

    Aug, 2019 - May, 2021

    GPA: 4.00/4.00