Active Top Secret Clearance with SCI access. Data Engineer with extensive expertise in cloud technologies, including AWS and Databricks. Skilled in designing and optimizing data workflows using Spark (PySpark, Spark SQL) and Delta Lake. Proficient in Python and SQL, delivering robust, scalable solutions for complex data challenges.
As Senior Data Engineer for Customs and Border Protection (CBP) under the United States Department of Homeland Security (DHS), spearheaded the optimization and modernization of data pipelines in Databricks on AWS. Leveraged Delta Lake, PySpark, and Spark SQL to implement scalable solutions for real-time and batch data processing, while streamlining legacy data workflows. Regularly collaborated with internal teams and external database administrators to ensure data quality, accessibility, and security across the organization.
Led data engineering initiatives for the Command Chief Digital and Artificial Intelligence Office (CDAO) of USSOCOM. Contributed to the team's adoption of Databricks on AWS by leveraging expertise in Spark, PySpark, and Spark SQL. Collaborated closely with data scientists to process and analyze classified data, while also driving knowledge sharing, cloud migration, and operational efficiency. Focused on supporting mission-critical objectives through innovative workflows and scalable solutions.
Designed and implemented scalable data solutions as a Data Engineer. Migrated legacy data systems to modern cloud environments, developed tools to automate critical processes, and optimized workflows to reduce operational bottlenecks. Collaborated with directors and data science teams to support efficient data storage, analysis, and quality assurance processes. Proficient in reading and understanding Java code, experienced with Hadoop and MapReduce, familiar with access modifiers and class structures. Proficient in reading and understanding C# applications, familiar with its similarities to Java. Used SQL Alchemy to create applications in Python that utilized PostgreSQL and SQLite databases.
Promoted to the position to develop Python applications to enhance production initiatives and quality control functions. Primary duties included auditing current SPSS code, researching fluctuations in trends, and translating new survey questions into Spanish. Proofread company-wide documentation. Analyzed mobile carrier and device data for market penetration and other components. Reported findings to product managers in the form of Tableau and Excel visualizations.
Developed a web-based application using HTML/CSS, JavaScript, and MySQL to store and manage survey questions and responses. Utilized natural language processing (NLP) techniques to classify survey questions and answers into different contexts. Designed and implemented a MySQL database to store categorized survey questions and answers. Received an award for developing the application. Applied NLP techniques to categorize verbs, nouns, and other elements within survey questions. Demonstrated strong teamwork and project management skills.