Apr 2024 — Present

Senior Data Consultant

Soyoma · Austin, Texas

Engineered predictive risk indicators using Python (pandas, NumPy) and advanced SQL in Snowflake, significantly reducing manual review overhead.
Designed scalable ELT pipelines and optimized Snowflake queries (CTEs, window functions, warehouse scaling) to drive major processing efficiency gains.
Conducted exploratory data analysis and hypothesis testing to uncover pricing and risk drivers, presenting actionable insights to cross-functional stakeholders.
Designed and deployed production-grade LLM pipelines processing large document datasets with RAG architectures.
Owned full ML lifecycle: data ingestion, model deployment, API integration, monitoring, CI/CD, and performance evaluation.
Oct 2023 — Mar 2024

Data Scientist

Solenix · Darmstadt, Germany

Built time-series anomaly detection models on satellite telemetry data using statistical thresholds and forecasting techniques to proactively identify system deviations.
Evaluated model performance using precision/recall metrics and reduced false positives through iterative threshold optimization.
Jan 2022 — Oct 2023

Associate Data Scientist

Reply · Berlin, Germany

Built end-to-end data processing pipelines handling ingestion, validation, cleaning, and transformation of high-volume data from multiple disparate sources using Python, SQL, and AWS Glue.
Implemented data quality checks at every pipeline stage, ensuring near-perfect accuracy for business-critical calculations.
Analyzed and resolved data discrepancies by interfacing directly with internal teams and external stakeholders.
Research

Research Assistant — Human-Robot Collaboration

Fraunhofer IPA · Stuttgart, Germany

Applied contrastive unsupervised learning to extract meaningful representations from high-dimensional image data without labeled datasets.
Built latent feature representations that enhanced deep reinforcement learning policies, reducing training latency and accelerating robot learning.
Worked at the intersection of computer vision, representation learning, and robotics control in real-world physical environments.
Sep 2019 — Dec 2021

Data Scientist

Nokia · Ulm, Germany

Developed SQL-based analytical dashboards in Tableau to identify defect and risk trends, accelerating investigation cycles.
Built forecasting models using XGBoost and reliability models for manufacturing analytics using regression and time-series techniques.
Conducted statistical analysis to identify drivers of production variability and support process optimization initiatives.

Skills & Technologies

ML & AI

Time Series ForecastingAnomaly DetectionXGBoostSupervised LearningFeature EngineeringModel EvaluationLLM FundamentalsRAG Pipelines

Analytics & Statistics

A/B TestingHypothesis TestingCausal AnalysisRegressionClassificationKPI DesignCohort Analysis

Data Engineering

SnowflakeAWS GlueSparkETL/ELTData ModelingSQLPython (pandas)

Visualization

TableauSQL ReportingExcel Dashboards

Infrastructure & DevOps

DockerKubernetesTerraformCI/CDMLflow