Simone Benitozzi

Resume

Simone Benitozzi

Data & Software Engineer

Download PDF

Data & Software Engineer with 5+ years building reliable data infrastructure at scale. Specialized in real-time streaming systems, ML pipelines, and cloud data platforms. Experience spanning backend engineering to data platform architecture — with a track record of shipping systems that run quietly in production while teams focus on what matters.

Experience

Senior Data Engineer

Current Company

2024 – Present
  • Designed and deployed real-time ingestion pipeline handling 12M+ events/day with 99.97% uptime
  • Built ML feature store unifying 40+ features across 8 production models, reducing computation time by 63%
  • Led architecture review for streaming infrastructure migration to AWS MSK

Data Engineer

Previous Company

2022 – 2024
  • Migrated 10TB+ data warehouse from Redshift to Snowflake with zero downtime and 41% cost reduction
  • Built ETL orchestration platform with Airflow managing 120+ DAGs in production
  • Reduced pipeline failures by 78% through comprehensive monitoring and alerting

Backend Engineer

Early Career

2020 – 2022
  • Built REST APIs and data ingestion services in Python/FastAPI
  • Contributed to migration of monolithic services to microservices architecture
  • Started moving into data engineering through ETL and batch processing projects

Education

MSc Computer Science

Thesis: Stream Processing Performance in Distributed Systems

2016–2020

Skills

Languages

PythonTypeScriptSQLScala

Data & Streaming

Apache SparkApache KafkaApache FlinkAirflowdbt

Cloud

AWS (EMR, MSK, S3, Glue, Lambda)GCP (BigQuery, Dataflow)Terraform

Infrastructure

DockerKubernetesCI/CD (GitHub Actions)PrometheusGrafana

Databases

PostgreSQLSnowflakeRedshiftDynamoDBRedis

ML & AI

Feast (Feature Store)MLflowLangChainOpenAI API

Interested in working together?

Open to the right opportunities — interesting problems, strong teams.

Get in touch