Senior Data Engineer · Team Lead · Orlando, FL

I build real-time systems & autonomous trading tools.

I build production-grade data infrastructure at a Fortune 500 financial services firm and architect Florin, Zeitgeist, and Oracle, a suite of autonomous trading and intelligence tools.

Florin P&L
+$0.00
Kalshi prediction markets

Win Rate
0.0%
Across all resolved trades

Zeitgeist Entities
95
Tracked in real-time pipeline

Daily Pipeline Volume
~200GB
Processed through Kafka + Flink

About

Beyond building systems, I lead a team of three engineers as Senior Data Engineer and Team Lead at a Fortune 500 financial services firm. I own the team's architecture decisions, run PI Planning and sprint ceremonies, and mentor engineers through personalized development goals. Projects I've driven include a hard-deadline M&A data integration and a full Informatica-to-AWS ETL migration.

Projects

Live

Florin

Autonomous prediction market trading bot operating on Kalshi. Ingests signals from Zeitgeist, executes trades, and manages risk with no human intervention.
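
For a sense of what "manages risk with no human intervention" can mean in practice, here is a minimal sketch of a position-sizing gate. It is an illustration under stated assumptions, not Florin's actual code: the `RiskLimits` fields, the `contracts_to_buy` helper, and the fractional-Kelly rule are all hypothetical choices. It relies only on the fact that a binary prediction market contract is priced between 1 and 99 cents and pays out $1 if it resolves in your favor.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class RiskLimits:
    """Hypothetical risk settings; the values are illustrative only."""
    min_edge: float = 0.05            # required gap between model probability and price
    kelly_fraction: float = 0.25      # share of the full Kelly stake to actually risk
    max_stake_fraction: float = 0.02  # hard cap per trade, as a share of bankroll


def contracts_to_buy(model_prob: float, price_cents: int, bankroll_cents: int,
                     limits: RiskLimits = RiskLimits()) -> int:
    """Return how many contracts to buy, or 0 if the trade fails the gate."""
    if not 0.0 <= model_prob <= 1.0:
        raise ValueError("model_prob must be between 0 and 1")
    if not 1 <= price_cents <= 99:
        raise ValueError("price_cents must be between 1 and 99")

    price = price_cents / 100
    edge = model_prob - price
    if edge < limits.min_edge:
        return 0  # not enough edge to justify a trade

    # Kelly stake for a binary contract bought at `price`: (p - price) / (1 - price)
    full_kelly = edge / (1 - price)
    stake_fraction = min(full_kelly * limits.kelly_fraction, limits.max_stake_fraction)
    stake_cents = int(bankroll_cents * stake_fraction)
    return stake_cents // price_cents


# Example: a 60% model probability against a 50-cent price on a $1,000 bankroll.
order_size = contracts_to_buy(model_prob=0.60, price_cents=50, bankroll_cents=100_000)
```

With these illustrative limits the per-trade cap binds before the Kelly stake does, so the example sizes the order at 2% of the bankroll.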

Python · Kalshi API · async · ML

Live

Zeitgeist

Real-time sentiment pipeline tracking 95 entities across news and social feeds. Kafka ingestion, Flink processing, local LLM scoring — sub-second latency.
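
The per-entity scoring step can be pictured as keyed state that is updated on every incoming message. The sketch below is a plain-Python stand-in for that idea, not the Flink job itself: the `EntitySentiment` class and the exponentially weighted moving average are assumptions chosen for illustration, and the scores are presumed to arrive already computed by the LLM.

```python
class EntitySentiment:
    """Keeps one smoothed sentiment value per entity (hypothetical helper)."""

    def __init__(self, alpha: float = 0.2):
        if not 0.0 < alpha <= 1.0:
            raise ValueError("alpha must be in (0, 1]")
        self.alpha = alpha  # weight given to the newest score
        self._state: dict[str, float] = {}

    def update(self, entity: str, score: float) -> float:
        """Fold a new score into the entity's running average and return it."""
        previous = self._state.get(entity)
        if previous is None:
            current = score  # first observation seeds the average
        else:
            current = self.alpha * score + (1.0 - self.alpha) * previous
        self._state[entity] = current
        return current


# Example: two messages about one entity, one about another.
tracker = EntitySentiment(alpha=0.5)
tracker.update("ACME", 1.0)
smoothed = tracker.update("ACME", 0.0)
other = tracker.update("GLOBEX", -1.0)
```

Each entity's state is independent, which is the property that lets a stream processor partition the work by key.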

Kafka · Flink · local LLM · Python

Production Internal · Fortune 500 Financial Services

Informatica to AWS ETL Migration

Assumed primary ownership of migrating 200 Informatica workflows to a modern AWS stack. Designed a YAML-driven architecture with Apache Iceberg and Parquet-backed output on S3. Reached dark production deployment after approximately one year.
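
A YAML-driven design usually means each workflow is described by a small config file that a shared runner validates and executes. The sketch below shows one way that validation step might look. It is illustrative only: the `JobSpec` fields and `parse_job_spec` helper are hypothetical, and a plain dict stands in for the result of parsing a YAML file (for example with PyYAML's `yaml.safe_load`) so the example has no third-party dependencies.

```python
from dataclasses import dataclass

REQUIRED_KEYS = {"name", "source_query", "target_table"}
ALLOWED_WRITE_MODES = {"append", "overwrite"}


@dataclass(frozen=True)
class JobSpec:
    """One migrated workflow, as described by its config (hypothetical schema)."""
    name: str
    source_query: str
    target_table: str
    write_mode: str = "append"
    partition_by: tuple = ()


def parse_job_spec(raw: dict) -> JobSpec:
    """Validate a parsed config mapping and turn it into a JobSpec."""
    missing = REQUIRED_KEYS - set(raw)
    if missing:
        raise ValueError(f"missing keys: {sorted(missing)}")

    write_mode = raw.get("write_mode", "append")
    if write_mode not in ALLOWED_WRITE_MODES:
        raise ValueError(f"unsupported write_mode: {write_mode!r}")

    return JobSpec(
        name=raw["name"],
        source_query=raw["source_query"],
        target_table=raw["target_table"],
        write_mode=write_mode,
        partition_by=tuple(raw.get("partition_by", ())),
    )


# Example: the dict below mirrors what a parsed YAML job file might contain.
spec = parse_job_spec({
    "name": "daily_accounts",
    "source_query": "SELECT * FROM accounts",
    "target_table": "lake.accounts",
    "partition_by": ["load_date"],
})
```

Failing fast on a bad config keeps errors at deploy time rather than partway through a run, which matters when hundreds of workflows share one runner.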

AWS Glue · Airflow · Apache Iceberg · Parquet · Python · JDBC

Production Internal Tool · Source Private

CDM Mapping Tool

Internal tool at a Fortune 500 financial services firm that automates complex data model mapping. Reduced a multi-day manual process to minutes.

Claude Opus · Bedrock · AWS · Python

In Progress

Oracle

RAG layer bridging Zeitgeist's real-time sentiment signals and Florin's trading decisions. Retrieves relevant historical context via pgvector similarity search to sharpen prediction market entry accuracy.
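
Similarity search of this kind ranks stored embeddings by their distance to a query embedding. The sketch below reproduces that ranking in plain Python using cosine distance (1 minus cosine similarity), which is the metric pgvector exposes through its `<=>` operator. It is a toy illustration rather than Oracle's implementation: the `top_k` helper, the document ids, and the two-dimensional vectors are hypothetical, and a real deployment would run the ranking inside PostgreSQL.

```python
import math


def cosine_distance(a: list[float], b: list[float]) -> float:
    """Return 1 - cosine similarity for two equal-length, non-zero vectors."""
    if len(a) != len(b):
        raise ValueError("vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    if norm == 0.0:
        raise ValueError("cosine distance is undefined for a zero vector")
    return 1.0 - dot / norm


def top_k(query: list[float], documents: dict[str, list[float]], k: int = 3) -> list[str]:
    """Return the ids of the k documents closest to the query."""
    ranked = sorted(documents, key=lambda doc_id: cosine_distance(query, documents[doc_id]))
    return ranked[:k]


# Example: toy embeddings for three pieces of historical context.
history = {
    "rate_decision": [1.0, 0.0],
    "earnings_miss": [0.0, 1.0],
    "mixed_signal": [1.0, 1.0],
}
nearest = top_k([1.0, 0.1], history, k=2)
```

Here the query points almost along the first axis, so the retrieval returns `rate_decision` first and `mixed_signal` second.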

LangChain · pgvector · Python · OpenAI

Skills

Streaming & Data

  • Apache Kafka
  • Apache Flink
  • Apache Airflow
  • PySpark
  • Apache Iceberg
  • AWS Glue
  • Informatica
  • PostgreSQL / AWS RDS

Cloud & DevOps

  • AWS (S3, Lambda, EC2, EKS)
  • Terraform Cloud
  • Docker
  • GitHub Actions
  • SonarQube
  • Vercel

AI & ML

  • LLMs (Claude, GPT-4)
  • LangChain
  • RAG
  • pgvector
  • Amazon Bedrock
  • llama.cpp
  • GitHub Copilot
  • Sentiment NLP

Languages

  • Python
  • SQL
  • Bash
  • JavaScript
  • Selenium

Trading & Markets

  • Kalshi
  • Prediction Markets
  • Algorithmic Trading
  • Signal Processing

Leadership

  • Cross-functional Collaboration
  • Technical Roadmapping
  • PI Planning
  • Mentorship

Let's build something.

Let's talk data, trading systems, or whatever you're building.

kylepeiman16@gmail.com