Silvia Zeamer is a Software Engineer at Tolemi, working on data pipelines and infrastructure using Elixir, Postgres, Python, and other tools as needs dictate. She studied Media Arts and Sciences at Wellesley College. She has spent much of her life living on an off-grid homestead in coastal Maine, tackling a variety of interesting electrical problems but is now based in Boston, MA, where she spends her free time binding books, photographing nature, and designing knitwear.
In 2018, we decided to rewrite our Groovy-based data platform in Elixir. Today, we run roughly 25,000 jobs processing 2 billion records per day in a single Elixir/Phoenix application. We (Tolemi) integrate with and combine hundreds of various data sources to provide municipal governments across the USA with daily spatial data insights.
In this talk, I’ll walk through the key design decisions that allowed a team of three to migrate to Elixir, scale to this level while keeping complexity low and enabling implementation developers to contribute to our data pipelines.
We use Elixir’s concurrency model for orchestration, and Elixir as a glue layer that allows databases, containers, and scripts written in other languages (Python, DuckDB, Groovy, R scripts) to be used where they are most effective.
Through a series of practical code snippets and architectural choices, I will share the hard-won lessons and specific performance bottlenecks we overcame to maintain a lean, high-throughput data pipeline.
Key Takeaways:
Target Audience: