Tom Lous

Berkel & Rodenrijs · Netherlands · +31645528510 · info@graphiq.xyz · KVK 57477574

I'm a Data & ML Software Engineer
I develop Scala and Python software that runs on a Spark cluster, and I dockerize functional Scala microservices to run on a Kubernetes cluster.
I'm proficient with many of the tools needed to set up a big data ingestion & processing pipeline in the cloud and to deploy the results via a scalable API.
I'm also skilled in cleaning & analyzing huge amounts of data, followed by training, validating & testing machine learning models and deploying them in production.

Experience

Senior Data Engineer

Eneco, Rotterdam

Building streaming data products on Azure

Python, Kafka, dbt, Snowflake, Rust.

April 2024 - Present

Senior Software Engineer

DHL, Utrecht

Building streaming data products with ZIO 2, Scala 3, Kafka and Cassandra on Azure Kubernetes

Scala, ZIO, Kafka, Azure, Kubernetes.

October 2022 - April 2024

Lead Data Engineer

Schiphol, Amsterdam

Leading the data factory team and implementing scalable data ingestion solutions for a data mesh architecture using Spark, Scala, Databricks, Kafka and Kubernetes (OpenShift)

Scala, Spark, ZIO, Azure, Kubernetes, Databricks.

November 2021 - October 2022

Principal Engineer

Nike, Hilversum

Part of the Architecture Chapter

Spark, Python, AWS, Airflow, Ansible, Snowflake & Hadoop development.

June 2021 - November 2021

Lead Data Engineer

Shell, Rotterdam

Part of the Agile Hub

Spark, Scala, Azure, Kubernetes, Airflow, Terraform & Hadoop development.

June 2019 - June 2021

Machine Learning Engineer

VodafoneZiggo, Utrecht

Part of the Advanced Analytics Platform (AAP) and Technical Passport (TP)

(Py)Spark, Hive, Oozie & Hadoop development.

April 2019 - June 2019

Big Data Engineer

eBay Classifieds Group, Amsterdam

Data ingestion as a service (Kafka, Hadoop, Kubernetes) @ eBay's PE (Platform Engineering) Team

Spark, Scala, Flink, Hadoop, Cassandra, Kafka & Machine Learning @ eBay's CDATA (Central Data) Team

June 2018 - April 2019

Big Data & Machine Learning Software Engineer

Datlinq Datalabs, Rotterdam

Building fully automatic data ingestion & processing pipelines in the cloud with Scala, Spark and Airflow. Enrichment via machine learning and deployment via API on top of Elasticsearch in Kubernetes.

April 2016 - June 2018

« 15 years of diverse tech-related jobs »

All over the place

In the past I've been a PHP web developer, MySQL database admin, Linux system engineer, IT manager, team lead, frontend developer, iOS software developer, etc. These roles are less relevant to the tools I'm using now, but they do make me an experienced, well-rounded developer.

Check LinkedIn for the details, or read my story

Jan 2001 - April 2016

Speaker Bio

DevOps for Data Engineers

Young Maverics Training, Remote
2020 - 2021

Functional Programming in Scala

Young Maverics Training, Remote
2020 - 2021

Building & Deploying Spark Applications

Young Maverics Training, Remote
2020 - 2021

Deploying Apache Spark Jobs on Kubernetes with Helm and Spark Operator

Spark+AI Summit 2020, San Francisco
June 2020

Apache Airflow & Apache Spark data pipelines in the cloud

Data Driven Rijnmond Meetup
January 2018

Googling the error message - 2

Days of Code
July 2017

Building a Data Ingestion & Processing Pipeline with Spark & Airflow

Data Driven Rijnmond Meetup
February 2017

Building a Distributed Data Pipeline

Days of Code
July 2016

Googling the error message

Days of Code
July 2016

Published Articles

Building an open source Scala gRPC/REST HTTP Proxy for Kafka

Medium Part 1 - Part 2
March 2021

CI/CD for Data Engineers

February 2021

Deploying Apache Spark Jobs on Kubernetes with Helm and Spark Operator

January 2020

Record linking with Apache Spark’s MLlib & GraphX

April 2017

Re-becoming a developer

April 2016

Installing Octave on Mac OS X Mountain Lion

September 2012

Skills

Programming Languages & Tools

Open Source contributions

Interests

Apart from being a software engineer, I organize a regular meetup for data enthusiasts in Rotterdam and surroundings, called Data Driven Rijnmond.

I also spend a lot of my free time at night learning new things through MOOCs such as Coursera, resulting in 34 certifications.

In my free time I enjoy spending time with my wife, daughter and son. I run, swim, play tennis and golf, and get in shape with a personal trainer. I also hold a 1st dan (shodan) in Aikido.

I love puzzles and attempt to escape from about 4-5 escape rooms every year.

When forced indoors I like to read and watch YouTube (my favorite channels: PBS Space Time, Mark Rober, Numberphile, SmarterEveryDay, 3Blue1Brown, standupmaths, minutephysics, CGP Grey, Kurzgesagt, Veritasium, Critical Role, Simone Giertz). I also play board games or catch up on some movies and series.

Availability

starting October 2024

GraphIQ

GraphIQ
Spinel 7
2651RV Berkel en Rodenrijs

info@graphiq.xyz

+31645528510

KvK (CoC): 57477574
BTW (VAT): NL001703498B23
Bank (IBAN): NL15INGB0007044957
Swift (BIC): INGBNL2A