Technologies

We work with industry-leading technologies to build robust, scalable data platforms.

Data Processing & Storage

Apache Spark

Apache Spark

Large-scale data processing

Apache Kafka

Apache Kafka

Real-time data streaming

Apache Kafka

Kafka Connect

Data integration framework

Apache Flink

Apache Flink

Stream processing

D

Delta Lake

ACID transactions on data lakes

I

Apache Iceberg

Table format for huge datasets

DuckDB

DuckDB

Embedded analytical database

MinIO

MinIO

S3-compatible object storage

PostgreSQL

PostgreSQL

Relational database

Cloud Platforms

AWS

Amazon Web Services

Google Cloud

Google Cloud

GCP infrastructure

Azure

Microsoft cloud platform

Kubernetes

Kubernetes

Container orchestration

Terraform

Terraform

Infrastructure as code

Docker

Docker

Containerization

Data Warehousing

Snowflake

Snowflake

Cloud data warehouse

Google Cloud

BigQuery

Google's serverless warehouse

Redshift

AWS data warehouse

Trino

Trino

Distributed SQL query engine

Databricks

Databricks

Unified analytics platform

dbt

dbt

Data transformation

Apache Airflow

Airflow

Workflow orchestration

AI & Machine Learning

Python

Python

ML development language

Google Cloud

Vertex AI

Google Cloud ML platform

SageMaker

AWS ML platform

Databricks

Databricks ML

Unified ML on Databricks

MLflow

MLflow

ML lifecycle management

K

Kubeflow

ML on Kubernetes

Analytics & Reporting

Metabase

Metabase

Open-source BI platform

Looker

Looker

Enterprise BI and analytics