Technologies
We work with industry-leading technologies to build robust, scalable data platforms.
Data Processing & Storage
Apache Spark
Large-scale data processing
Apache Kafka
Real-time data streaming
Kafka Connect
Data integration framework
Apache Flink
Stream processing
Delta Lake
ACID transactions on data lakes
Apache Iceberg
Open table format for huge analytic datasets
DuckDB
Embedded analytical database
MinIO
S3-compatible object storage
PostgreSQL
Relational database
Cloud Platforms
AWS
Amazon Web Services
Google Cloud
Google Cloud Platform
Azure
Microsoft cloud platform
Kubernetes
Container orchestration
Terraform
Infrastructure as code
Docker
Containerization
Data Warehousing
Snowflake
Cloud data warehouse
BigQuery
Google's serverless warehouse
Redshift
AWS data warehouse
Trino
Distributed SQL query engine
Databricks
Unified analytics platform
dbt
SQL-based data transformation
Apache Airflow
Workflow orchestration
AI & Machine Learning
Python
ML development language
Vertex AI
Google Cloud ML platform
SageMaker
AWS ML platform
Databricks ML
Machine learning on the Databricks platform
MLflow
ML lifecycle management
Kubeflow
ML on Kubernetes
Analytics & Reporting
Metabase
Open-source BI platform
Looker
Enterprise BI and analytics