DEV Community

Aniket Abhishek Soni profile picture

Aniket Abhishek Soni

Senior Data Engineer at Cognizant with 7+ years in ETL pipelines, AI-driven analytics, and cloud-native systems (Databricks, PySpark, Snowflake, AWS). Writing about data engineering, AI & cloud.

Location Brooklyn, New York, United States of America Joined Joined on  Personal website https://aniketsoni.com

Education

MS Computer Science - Southern Arkansas University

Pronouns

He/Him/His

Work

Senior Data Engineer at Cognizant Technology Solutions

Why your Iceberg catalog choice is costing you more than your storage

Why your Iceberg catalog choice is costing you more than your storage

1
Comments
5 min read
Text-to-SQL is a solved problem: why you’re about to leak your PII

Text-to-SQL is a solved problem: why you’re about to leak your PII

1
Comments
5 min read
Time Travel isn't a Debugging Luxury: Why Delta and Iceberg are Compliance Essentials

Time Travel isn't a Debugging Luxury: Why Delta and Iceberg are Compliance Essentials

1
Comments
4 min read
Is BigLake the End of Your Vendor Lock-in Delusion?

Is BigLake the End of Your Vendor Lock-in Delusion?

1
Comments
5 min read
Querying Petabytes of Iceberg Tables via BigLake without Breaking Production

Querying Petabytes of Iceberg Tables via BigLake without Breaking Production

1
Comments
4 min read
Why I’m finally ditching Hive Metastore for BigLake Iceberg

Why I’m finally ditching Hive Metastore for BigLake Iceberg

1
Comments
5 min read
How I Finally Killed the Full-Refresh Silver Layer

How I Finally Killed the Full-Refresh Silver Layer

Comments
5 min read
Cutting Snowflake compute costs 40 percent: warehouse sizing, auto-suspend, and query pruning

Cutting Snowflake compute costs 40 percent: warehouse sizing, auto-suspend, and query pruning

Comments
5 min read
Streaming Tables vs. Materialized Views: Stop Guessing Your Databricks Refresh Strategy

Streaming Tables vs. Materialized Views: Stop Guessing Your Databricks Refresh Strategy

Comments
5 min read
Stop Burning Cash: Databricks Cost Optimization Patterns That Actually Work

Stop Burning Cash: Databricks Cost Optimization Patterns That Actually Work

Comments
5 min read
Navigating Schema Shifts: Keeping Your Streaming Pipeline Smooth for Everyone

Navigating Schema Shifts: Keeping Your Streaming Pipeline Smooth for Everyone

Comments
4 min read
The Silent Killer in Your Streaming Pipeline: Schema Evolution Without Tears

The Silent Killer in Your Streaming Pipeline: Schema Evolution Without Tears

Comments
10 min read
Zero to Hardened: A Practical Migration Playbook for Docker Hardened Images in Regulated Industries

Zero to Hardened: A Practical Migration Playbook for Docker Hardened Images in Regulated Industries

Comments
7 min read
It Works on My Cluster: Containerizing Spark and Lakehouse Development with Docker

It Works on My Cluster: Containerizing Spark and Lakehouse Development with Docker

Comments
7 min read
Biohack Your Brain with Neural Implants: Why Silicon Valley's Elite Are Betting Big in 2026

Biohack Your Brain with Neural Implants: Why Silicon Valley's Elite Are Betting Big in 2026

Comments
5 min read
loading...