Rahul Joshi
Distinguished Data Engineer specializing in cloud-native data platforms for financial services. M.Tech, IIT Kharagpur. I write about lakehouse architecture, data engineering at scale, and building AI-ready data platforms.
Journal article on why data platform architecture — not model architecture — is the primary determinant of AI system reliability.
Analysis of format convergence between Delta Lake and Apache Iceberg — what it means for lakehouse architecture.
Deep dive into Delta Lake’s transaction log internals — how ACID transactions work on object storage.
From Hadoop to the modern lakehouse: tracing the architectural evolution of data lakes.
Proposes a variant of the Kendall-Tau distance metric for evaluating rank aggregation. Published in Springer PReMI 2011.