PinnedBalachandar PaulrajEssential Considerations for Data Engineers When Selecting a NoSQL DatabaseIn the realm of modern data engineering, the choices abound, and the stakes are high. Data engineers are the architects of the digital age…Sep 4, 202318Sep 4, 202318
PinnedBalachandar Paulraj2022 : Modern Data StackYou might have seen multiple posts around this subject as time keeps evolving and bringing changes into tech stack, however this includes…May 3, 202215May 3, 202215
PinnedBalachandar PaulrajDuckDB: Primer on the subject and fascinating highlightsThroughout our data engineering journey, we’ve come across a myriad of database management systems (DBMS). But what sets DuckDB apart from…Jul 2, 20236Jul 2, 20236
Balachandar PaulrajPOLARS: A Swift and Powerful DataFrame Library for Analytical TasksEssential to data engineering and data science are the tasks of data manipulation and analysis. Pandas has long been the staple library for…Jun 19Jun 19
Balachandar PaulrajHarnessing the Potential of Databricks Liquid Clustering: A Dynamic Data Layout Scaling with Growth…Databricks made waves at the previous Data + AI Summit by introducing Liquid Clustering alongside Delta Universal Format (UniForm) and…Jan 32Jan 32
Balachandar PaulrajAirbyte Spotlight: The Open-Source Solution for Data Integration — Features, Benefits, and…Data integration is foundational to the success of modern businesses, fostering better decision-making, operational efficiency, and…Dec 26, 20231Dec 26, 20231
Balachandar PaulrajFast-Track PySpark UDF execution with Apache ArrowDevelopers often create custom UDFs (user-defined-functions) in their Spark code to handle specific transformations. This allows users to…Nov 19, 2023Nov 19, 2023
Balachandar PaulrajRAY: Distributed computing framework for ML & AIThe evolving domain of artificial intelligence and machine learning is witnessing an unprecedented demand for tools that are efficient…Nov 6, 2023Nov 6, 2023
Balachandar PaulrajKey Database Compaction Strategies Used In Distributed SystemIn the realm of distributed database systems, the adoption of compaction strategies plays a pivotal role in the effective management of…Sep 4, 2023Sep 4, 2023
Balachandar PaulrajApache Paimon: A fresh face joins the frayRecently, few people might have heard about Apache Paimon. Undergoing incubation at the Apache Software Foundation (ASF), Apache Paimon is…Apr 3, 20231Apr 3, 20231