Balachandar Paulraj – Medium

Balachandar Paulraj

Pinned

Balachandar Paulraj

Essential Considerations for Data Engineers When Selecting a NoSQL Database

In the realm of modern data engineering, the choices abound, and the stakes are high. Data engineers are the architects of the digital age…

Sep 4, 2023

Essential Considerations for Data Engineers When Selecting a NoSQL Database

Sep 4, 2023

Pinned

Balachandar Paulraj

2022 : Modern Data Stack

You might have seen multiple posts around this subject as time keeps evolving and bringing changes into tech stack, however this includes…

May 3, 2022

2022 : Modern Data Stack

May 3, 2022

Pinned

Balachandar Paulraj

DuckDB: Primer on the subject and fascinating highlights

Throughout our data engineering journey, we’ve come across a myriad of database management systems (DBMS). But what sets DuckDB apart from…

Jul 2, 2023

DuckDB: Primer on the subject and fascinating highlights

Jul 2, 2023

Balachandar Paulraj

POLARS: A Swift and Powerful DataFrame Library for Analytical Tasks

Essential to data engineering and data science are the tasks of data manipulation and analysis. Pandas has long been the staple library for…

Jun 19

POLARS: A Swift and Powerful DataFrame Library for Analytical Tasks

Jun 19

Balachandar Paulraj

Harnessing the Potential of Databricks Liquid Clustering: A Dynamic Data Layout Scaling with Growth…

Databricks made waves at the previous Data + AI Summit by introducing Liquid Clustering alongside Delta Universal Format (UniForm) and…

Jan 3

Harnessing the Potential of Databricks Liquid Clustering: A Dynamic Data Layout Scaling with Growth…

Jan 3

Balachandar Paulraj

Airbyte Spotlight: The Open-Source Solution for Data Integration — Features, Benefits, and…

Data integration is foundational to the success of modern businesses, fostering better decision-making, operational efficiency, and…

Dec 26, 2023

Airbyte Spotlight: The Open-Source Solution for Data Integration — Features, Benefits, and…

Dec 26, 2023

Balachandar Paulraj

Fast-Track PySpark UDF execution with Apache Arrow

Developers often create custom UDFs (user-defined-functions) in their Spark code to handle specific transformations. This allows users to…

Nov 19, 2023

Fast-Track PySpark UDF execution with Apache Arrow

Nov 19, 2023

Balachandar Paulraj

RAY: Distributed computing framework for ML & AI

The evolving domain of artificial intelligence and machine learning is witnessing an unprecedented demand for tools that are efficient…

Nov 6, 2023

RAY: Distributed computing framework for ML & AI

Nov 6, 2023

Balachandar Paulraj

Key Database Compaction Strategies Used In Distributed System

In the realm of distributed database systems, the adoption of compaction strategies plays a pivotal role in the effective management of…

Sep 4, 2023

Key Database Compaction Strategies Used In Distributed System

Sep 4, 2023

Balachandar Paulraj

Apache Paimon: A fresh face joins the fray

Recently, few people might have heard about Apache Paimon. Undergoing incubation at the Apache Software Foundation (ASF), Apache Paimon is…

Apr 3, 2023

Apache Paimon: A fresh face joins the fray

Apr 3, 2023

Balachandar Paulraj

Balachandar Paulraj

Big Data Habitue. Current stint at PlayStation. https://www.linkedin.com/in/balachandar-paulraj-b8a26727

Help
Status
About
Careers
Press
Blog
Privacy
Terms
Text to speech
Teams