Balachandar Paulraj – Medium

Balachandar Paulraj

Pinned

Balachandar Paulraj

Essential Considerations for Data Engineers When Selecting a NoSQL Database

In the realm of modern data engineering, the choices abound, and the stakes are high. Data engineers are the architects of the digital age…

3 min readSep 4, 2023

--

18

Essential Considerations for Data Engineers When Selecting a NoSQL Database

--

18

Pinned

Balachandar Paulraj

2022 : Modern Data Stack

You might have seen multiple posts around this subject as time keeps evolving and bringing changes into tech stack, however this includes…

5 min readMay 3, 2022

--

15

2022 : Modern Data Stack

--

15

Pinned

Balachandar Paulraj

DuckDB: Primer on the subject and fascinating highlights

Throughout our data engineering journey, we’ve come across a myriad of database management systems (DBMS). But what sets DuckDB apart from…

4 min readJul 2, 2023

--

6

DuckDB: Primer on the subject and fascinating highlights

--

6

Balachandar Paulraj

Harnessing the Potential of Databricks Liquid Clustering: A Dynamic Data Layout Scaling with Growth…

Databricks made waves at the previous Data + AI Summit by introducing Liquid Clustering alongside Delta Universal Format (UniForm) and…

3 min readJan 3, 2024

--

2

Harnessing the Potential of Databricks Liquid Clustering: A Dynamic Data Layout Scaling with Growth…

--

2

Balachandar Paulraj

Airbyte Spotlight: The Open-Source Solution for Data Integration — Features, Benefits, and…

Data integration is foundational to the success of modern businesses, fostering better decision-making, operational efficiency, and…

3 min readDec 26, 2023

--

2

Airbyte Spotlight: The Open-Source Solution for Data Integration — Features, Benefits, and…

--

2

Balachandar Paulraj

Fast-Track PySpark UDF execution with Apache Arrow

Developers often create custom UDFs (user-defined-functions) in their Spark code to handle specific transformations. This allows users to…

4 min readNov 19, 2023

--

Fast-Track PySpark UDF execution with Apache Arrow

--

Balachandar Paulraj

RAY: Distributed computing framework for ML & AI

The evolving domain of artificial intelligence and machine learning is witnessing an unprecedented demand for tools that are efficient…

4 min readNov 6, 2023

--

RAY: Distributed computing framework for ML & AI

--

Balachandar Paulraj

Key Database Compaction Strategies Used In Distributed System

In the realm of distributed database systems, the adoption of compaction strategies plays a pivotal role in the effective management of…

3 min readSep 4, 2023

--

Key Database Compaction Strategies Used In Distributed System

--

Balachandar Paulraj

Apache Paimon: A fresh face joins the fray

Recently, few people might have heard about Apache Paimon. Undergoing incubation at the Apache Software Foundation (ASF), Apache Paimon is…

3 min readApr 3, 2023

--

1

Apache Paimon: A fresh face joins the fray

--

1

Balachandar Paulraj

Can Snowpark supersede Databricks and AWS EMR?

An Overview on Snowpark compared with Spark

3 min readJun 28, 2022

--

6

Can Snowpark supersede Databricks and AWS EMR?

--

6

Balachandar Paulraj

Balachandar Paulraj

Big Data Habitue. Current stint at PlayStation. https://www.linkedin.com/in/balachandar-paulraj-b8a26727

Help
Status
About
Careers
Blog
Privacy
Terms
Text to speech
Teams