Essential Considerations for Data Engineers When Selecting a NoSQL Database
In the realm of modern data engineering, the choices abound, and the stakes are high. Data engineers are the architects of the digital age… (Sep 4, 2023)
2022: Modern Data Stack
You might have seen multiple posts on this subject, as time keeps bringing changes to the tech stack; however, this includes… (May 3, 2022)
DuckDB: Primer on the subject and fascinating highlights
Throughout our data engineering journey, we’ve come across a myriad of database management systems (DBMS). But what sets DuckDB apart from… (Jul 2, 2023)
Google Spanner: The Database That Scales Globally with Strong Consistency
In the modern data engineering landscape, companies require applications that can scale globally without compromising on performance and… (Nov 13)
POLARS: A Swift and Powerful DataFrame Library for Analytical Tasks
Essential to data engineering and data science are the tasks of data manipulation and analysis. Pandas has long been the staple library for… (Jun 19)
Harnessing the Potential of Databricks Liquid Clustering: A Dynamic Data Layout Scaling with Growth…
Databricks made waves at the previous Data + AI Summit by introducing Liquid Clustering alongside Delta Universal Format (UniForm) and… (Jan 3)
Airbyte Spotlight: The Open-Source Solution for Data Integration - Features, Benefits, and…
Data integration is foundational to the success of modern businesses, fostering better decision-making, operational efficiency, and… (Dec 26, 2023)
Fast-Track PySpark UDF execution with Apache Arrow
Developers often create custom UDFs (user-defined functions) in their Spark code to handle specific transformations. This allows users to… (Nov 19, 2023)
RAY: Distributed computing framework for ML & AI
The evolving domain of artificial intelligence and machine learning is witnessing an unprecedented demand for tools that are efficient… (Nov 6, 2023)
Key Database Compaction Strategies Used In Distributed System
In the realm of distributed database systems, the adoption of compaction strategies plays a pivotal role in the effective management of… (Sep 4, 2023)