February 01, 2025

Hadoop - The Evolution of Hadoop: From Batch to Real-Time Processing
Hadoop has come a long way since its inception. Originally designed as a batch processing framework for handling large datasets, it has evolved into a versatile ecosystem that supports real-time data processing and analytics. This article explores the journey of Hadoop, from its early days to its modern capabilities. Read More
by
Phee Jay
January 20, 2025

Hadoop - What is Hadoop? An Overview of the Ecosystem
In an era where data is considered the new oil, managing and processing large volumes of data efficiently is critical for businesses. Hadoop, an open-source framework, has revolutionized the way organizations store, process, and analyze massive datasets. In this article, we’ll explore what Hadoop is, its core components, and how it fits into the broader big data ecosystem. Read More
by
Phee Jay
January 03, 2025

Explained - Key Data Concepts: Data Warehouse, Data Mart, Data Lake, and Data Lakehouse
Explore the differences between data warehouses, data marts, data lakes, and data lakehouses. Understand their features, use cases, and how to choose the right solution for effective data management. Read More
by
Phee Jay
January 03, 2025

Explained - Lambda Architecture
Learn about Lambda Architecture, a data-processing framework that combines batch and real-time processing to efficiently handle massive data streams, ensuring scalability, fault tolerance, and accuracy in data systems. Read More
by
Phee Jay
January 03, 2025

Explained - Key differences between Kafka and AMQP
Understand the key differences between Kafka and AMQP, comparing their architectures and use cases to help you choose the right solution. Read More
by
Phee Jay
October 28, 2024

Explained - Data Mesh
Explore the concept of Data Mesh in this white paper, a decentralized approach to data architecture. Learn how it empowers domain teams, ensures scalability, and improves data ownership and quality, revolutionizing traditional data management practices. Read More
by
Phee Jay
October 28, 2024

Explained - Message Queuing Telemetry Transport (MQTT)
The blog post provides a detailed overview of the MQTT protocol, focusing on its architecture, benefits, and use cases in IoT environments. Read More
by
Phee Jay
October 28, 2024

Explained - Distributed Storage Systems
The blog post explores distributed storage systems, discussing their architecture, benefits, and key technologies like consistency models and replication. It covers use cases, scalability, fault tolerance, and the evolution of storage solutions for modern applications. Read More
by
Phee Jay
February 15, 2017

Java - How to Timeout JDBC Queries
Learn how to set timeouts for JDBC queries to prevent indefinite blocking. This guide covers configuring timeouts on individual statements, applying global timeouts at the driver level, and setting timeouts using Spring's JdbcTemplate. Read More
by
Phee Jay