Skip to main content

VirtusLab's ArticlesRSS

Data Engineering|Jan 15, 2025

What is data transformation?

Data transformation involves altering, refining, and structuring information into a standardized and usable format. It plays a key role in enabling effective data analysis, supporting decision-making, and driving organizational growth. Read more about different types of data transformation, as well as its benefits and challenges.

colorful_shapes_what_is_data_transformation
Data Engineering|Jan 10, 2025

How to manage data for your project? Comparing batch processing with stream processing.

In this article, we will compare two approaches to data processing, namely batch and stream processing. Each of them has different characteristics and serves different purposes, but we will attempt to break them down to help you decide which one would better suit your project.

roads_joining_into_one
Data Engineering|May 20, 2024

How to build an LLM chatbot for your company’s information

As organizations grow in size, the volume of internal information swells exponentially. A LLM chatbot helps to find the information and streamline data distribution.

How_to_build_an_LLM_chatbot_for_your_companys_information_image-min.jpg
Data Engineering|Nov 24, 2023

What does data transformation enable data analysts to accomplish?

Read everything about data transformation encompassing SQL, GUI, Scala, and Python, to get clean and structured data for your decision-making process. We delve into the technology and which is best for what project.

What_does_data_transformation_enable_data_analysts_to_accomplish__image-min.jpg
Data Engineering|Sep 27, 2023

What is data mesh? Redefining data platform architecture

Data Mesh has been founded on four key principles that revolutionized data management. It focuses on decentralization, domain-oriented teams, data-as-a-product, self-serve platform, and federated governance.

What_is_data_mesh_Redefining_data_platform_architecture_image-min.jpg
Data Engineering|May 19, 2023

Large Language Models: How to use open source alternatives to ChatGPT for Scala documentation

Large Language Models can revolutionize how programmers seek assistance. We tested them on Scala documentation and present the results here.

Large Language Models: How to use open source alternatives to ChatGPT for Scala documentation image
Data Engineering|Apr 25, 2023

Unlock the power of your analytical data platform for data-driven decisions

Businesses become competitive, once they avoid guesswork, create an environment for data-driven decisions, and use data from analytics platforms in their operational use cases. This is how you can excel.

Unlock the power of your analytical data platform for data-driven decisions image
Data Engineering|Mar 23, 2023

Is Hadoop still relevant: Is it our future, or does it belong to the past?

As business needs and market trends change, Hadoop and cloud data platforms will evolve together. Will Hadoop remain a data solution companies rely on?

Is_Hadoop_still_relevant__Is_it_our_future,_or_does_it_belong_to_the_past_image-min.jpg
Data Engineering|Aug 20, 2021

Table schemas in data pipelines Spark: How to handle large, nested & growing ones

In this post, we describe how we built a pipeline for the type of “incoming data” situation, and how we came up with a good solution in the end.

Table_schemas_in_data_pipelines_Spark_How_to_handle_large,_nested_&_growing_ones_image-min.jpg