Versioning NiFi Flows and Automating Their Deployment with NiFi Registry and...
When it comes to the efficient development of data flows, one can argue there should be a way to store different versions of the flows in some central repository and the possibility to deploy those...
View ArticleA Comparative Analysis of the Dask and Ray Libraries
Nowadays data analysis is one of the most important fields in the business world, full of enormous amounts of data to be analysed in order to extract conclusions and gain insights. As we are dealing...
View ArticleODI Multiple Execution Units and their Advantages
In one of our customer projects, we needed to identify the part of an ODI (Oracle Data Integrator)-generated mapping query that ran for longer than expected and then fix it with the help of various...
View ArticleEnhancing an AWS Data Platform with Airflow and Containers
Amazon Web Services (AWS) is the market-leading on-demand public cloud computing provider, getting more and more popular year after year. AWS improves its services regularly and creates new ones every...
View ArticleServerless Near Real-time Data Ingestion in BigQuery
Here at ClearPeaks we have been using Google Cloud Platform (GCP) in our customers’ projects for a while now, and a few months ago we published a blog article about a solution we had implemented for...
View ArticleReal-Time Streaming Analytics with Cloudera Data Flow and SQL Stream Builder
Real-time data processing is a critical aspect for most of today’s enterprises and organisations; data analytics teams are more and more often required to digest massive volumes of high-velocity data...
View ArticlePower BI Goals
It is nothing new to say that data is currently one of the most important assets for the proper functioning of a company. However, it is not enough to know this data, study, and analyse it – companies...
View ArticleHow Feature Engineering Trumps Algorithms
In the AI community we have recently seen a greater emphasis on moving from ‘Model Centric AI’ to ‘Data Centric AI’. Within the ‘Data Centric AI’ space there is an important data science lifecycle...
View ArticleIntegrating Apache MiNiFi with Apache NiFi for Collecting Data from the Edge
Business Intelligence (BI) is now a very well-known term among decision-makers. We could say that the concepts, methodologies, and paradigms behind the term are popping up almost every day, and these...
View Article