This article was published as part of the Data Science Blogathon. Introduction: A data model is an abstraction of real-world events that we use to create, capture, and store data in a database that user applications require, omitting unnecessary details.
Like Business Intelligence (BI), Process Mining is no longer a new phenomenon; almost all larger companies now run this kind of data-driven process analysis in their organization. The Event Log Data Model for Process Mining: Process Mining as an analytical system can be pictured as an iceberg.
Top employers such as Microsoft, Facebook, and consulting firms like Accenture are actively hiring for remote data science jobs in this field, with salaries generally ranging from $95,000 to $140,000. Strong analytical skills and the ability to work with large datasets are critical, as is familiarity with data modeling and ETL processes.
Text-to-SQL empowers people to explore data and draw insights using natural language, without requiring specialized database knowledge. Amazon Web Services (AWS) has helped many customers connect this text-to-SQL capability with their own data, which means more employees can generate insights.
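The excerpt above doesn't include code, but the shape of a text-to-SQL workflow can be sketched: a natural-language question is turned into SQL and run against a database. In this minimal sketch the SQL is hard-coded (standing in for an LLM's output), and the table and columns are invented for illustration.

```python
import sqlite3

# Toy database standing in for a customer data warehouse (schema invented for illustration).
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE orders (id INTEGER, region TEXT, amount REAL)")
conn.executemany("INSERT INTO orders VALUES (?, ?, ?)",
                 [(1, "EMEA", 120.0), (2, "APAC", 75.5), (3, "EMEA", 42.0)])

# In a real text-to-SQL system a model would generate this query from the question;
# here the mapping is hard-coded to show the workflow's shape.
question = "What is the total order amount per region?"
generated_sql = "SELECT region, SUM(amount) FROM orders GROUP BY region"

for region, total in conn.execute(generated_sql):
    print(region, total)
```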
While the front-end report visuals are important and the most visible to end users, a lot goes on behind the scenes that contributes heavily to the end product, including data modeling. In this blog, we'll describe data modeling and its significance in Power BI. What is Data Modeling?
Visualizing graph data doesn’t necessarily depend on a graph database… Working on a graph visualization project? You might assume that graph databases are the way to go – they have the word “graph” in them, after all. Do I need a graph database? It depends on your project. Unstructured?
That’s why our data visualization SDKs are database agnostic: so you’re free to choose the right stack for your application. There have been a lot of new entrants and innovations in the graph database category, with some vendors slowly dipping below the radar, or always staying on the periphery.
Summary: Time series databases (TSDBs) are built for efficiently storing and analyzing data that changes over time. This data, often from sensors or IoT devices, is typically collected at regular intervals. Within this data ocean, a specific type holds immense value: time series data.
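As a small illustration of the kind of regular-interval data a TSDB holds, here is a sketch using pandas (assumed available; the sensor values are simulated) that downsamples minute-level readings to hourly averages, a typical time series query.

```python
import numpy as np
import pandas as pd

# Simulated sensor readings at one-minute intervals (typical TSDB-style data).
idx = pd.date_range("2024-01-01", periods=180, freq="min")
readings = pd.Series(20 + np.random.randn(180), index=idx, name="temperature_c")

# Downsample to hourly means, a common query against time series stores.
hourly = readings.resample("1h").mean()
print(hourly)
```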
Welcome to the wild, wacky world of databases! New to the digital world? You'll find that these unsung heroes of the digital age are essential for keeping your data organised and secure. But with so many types of databases to choose from, how do you know which one is right for you? The most well-known graph database is Neo4j.
Key features of cloud analytics solutions include: data models, processing applications, and analytics models. Data models help visualize and organize data, processing applications handle large datasets efficiently, and analytics models aid in understanding complex data sets, laying the foundation for business intelligence.
In this article, we will delve into the concept of data lakes, explore their differences from data warehouses and relational databases, and discuss the significance of data version control in the context of large-scale data management. This ensures data consistency and integrity.
Graph databases and knowledge graphs are among the most widely adopted solutions for managing data represented as graphs, consisting of nodes (entities) and edges (relationships). Knowledge graphs extend the capabilities of graph databases by incorporating mechanisms to infer and derive new knowledge from the existing graph data.
Kyle Kingsbury, 2025-06-06. TigerBeetle is a distributed OLTP database oriented towards financial transactions. Background: TigerBeetle is an Online Transaction Processing (OLTP) database built for double-entry accounting with a strong emphasis on safety and speed. Events within a request are executed in order. through 0.16.30.
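TigerBeetle ships its own client libraries, but the double-entry idea it is built around can be sketched in plain Python: every transfer records the same amount as a debit on one account and a credit on another, and transfers are applied strictly in order. The account names and amounts below are invented, and the model is deliberately simplified.

```python
from collections import defaultdict
from dataclasses import dataclass

@dataclass
class Transfer:
    debit_account: str
    credit_account: str
    amount: int  # integer minor units (e.g. cents) to avoid float rounding

debits = defaultdict(int)
credits = defaultdict(int)

def apply_in_order(transfers):
    # Transfers are applied strictly in the order received.
    for t in transfers:
        debits[t.debit_account] += t.amount
        credits[t.credit_account] += t.amount

apply_in_order([
    Transfer("accounts_receivable", "revenue", 5_000),  # invoice a customer
    Transfer("cash", "accounts_receivable", 5_000),     # customer pays
])

# The books balance: total debits always equal total credits.
assert sum(debits.values()) == sum(credits.values())
print(dict(debits), dict(credits))
```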
This Azure Cosmos DB tutorial shows you how to integrate Microsoft's multi-model database service with our graph and timeline visualization SDKs to build an interactive graph application. Create a graph data model: Our chess dataset is in CSV file format, not a graph, so we'll have to think about what sort of graph data model to apply.
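The chess dataset itself isn't reproduced in the excerpt, but the modelling step can be sketched: treat players as nodes and each game as an edge between them. The column names below are assumptions for illustration, not the tutorial's actual schema.

```python
import csv
import io

# Stand-in for the chess CSV (column names invented for illustration).
raw = io.StringIO(
    "white,black,result\n"
    "Carlsen,Nepomniachtchi,1-0\n"
    "Nepomniachtchi,Caruana,1/2-1/2\n"
)

nodes, edges = set(), []
for row in csv.DictReader(raw):
    nodes.update([row["white"], row["black"]])                  # players become nodes
    edges.append((row["white"], row["black"], row["result"]))   # games become edges

print(sorted(nodes))
print(edges)
```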
Four reference lines on the x-axis indicate key events in Tableau's almost two-decade history: the first Tableau Conference in 2008. Chris had earned an undergraduate computer science degree from Simon Fraser University and had worked as a database-oriented software engineer. Release v1.0 (April 2005) is in the top left corner.
Summary: Apache Cassandra and MongoDB are leading NoSQL databases with unique strengths. Introduction In the realm of database management systems, two prominent players have emerged in the NoSQL landscape: Apache Cassandra and MongoDB. Flexible Data Model: Supports a wide variety of data formats and allows for dynamic schema changes.
You can combine this data with real datasets to improve AI model training and predictive accuracy. Creating synthetic test data to expedite testing, optimization and validation of new applications and features. Using synthetic data to prevent the exposure of sensitive data in machine learning algorithms.
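One simple way to produce synthetic test records is to sample from chosen distributions, as in the sketch below (NumPy assumed available; the column names and distributions are made up for illustration).

```python
import numpy as np

rng = np.random.default_rng(seed=42)
n = 1_000

# Synthetic customer records: no real user data is exposed, but the shape and
# rough distributions can mimic production data for testing and model training.
synthetic = {
    "age": rng.integers(18, 80, size=n),
    "monthly_spend": rng.gamma(shape=2.0, scale=50.0, size=n).round(2),
    "churned": rng.random(n) < 0.15,
}

print({name: values[:5] for name, values in synthetic.items()})
```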
Without data engineering , companies would struggle to analyse information and make informed decisions. What Does a Data Engineer Do? A data engineer creates and manages the pipelines that transfer data from different sources to databases or cloud storage. How is Data Engineering Different from Data Science?
By acquiring expertise in statistical techniques, machine learning professionals can develop more advanced and sophisticated algorithms, which can lead to better outcomes in data analysis and prediction. These techniques can be utilized to estimate the likelihood of future events and inform the decision-making process.
The Neo4j graph data platform Neo4j has cemented itself as the market leader in graph database management systems, so it’s no surprise that many of our customers want to visualize connected data stored in Neo4j databases. It’s a great option if you don’t want the hassle of database administration.
To build a high-performance, scalable graph visualization application, you need a reliable way to store and query your data. Neo4j is one of the most popular graph database choices among our customers. This will replicate a full Neo4j database and let us test our Cypher querying. So let’s continue.
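As a hedged sketch of what testing Cypher queries against the replicated database might look like with the official neo4j Python driver: the connection details, node label, and relationship type below are placeholders, not values from the article.

```python
from neo4j import GraphDatabase

# Placeholder connection details for a local Neo4j instance.
driver = GraphDatabase.driver("bolt://localhost:7687", auth=("neo4j", "password"))

# A simple Cypher query; the label and property names depend on your data model.
cypher = "MATCH (a:Person)-[:KNOWS]->(b:Person) RETURN a.name, b.name LIMIT 10"

with driver.session() as session:
    for record in session.run(cypher):
        print(record["a.name"], "->", record["b.name"])

driver.close()
```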
ETL Design Pattern The ETL (Extract, Transform, Load) design pattern is a commonly used pattern in data engineering. It is used to extract data from various sources, transform the data to fit a specific data model or schema, and then load the transformed data into a target system such as a data warehouse or a database.
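A minimal illustration of that pattern in plain Python; the source, target, and schema here are in-memory stand-ins rather than a real warehouse.

```python
# Minimal ETL sketch: extract from a source, transform to a target schema, load.
def extract():
    # Stand-in for reading from an API, file, or operational database.
    return [{"id": "1", "amount": "19.99"}, {"id": "2", "amount": "5.00"}]

def transform(rows):
    # Cast types and reshape records to fit the target schema.
    return [{"order_id": int(r["id"]), "amount_usd": float(r["amount"])} for r in rows]

def load(rows, target):
    # Stand-in for inserting into a warehouse table.
    target.extend(rows)

warehouse_table = []
load(transform(extract()), warehouse_table)
print(warehouse_table)
```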
And you should have experience working with big data platforms such as Hadoop or Apache Spark. Additionally, data science requires experience in SQL database coding and an ability to work with unstructured data of various types, such as video, audio, pictures and text.
Summary: The fundamentals of Data Engineering encompass essential practices like data modelling, warehousing, pipelines, and integration. Understanding these concepts enables professionals to build robust systems that facilitate effective data management and insightful analysis. What is Data Engineering?
Analysts rely on our data visualization toolkits to spot hidden patterns in their visualized data. They investigate these patterns and use them to predict – and, if possible, prevent – future events. What role can interactive data visualization play? I chose one containing significant earthquakes (5.5+
Challenges associated with these stages involve not knowing all touchpoints where data is persisted, maintaining a data pre-processing pipeline for document chunking, choosing a chunking strategy, vector database, and indexing strategy, generating embeddings, and any manual steps to purge data from vector stores and keep it in sync with source data.
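As a concrete example of one of those choices, a naive fixed-size chunking strategy with overlap (just one of many possible strategies) might look like this sketch:

```python
def chunk_text(text, chunk_size=200, overlap=50):
    """Naive fixed-size chunking with character overlap; real pipelines often
    split on sentences, headings, or tokens instead of raw characters."""
    chunks = []
    step = chunk_size - overlap
    for start in range(0, len(text), step):
        chunks.append(text[start:start + chunk_size])
    return chunks

document = "Retrieval-augmented generation keeps model answers grounded. " * 20
for i, chunk in enumerate(chunk_text(document)):
    print(i, len(chunk))
```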
A CDP has historically been an all-in-one platform designed to help companies collect, store, and unify customer data within a hosted database so that marketing and business teams can easily build audiences and activate data to downstream operational tools. dbt has become the standard for modeling.
Metrics vary depending on the data that a team deems important and can include network traffic, latency, and CPU usage. Logs: Logs are a record of events that occur within a software or application component. Prometheus is a time-series database for end-to-end monitoring of time-series data.
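For example, with the prometheus_client Python library an application can expose a counter metric for a Prometheus server to scrape; the metric name, port, and simulated request loop below are arbitrary choices for the sketch.

```python
import time
from prometheus_client import Counter, start_http_server

# Expose metrics on http://localhost:8000/metrics for Prometheus to scrape.
REQUESTS = Counter("app_requests_total", "Total requests handled by the app")

start_http_server(8000)
while True:
    REQUESTS.inc()   # increment on each (simulated) request
    time.sleep(1)    # demo loop; a real app would increment inside its handlers
```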
Feature engineering of tabular data demands considerable manual effort, making tabular data preparation even more dependent on luck or the data scientist’s skill set. One might say that tabular data modeling is the original data-centric AI! In practice, tabular data is anything but clean and uncomplicated.
Some of the common career opportunities in BI include: Entry-level roles Data analyst: A data analyst is responsible for collecting and analyzing data, creating reports, and presenting insights to stakeholders. They may also be involved in data modeling and database design.
In the training pipeline, teams can swap: The model itself, whether a version or a type. For example, based on user input or requirements, teams might switch from a full LLM to a smaller, more specialized model. In the application pipeline, teams can swap: Logging inputs + responses to various data sources (database, stream, file, etc.)
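One common way to make the model swappable is to hide it behind a small factory keyed by configuration, so switching from a full LLM to a smaller specialized model is a config change rather than a code change. The class and model names below are placeholders.

```python
# Sketch of a swappable model component: the pipeline depends only on a common
# predict() interface, so implementations can be exchanged via configuration.
class LargeLLM:
    def predict(self, text: str) -> str:
        return f"[large-llm] answer to: {text}"

class SmallClassifier:
    def predict(self, text: str) -> str:
        return f"[small-model] label for: {text}"

MODEL_REGISTRY = {"llm": LargeLLM, "small": SmallClassifier}

def build_model(config: dict):
    return MODEL_REGISTRY[config["model_type"]]()

model = build_model({"model_type": "small"})  # swap by changing config, not code
print(model.predict("Is this ticket urgent?"))
```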
Model versioning, lineage, and packaging: Can you version and reproduce models and experiments? Can you see the complete model lineage with data/models/experiments used downstream? Dolt: Dolt is an open-source relational database system built around Git-style versioning. Is it fast and reliable enough for your workflow?
Snowflake Summit 2022 (June 13-16) draws ever closer, and I believe it’s going to be a great event. A couple of sessions I’m excited about include the keynote The Engine & Platform Innovations Running the Data Cloud and learning how the frostbyte team conducts Rapid Prototyping of Industry Solutions. Prediction explanations.
The resolver provides instructions for turning GraphQL queries, mutations, and subscriptions into data, and retrieves data from databases, cloud services, and other sources. Resolvers also provide data format specifications and enable the system to stitch together data from various sources.
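The resolver idea can be sketched without any particular framework: each field gets a resolver function that knows where its data lives, and the results are stitched into one response. The field names and "databases" below are invented stand-ins.

```python
# Framework-free sketch of GraphQL-style resolvers: each field resolves from its
# own source, and the server stitches the results into one response.
USERS_DB = {"1": {"id": "1", "name": "Ada"}}          # stand-in for a database
ORDERS_SERVICE = {"1": [{"sku": "A-100", "qty": 2}]}  # stand-in for a cloud service

def resolve_user(_parent, args):
    return USERS_DB.get(args["id"])

def resolve_orders(user, _args):
    return ORDERS_SERVICE.get(user["id"], [])

def execute_query(user_id):
    # Roughly what a server does for: { user(id: "1") { name orders { sku qty } } }
    user = resolve_user(None, {"id": user_id})
    user["orders"] = resolve_orders(user, {})
    return user

print(execute_query("1"))
```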
Built for integration, scalability, governance, and industry-leading security, Snowflake optimizes how you can leverage your organization’s data, providing the following benefits: Built to Be a Source of Truth Snowflake is built to simplify data integration wherever it lives and whatever form it takes.
The Top AI Slides from ODSC West 2024 This blog highlights some of the most impactful AI slides from the world’s best data science instructors, focusing on cutting-edge advancements in AI, data modeling, and deployment strategies. Learn more about what to expect from this massive event here and why you won’t want to miss it.
Thus, the solution allows for scaling data workloads independently from one another and seamlessly handling data warehousing, data lakes , data sharing, and engineering. Snowflake Database Pros Extensive Storage Opportunities Snowflake provides affordability, scalability, and a user-friendly interface.
It is curated intentionally for a specific purpose, often to analyze and derive insights from the data it contains. Datasets are typically formatted and stored in files, databases, or spreadsheets, allowing for easy access and analysis. Types of Data: 1. Structured Data: It follows a specific schema, making it easy to analyze and process.
It includes processes that trace and document the origin of data, models and associated metadata and pipelines for audits. Curated foundation models, such as those created by IBM or Microsoft, help enterprises scale and accelerate the use and impact of the most advanced AI capabilities using trusted data.
Furthermore, the platform’s versatility extends beyond data analysis. This role involves configuring data inputs, managing users and permissions, and monitoring system performance. Explore Security and SIEM: Splunk is widely used in cybersecurity for security information and event management (SIEM).
These tables are called “factless fact tables” or “junction tables.” They are used for modelling many-to-many relationships or for capturing timestamps of events. This schema serves as the foundation of dimensional modeling. A star schema forms when a fact table combines with its dimension tables.
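As a tiny worked example of that star shape (pandas assumed; table contents invented), a fact table joins to its dimension tables on their keys:

```python
import pandas as pd

# Fact table: one row per sale, holding foreign keys and measures.
fact_sales = pd.DataFrame({
    "date_key": [20240101, 20240102],
    "product_key": [1, 2],
    "amount": [120.0, 75.5],
})

# Dimension tables: descriptive attributes referenced by the fact table.
dim_product = pd.DataFrame({"product_key": [1, 2], "product_name": ["Widget", "Gadget"]})
dim_date = pd.DataFrame({"date_key": [20240101, 20240102], "month": ["Jan", "Jan"]})

# Joining the fact table to its dimensions is the classic star-schema query pattern.
star = fact_sales.merge(dim_product, on="product_key").merge(dim_date, on="date_key")
print(star)
```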
Ask ten people to define data integrity, and you’ll likely get different answers. Many people use the term to describe a data quality metric. Technical users, including database administrators, might tell you that data integrity concerns whether or not the data conforms to a pre-defined data model.