Data is one of the most significant components of a machine learning (ML) workflow, which makes data management one of the most important factors in sustaining ML pipelines.
By Nate Rosidi, KDnuggets Market Trends & SQL Content Specialist, June 11, 2025, in Language Models. If you work in a data-related field, you should keep your skills up to date. Data scientists use different tools for tasks like data visualization, data modeling, and even warehouse systems.
Welcome to the world of databases, where the choice between SQL (Structured Query Language) and NoSQL (Not Only SQL) databases can be a significant decision. In this blog, we’ll explore the defining traits, benefits, use cases, and key factors to consider when choosing between SQL and NoSQL databases.
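As a minimal, hedged illustration of that choice, the sketch below stores the same customer record twice: once as a row in a relational (SQL) table and once as the kind of JSON document a NoSQL document store would hold. The table, field names, and values are made up for the example.

```python
import json
import sqlite3

# Relational (SQL) style: a fixed schema, queried with SQL.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT, city TEXT)")
conn.execute("INSERT INTO customers (id, name, city) VALUES (?, ?, ?)", (1, "Ada", "London"))
row = conn.execute("SELECT name, city FROM customers WHERE id = ?", (1,)).fetchone()
print("SQL row:", row)

# Document (NoSQL) style: a schema-flexible JSON document; nested fields are fine.
doc = {"_id": 1, "name": "Ada", "city": "London", "orders": [{"sku": "X1", "qty": 2}]}
print("Document:", json.dumps(doc))
```

The trade-off in miniature: the relational version enforces structure and supports joins and constraints, while the document version bends easily to nested or evolving data.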
It offers full BI-stack automation, from source to data warehouse through to frontend. It supports a holistic data model, allowing for rapid prototyping of various models, and it works with a wide range of data warehouses, analytical databases, data lakes, frontends, and pipelines/ETL. It takes a mixed approach to DV 2.0.
In today’s digital world, businesses must make data-driven decisions to manage huge sets of information, so databases are important for strategic data handling and enhanced operational efficiency. This blog delves into a detailed comparison between the two data management techniques.
Text-to-SQL empowers people to explore data and draw insights using natural language, without requiring specialized database knowledge. Amazon Web Services (AWS) has helped many customers connect this text-to-SQL capability with their own data, which means more employees can generate insights.
New big data architectures and, above all, data sharing concepts such as Data Mesh are ideal for creating a common database for many data products and applications. The Event Log Data Model for Process Mining: process mining as an analytical system can very well be imagined as an iceberg.
So why use IaC for cloud data infrastructures? It ensures that the data models and queries developed by data professionals are consistent with the underlying infrastructure. Enhanced security and compliance: data warehouses often store sensitive information, making security a paramount concern.
In this blog post, we will be discussing 7 tips that will help you become a successful data engineer and take your career to the next level. Learn SQL: As a data engineer, you will be working with large amounts of data, and SQL is the most commonly used language for interacting with databases.
Reading Larry Burns’ “Data Model Storytelling” (TechnicsPub.com, 2021) was a really good experience for a guy like me (i.e., someone who thinks that data models are narratives). The post Tales of Data Modelers appeared first on DATAVERSITY.
Common wisdom has it that we humans can only focus on three things at a time. So, I had to cut down my January 2021 list of things of importance in data modeling in this new, fine year (I hope)! The post 2021: Three Game-Changing Data Modeling Perspectives appeared first on DATAVERSITY.
In this post, we provide an overview of the Meta Llama 3 models available on AWS at the time of writing, and share best practices on developing Text-to-SQL use cases using Meta Llama 3 models. Meta Llama 3’s capabilities enhance accuracy and efficiency in understanding and generating SQL queries from natural language inputs.
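As a rough sketch of how a text-to-SQL call is typically structured (not the AWS reference implementation), the snippet below combines a table schema with the user's question into a prompt and hands it to a placeholder `call_model` function; in practice that function would invoke a hosted model such as Meta Llama 3 through your provider's SDK. The schema, question, and function names are all illustrative.

```python
# Hedged text-to-SQL sketch; `call_model` stands in for a real LLM client call.
SCHEMA = "CREATE TABLE sales (id INTEGER, region TEXT, amount REAL, sold_at DATE);"

def call_model(prompt: str) -> str:
    # Placeholder: a real system would send this prompt to an LLM
    # (e.g. Meta Llama 3) via the provider's SDK and return the response text.
    raise NotImplementedError

def generate_sql(question: str) -> str:
    # Ground the model in the schema so the generated SQL references real columns.
    prompt = (
        "You are a SQL assistant. Given this schema:\n"
        f"{SCHEMA}\n"
        f"Write a single SQL query that answers: {question}\n"
        "Return only the SQL."
    )
    return call_model(prompt)

# Example (would call the model): generate_sql("What were total sales by region last month?")
```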
Data is an essential component of any business, and it is the role of a data analyst to make sense of it all. Power BI is a powerful data visualization tool that helps them turn raw data into meaningful insights and actionable decisions. Check out this course and learn Power BI today!
Tabular data is data in a typical table: columns and rows with a well-defined structure, as in Excel or a SQL database. It is the most common data form in many use cases. With the power of an LLM, we will learn how to explore the data and perform data modeling.
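One common pattern, sketched below with made-up column names, is to summarize the table with pandas first and hand only the compact summary to the LLM instead of the raw rows.

```python
import pandas as pd

# Hypothetical tabular data; in practice this would come from Excel or a SQL query.
df = pd.DataFrame({
    "region": ["north", "south", "north", "east"],
    "amount": [120.0, 80.5, 200.0, 95.0],
})

# Compact profile that could be pasted into an LLM prompt instead of raw rows.
summary = {
    "columns": {col: str(dtype) for col, dtype in df.dtypes.items()},
    "row_count": len(df),
    "numeric_stats": df.describe().to_dict(),
}
print(summary)
```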
Data is driving most business decisions, and data modeling tools play a crucial role in developing and maintaining information systems. Data modeling involves creating a conceptual representation of data and its relationships.
However, in Power BI Service, we can only refresh the entire semantic model, as there is no out-of-the-box solution for refreshing a single table. In this blog, we will explain how to refresh a single table in Power BI Service using XMLA endpoints and SQL Server Management Studio (SSMS).
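For reference, a single-table refresh over the XMLA endpoint is usually expressed as a TMSL `refresh` command run from an XMLA query window in SSMS. The sketch below only assembles such a command as JSON from Python; the database and table names are placeholders.

```python
import json

# TMSL "refresh" command targeting one table of a semantic model.
# "SalesModel" and "FactSales" are placeholder names; the resulting JSON would be
# executed against the XMLA endpoint (e.g. from an XMLA query window in SSMS).
refresh_command = {
    "refresh": {
        "type": "full",
        "objects": [
            {"database": "SalesModel", "table": "FactSales"}
        ],
    }
}
print(json.dumps(refresh_command, indent=2))
```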
Sigma Computing , a cloud-based analytics platform, helps data analysts and business professionals maximize their data with collaborative and scalable analytics. One of Sigma’s key features is its support for custom SQL queries and CSV file uploads. These tools allow users to handle more advanced data tasks and analyses.
However, to fully harness the potential of a data lake, effective data modeling methodologies and processes are crucial. Data modeling plays a pivotal role in defining the structure, relationships, and semantics of data within a data lake, and it helps ensure consistency of data throughout the lake.
As Indian companies across industries increasingly embrace data-driven decision-making, artificial intelligence (AI), and automation, the demand for skilled data scientists continues to surge. Key skills include validation techniques that ensure models perform well on unseen data, and data manipulation libraries such as Pandas, NumPy, and dplyr.
I’m not going to go into huge detail on this, as I assume you already follow AI and LLMs if you are reading this, but in a nutshell, RAG is the process of feeding external data into an LLM alongside prompts so it has all of the information it needs to make decisions. What is GraphRAG? Why use graphs, and what are they?
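As a toy illustration of that idea (not GraphRAG itself), the sketch below retrieves the most relevant snippets by simple word overlap and prepends them to the prompt; the documents, the scoring, and the `ask_llm` stub are all stand-ins for real components.

```python
# Minimal RAG sketch: retrieve relevant text, then pass it to the LLM with the question.
DOCS = [
    "Invoices are due 30 days after issue.",
    "Refunds are processed within 5 business days.",
    "Support is available Monday to Friday.",
]

def retrieve(question: str, k: int = 2) -> list[str]:
    # Naive relevance score: count of shared words (a real system would use embeddings).
    q_words = set(question.lower().split())
    scored = sorted(DOCS, key=lambda d: len(q_words & set(d.lower().split())), reverse=True)
    return scored[:k]

def ask_llm(prompt: str) -> str:
    raise NotImplementedError  # stand-in for a real LLM call

def rag_answer(question: str) -> str:
    context = "\n".join(retrieve(question))
    return ask_llm(f"Use this context to answer.\nContext:\n{context}\nQuestion: {question}")
```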
Summary: Data engineering tools streamline data collection, storage, and processing. Tools like Python, SQL, Apache Spark, and Snowflake help engineers automate workflows and improve efficiency. Learning these tools is crucial for building scalable data pipelines.
Data exploration and model development were conducted using well-known machine learning (ML) tools such as Jupyter or Apache Zeppelin notebooks. Apache Hive was used to provide a tabular interface to data stored in HDFS and to integrate with Apache Spark SQL, while HBase was employed to offer real-time key-based access to data.
What if you could automatically shard your PostgreSQL database across any number of servers and get industry-leading performance at scale without any special data modeling steps? In this blog post, you’ll get a high-level overview of schema-based sharding and other new Citus 12 features, starting with what schema-based sharding is.
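Very roughly, and assuming a running Citus 12 cluster plus the `psycopg2` driver, schema-based sharding means that once the feature is switched on, each newly created schema becomes its own shard group, so all of a tenant's tables are co-located. The connection string and schema/table names below are placeholders, not a definitive setup.

```python
import psycopg2  # assumes a reachable Citus 12 coordinator

conn = psycopg2.connect("postgresql://user:pass@coordinator:5432/app")  # placeholder DSN
conn.autocommit = True
cur = conn.cursor()

# With schema-based sharding enabled, each new schema becomes a distributed schema
# whose tables live together on one node; no distribution column is needed.
cur.execute("SET citus.enable_schema_based_sharding TO on;")
cur.execute("CREATE SCHEMA tenant_42;")
cur.execute("CREATE TABLE tenant_42.orders (id bigserial PRIMARY KEY, total numeric);")
```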
However, to harness the full potential of Snowflake’s performance capabilities, it is essential to adopt strategies tailored explicitly for data vault modeling. Because of data vault’s modeling structure, transformation queries for moving data between these layers can become exceedingly complex.
Sigma Computing is a powerful data modeling and analysis platform designed to leverage the power of modern cloud technology. Once connected to Snowflake, Sigma uses machine-generated SQL to produce optimal results. Check out this blog to master the fundamentals.
The new ISO 39075 Graph Query Language standard is set to hit the data streets in late 2023 (?). If graph databases are standardized soon, what will happen to SQL? It will very likely stay around for a long time, not simply because legacy SQL has tremendous inertia, but because relational database paradigms […].
Business intelligence is the process of converting raw data into relevant and practical knowledge to help evaluate the performance of businesses, discover trends, and make well-informed choices. Data gathering, data integration, data modeling, analysis of information, and data visualization are all part of business intelligence.
The June 2021 release of Power BI Desktop introduced custom SQL queries to Snowflake in DirectQuery mode, further enhancing the connection capabilities between the two platforms.
Summary: Business Intelligence Analysts transform raw data into actionable insights. They use tools and techniques to analyse data, create reports, and support strategic decisions. Key skills include SQL, data visualization, and business acumen. Introduction We are living in an era defined by data.
In this blog, our focus will be on exploring the data lifecycle along with several Design Patterns, delving into their benefits and constraints. Data architects can leverage these patterns as starting points or reference models when designing and implementing data vault architectures.
Best 8 data version control tools for 2023 (Source: DagsHub). With business needs changing constantly and datasets growing in size and structure, it becomes challenging to efficiently keep track of the changes made to the data, which leads to unfortunate scenarios such as inconsistencies and errors.
In this blog post, I'll describe my analysis of Tableau's history of driving analytics innovation; in particular, I've identified six key innovation vectors by reflecting on the top innovations across Tableau releases. Query allowed customers from a broad range of industries to connect to clean, useful data found in SQL and Cube databases.
This blog explores PostgreSQL vs MySQL, two popular RDBMS solutions, highlighting their differences to help you choose the right one for your needs. Both are open source and use Structured Query Language (SQL) to manage and manipulate data. PostgreSQL's architecture is highly flexible, supporting many data models and workloads.
Hosted on Amazon ECS with tasks run on Fargate, this platform streamlines the end-to-end ML workflow, from data ingestion to model deployment. This blog post delves into the details of this MLOps platform, exploring how the integration of these tools facilitates a more efficient and scalable approach to managing ML projects.
And you should have experience working with big data platforms such as Hadoop or Apache Spark. Additionally, data science requires experience in SQL database coding and an ability to work with unstructured data of various types, such as video, audio, pictures and text.
Power BI Datamarts provides a low/no code experience directly within Power BI Service that allows developers to ingest data from disparate sources, perform ETL tasks with Power Query, and load data into a fully managed Azure SQL database. Note: At the time of writing this blog, Power BI Datamarts is in preview.
Accordingly, one of the roles most in demand, and one you might be interested in, is that of Azure Data Engineer. The following blog will help you learn about the Azure Data Engineer job description, salary, and certification courses, along with expectations such as experience with at least one end-to-end Azure data lake project.
Data engineers are essential professionals responsible for designing, constructing, and maintaining an organization’s data infrastructure. They create data pipelines, ETL processes, and databases to facilitate smooth data flow and storage, and they use data visualization tools such as Matplotlib, Seaborn, and Tableau.
All of which have a specific role in collecting, storing, processing, and analyzing data. This blog will home in on the new collaboration, how to implement it in your workbooks, why Sigma users should be excited about the feature, and how to use SQL-centric transformations to model data for deployment.
In this blog, we will cover what tables and pivot tables are, the advantages and limitations of each, and the factors to consider when choosing which element to use. At the end of this blog, you will have a firm understanding of both elements and how to utilize each in your day-to-day data exploration.
To address these complexities, a powerful data warehousing solution like the Snowflake Data Cloud, coupled with an effective data modeling approach such as the Data Vault architecture, can be a winning combination. What is a Data Vault architecture?
The answer probably depends more on the complexity of your queries than on the connectedness of your data. Relational databases (with recursive SQL queries), document stores, key-value stores, and the like can all hold connected data. Multi-model databases combine graphs with two other NoSQL data models: document and key-value stores.
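To make the "recursive SQL queries" point concrete, here is a small self-contained example (SQLite via Python, with made-up employee data) that walks a manager/report hierarchy with a recursive CTE, the kind of traversal people often reach for a graph database to do.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE employees (id INTEGER PRIMARY KEY, name TEXT, manager_id INTEGER);
INSERT INTO employees VALUES
    (1, 'Ada', NULL), (2, 'Grace', 1), (3, 'Alan', 2), (4, 'Edsger', 2);
""")

# Recursive CTE: start at the top of the org chart and follow manager_id links,
# recording each employee's depth in the reporting tree.
rows = conn.execute("""
WITH RECURSIVE chain(id, name, depth) AS (
    SELECT id, name, 0 FROM employees WHERE manager_id IS NULL
    UNION ALL
    SELECT e.id, e.name, c.depth + 1
    FROM employees e JOIN chain c ON e.manager_id = c.id
)
SELECT name, depth FROM chain ORDER BY depth, name;
""").fetchall()
print(rows)  # [('Ada', 0), ('Grace', 1), ('Alan', 2), ('Edsger', 2)]
```

Queries like this stay readable for a few hops; when traversals become long or highly branched, graph-native query languages tend to express them more naturally.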
Summary: This blog delves into hierarchies in dimensional modelling, highlighting their significance in data organisation and analysis. Real-world examples illustrate their application, while tools and technologies facilitate effective hierarchical data management in various industries.
Blog - Everest Group. Requirements gathering: ChatGPT can significantly simplify the requirements gathering phase by building quick prototypes of complex applications. GPT-4 Data Pipelines: Transform JSON to SQL Schema Instantly. Blockstream's public Bitcoin API: the data would be interesting to analyze.