Remove DataOps Remove Python Remove SQL
article thumbnail

Authoring custom transformations in Amazon SageMaker Data Wrangler using NLTK and SciPy

AWS Machine Learning Blog

For scenarios where you need to add your own custom scripts for data transformations, you can write your transformation logic in Pandas, PySpark, PySpark SQL. With the Data Wrangler custom transform capability, you can write your transformation logic in Pandas, PySpark, PySpark SQL. Choose Python (Pandas).

AWS 101
article thumbnail

How to use Snowflake Zero Copy Cloning in your CI/CD Pipelines

phData

There are many frameworks for testing software, but the right way to test the data and SQL scripts that change data are less obvious. This is a simple example of how SQL that compiles and runs perfectly might fail when trying to migrate it to a higher environment like production. Run the create clone SQL statement.

professionals

Sign Up for our Newsletter

This site is protected by reCAPTCHA and the Google Privacy Policy and Terms of Service apply.

article thumbnail

phData Awarded dbt Labs’ 2023 Partner of the Year

phData

Data Vault Modeling in Snowflake Using dbt Vault Reducing the Time to Value of your dbt Deployment with SlimCI How to use dbt with Snowpark Python to Implement Sentiment Analysis How to Talk to Your Data with ChatGPT, Snowflake, & dbt Customer Success We genuinely love watching our clients succeed.

DataOps 52
article thumbnail

Ask HN: Who is hiring? (July 2025)

Hacker News

Good at Go, Kubernetes (Understanding how to manage stateful services in a multi-cloud environment) We have a Python service in our Recommendation pipeline, so some ML/Data Science knowledge would be good. Queries everywhere – SQL lives in Slack snippets, BI folders, dusty Git repos, and copy-pasted Notion pages.

Python 78