Scaling de-duplication in WorldCat: Balancing AI innovation with cataloging care | OCLC

Flipboard

At OCLC, we’ve invested resources into a hybrid approach, leveraging AI to process vast amounts of data while ensuring catalogers and OCLC experts remain at the center of decision-making. From paper slips to machine learning: Long before I joined OCLC, I worked in bibliographic data quality when de-duplication was entirely manual.

Ask HN: Who wants to be hired? (July 2025)

Hacker News

Also, I have found two 0days, with CVEs under my name and a company research blog post to go along with them. I'm also happy to work on other stuff; a recent blog post of mine [2] did fairly well on HN a few months back, which would give you a great idea of how I work. [1] Worked at IBM as a programmer too.

Evaluating RAG Pipelines

The MLOps Blog

in 2020, RAG has become the go-to technique for incorporating external knowledge into the LLM pipeline. The mitigation strategies for poor retrieval include the following: Ensuring data quality in the knowledge base: The retriever's quality is constrained by the quality of the documents in the knowledge base.

Architect a mature generative AI foundation on AWS

Flipboard

Data quality is the responsibility of the consuming applications or data producers. Governance: The two key areas of governance are model and data. Model governance: Monitor models for performance, robustness, and fairness. He was the legal licensee in his ancient (AD 1468) English countryside village pub until early 2020.

Best of 2022: Top 5 Insurance Blog Posts

Precisely

Accurate, consistent, and contextualized data enables faster, more confident decisions when it comes to your underwriting, claims processing, risk assessments, and beyond. Let’s explore the impact of data in this industry as we count down the top 5 insurance blog posts of 2022. #5

Snowcase Study: How Data Governance Gives Texas Mutual Insurance Company a Competitive Edge

Alation

Much of his work focuses on democratising data and breaking down data silos to drive better business outcomes. In this blog, Chris shows how Snowflake and Alation together accelerate data culture. He shows how Texas Mutual Insurance Company has embraced data governance to build trust in data.

Why Effective Data Management Is Key in a Connected World

Dataversity

2020 saw a rapid acceleration in digital transformation, and this trend shows no sign of slowing down in 2021. The smart factory and plant now incorporate an array of connected technologies, all generating a vast volume of data. As a result, data will continue its exponential growth, […].