The Great Data Closure: Why Databricks and Snowflake Are Hitting Their Ceiling
Acquisitions, venture, and an increasingly competitive landscape all point to a market ceiling
The post The Great Data Closure: Why Databricks and Snowflake Are Hitting Their Ceiling appeared first on Towards Data Science.
TDS Newsletter: Is It Time to Revisit RAG?
Let's make sense of the current state of retrieval-augmented generation
When Shapley Values Break: A Guide to Robust Model Explainability
Shapley values are one of the most common explainability methods, yet they can be misleading. Discover how to overcome these limitations to achieve better insights.
Do You Smell That? Hidden Technical Debt in AI Development
Why speed without standards creates fragile AI products
Why Human-Centered Data Analytics Matters More Than Ever
From optimizing metrics to designing meaning: putting people back into data-driven decisions
What Is a Knowledge Graph — and Why It Matters
How structured knowledge became healthcare’s quiet advantage
Glitches in the Attention Matrix
A history of Transformer artifacts and the latest research on how to fix them
Why Your ML Model Works in Training But Fails in Production
Hard lessons from building production ML systems where data leaks, defaults lie, populations shift, and time does not behave the way we expect.
How to Leverage Slash Commands to Code Effectively
Learn how I use slash commands to be a more efficient engineer
Data Science Spotlight: Selected Problems from Advent of Code 2025
Hands-on walkthroughs of problems and solution approaches that power real‑world data science use cases
Mastering Non-Linear Data: A Guide to Scikit-Learn’s SplineTransformer
Forget stiff lines and wild polynomials. Discover why splines are the "Goldilocks" of feature engineering, balancing flexibility and discipline for non-linear data with Scikit-Learn’s SplineTransformer.
TDS Newsletter: December Must-Reads on GraphRAG, Data Contracts, and More
Don't miss our most popular articles of the previous month
How to Improve the Performance of Visual Anomaly Detection Models
Apply the best methods from academia to get the most out of practical applications
HNSW at Scale: Why Your RAG System Gets Worse as the Vector Database Grows
How approximate vector search silently degrades recall, and what to do about it
Why Supply Chain is the Best Domain for Data Scientists in 2026 (And How to Learn It)
My take after 10 years in Supply Chain on why this can be an excellent playground for data scientists who want to see their skills valued.
The Best Data Scientists Are Always Learning
Part 2: Avoiding burnout, learning strategies and the superpower of solitude
GliNER2: Extracting Structured Information from Text
From unstructured text to structured Knowledge Graphs