DEV Community

Data Science

Data Science allows us to extract meaning from and interpret data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
Deploying Label Studio Open-Source Data Labeling Platform on Ubuntu 24.04

Deploying Label Studio Open-Source Data Labeling Platform on Ubuntu 24.04

Comments
2 min read
Can AI Boost AI Itself? The Recursive Flywheel of Machine Learning

Can AI Boost AI Itself? The Recursive Flywheel of Machine Learning

Comments
3 min read
How a Neural Network Actually Learns — Training, in Plain Words

How a Neural Network Actually Learns — Training, in Plain Words

Comments
2 min read
I Built a Python Library for Synthetic Dataset Generation and Missing Value Simulation

I Built a Python Library for Synthetic Dataset Generation and Missing Value Simulation

Comments
2 min read
When Polymarket says 70%, does it happen 70% of the time? I checked against 19.4M price snapshots.

When Polymarket says 70%, does it happen 70% of the time? I checked against 19.4M price snapshots.

Comments
4 min read
My Journey Learning Excel for Data Analysis

My Journey Learning Excel for Data Analysis

Comments
2 min read
Creating HIPAA-Safe Synthetic Patient Data for Healthcare App Testing

Creating HIPAA-Safe Synthetic Patient Data for Healthcare App Testing

Comments
4 min read
Stop Shipping ML Models With Bare Floats: A Deep Dive Into Statistically Rigorous Model Evaluation

Stop Shipping ML Models With Bare Floats: A Deep Dive Into Statistically Rigorous Model Evaluation

Comments 1
4 min read
k-Nearest Neighbors From Scratch: the ML Algorithm With No Training Step

k-Nearest Neighbors From Scratch: the ML Algorithm With No Training Step

Comments
4 min read
Why we score company quality the way we do (and why REITs and banks get different rules)

Why we score company quality the way we do (and why REITs and banks get different rules)

Comments
4 min read
From Variant CSV to Review-Ready Report: A Python Workflow With Docker and GitHub Actions

From Variant CSV to Review-Ready Report: A Python Workflow With Docker and GitHub Actions

Comments
2 min read
Can AI Reason From Marker Genes? Building a Single-Cell Benchmark From PBMC3k

Can AI Reason From Marker Genes? Building a Single-Cell Benchmark From PBMC3k

Comments
3 min read
Power analysis for LLM evals: how big does your eval set need to be to catch a 5% regression?

Power analysis for LLM evals: how big does your eval set need to be to catch a 5% regression?

Comments 1
2 min read
Using Excel for Data Analysis – Week 2

Using Excel for Data Analysis – Week 2

Comments
2 min read
The Disk-Level Architecture of OLTP vs. OLAP

The Disk-Level Architecture of OLTP vs. OLAP

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.