Yacine Jernite

yjernite

AI & ML interests

Technical, community, and regulatory tools of AI governance @HuggingFace

yjernite's activity

upvoted an article 13 days ago

Getty Images Brings High-Quality, Commercially Safe Dataset to Hugging Face

14
upvoted an article 16 days ago

The Environmental Impacts of AI -- Primer

By sasha
23
upvoted an article 24 days ago

The 5 Most Under-Rated Tools on Hugging Face

74
upvoted an article about 2 months ago
upvoted 4 articles 2 months ago

How NuminaMath Won the 1st AIMO Progress Prize

92

Docmatix - a huge dataset for Document Visual Question Answering

63

Structured Harm Reporting in AI: New Research Paper at AIES and DEFCON event!

By evijit
3
upvoted an article 2 months ago
upvoted an article 2 months ago

SmolLM - blazingly fast and remarkably powerful

242
upvoted 2 articles 2 months ago

Experimenting with Automatic PII Detection on the Hub using Presidio

23

Announcing New Dataset Search Features

22
upvoted 10 articles 3 months ago

EU Training Data Transparency: A Proposal for a Sufficiently Detailed Summary 📑📚🖼️🇪🇺

By yjernite
8

BM25 for Python: Achieving high performance while simplifying dependencies with *BM25S*⚡

By xhluca
34

AI Policy @🤗: Open ML Considerations in the EU AI Act

2

📚 Training Data Transparency in AI: Tools, Trends, and Policy Recommendations 🗳️

By yjernite
1

Ethics and Society Newsletter #6: Building Better AI: The Importance of Data Quality

30

Introducing Synthetic Data Workshop: Your Gateway to Easy Synthetic Dataset Creation

12

Data Is Better Together: A Look Back and Forward

17

Open-source embeddings and LLMs outperform Gemini and OpenAI for Web Navigation while being faster and cheaper

By dhuynh95
5

BigCodeBench: Benchmarking Large Language Models on Solving Practical and Challenging Programming Tasks

35

Unveiling CIVICS: A New Dataset for Examining Cultural Values in Language Models

By giadap
7
upvoted an article 3 months ago
upvoted 2 articles 3 months ago

Reports on the Hub: A First Look at Self-governance in Open Source AI Development

By frimelle
7

How to build an interactive HF Space to visualize an Image Dataset

3
upvoted an article 3 months ago

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted an article 4 months ago

Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data

By frimelle
12
upvoted 3 articles 4 months ago

Space secrets security update

50
upvoted an article 4 months ago
upvoted 2 articles 5 months ago

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

28

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

21
upvoted 3 articles 5 months ago

Vision Language Models Explained

175

Policy Questions Blog 1: AI Data Transparency Remarks for NAIAC Panel 📚🔍⚖️

By yjernite
2

Public Policy at Hugging Face

19