Sylvain Lesage PRO

severo

AI & ML interests

Dataviz freelance developer. Part-time 🤗 Hugging Face (dataset viewer).



severo's activity

upvoted an article 29 days ago
upvoted 2 articles about 1 month ago

XetHub is joining Hugging Face!

76

ArabicWeb24: Creating a High Quality Arabic Web-only Pre-training Dataset

By MayFarhat
9
upvoted an article about 2 months ago
upvoted 4 articles 2 months ago

How we leveraged distilabel to create an Argilla 2.0 Chatbot

30

Enhancing Search Capabilities for Non-English Datasets in the Dataset Viewer

By asoria
4

Experimenting with Automatic PII Detection on the Hub using Presidio

23

Announcing New Dataset Search Features

22
upvoted 2 articles 3 months ago

How to directly access 150k+ Hugging Face Datasets with DuckDB and query using GPT-4o

By chilijung
10
upvoted 3 articles 4 months ago

FiftyOne Computer Vision Datasets Come to the Hugging Face Hub

By jamarks
12

Wikipedia's Treasure Trove: Advancing Machine Learning with Diverse Data

By frimelle
12
upvoted 2 articles 4 months ago

Synthetic data: save money, time and carbon with open source

45
upvoted 5 articles 5 months ago

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

28

Post-OCR-Correction: 1 billion words dataset of automated OCR correction by LLM

13

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

21

Cosmopedia: how to create large-scale synthetic data for pre-training Large Language Models

58

Text2SQL using Hugging Face Dataset Viewer API and Motherduck DuckDB-NSQL-7B

23