Clem 🤗 PRO

clem

AI & ML interests

multi-modal, time-series, biology and chemistry

Organizations

clem's activity

posted an update 12 days ago
replied to their post 20 days ago
replied to Taylor658's post 20 days ago
replied to their post 20 days ago
posted an update 21 days ago
view post
Post
1530
Very cool to see more and more amazing startups like https://huggingface.co/PrunaAI relying on Hugging Face to get more visibility, distribution and usage!
·
posted an update 24 days ago
view post
Post
4109
Just crossed 200,000 free public AI datasets shared by the community on Hugging Face! Text, image, video, audio, time-series & many more... Thanks everyone!

http://hf.co/datasets
posted an update 27 days ago
posted an update 28 days ago
view post
Post
3579
This isn’t a goal of ours because we have plenty of money in the bank but quite excited to see that @huggingfaceis profitable these days, with 220 team members and most of our platform being free (like model hosting) and open-source for the community!

Especially noteworthy at a time when most AI startups wouldn’t survive a year or two without VC money. Yay!
·
replied to ybelkada's post 28 days ago
posted an update 28 days ago
replied to samjulien's post about 2 months ago
replied to 1aurent's post about 2 months ago
replied to fdaudens's post about 2 months ago
posted an update 2 months ago
view post
Post
2491
This is the week of small AI language models!
·
posted an update 3 months ago
view post
Post
5758
5,000 new repos (models, datasets, spaces) are created EVERY DAY on HF now. The community is amazing!
replied to louisbrulenaudet's post 3 months ago
view reply

very cool! feel free to create the HF for Legal org and share about it and we can amplify!

posted an update 3 months ago
replied to thomwolf's post 4 months ago
replied to lunarflu's post 4 months ago
posted an update 4 months ago
view post
Post
1541
I would pick @ylecun over @elonmuskceo every single day of the week.

Despite getting much less $$, recognition & visibility than entrepreneurs, the scientists who publish their groundbreaking research openly are the cornerstone of technological progress & massively contribute to making the world a better place!
  • 1 reply
·
replied to singhsidhukuldeep's post 4 months ago
replied to their post 4 months ago
view reply

any info on when it's going to be released though?

replied to their post 4 months ago
posted an update 4 months ago
view post
Post
1564
What are you excited about from Google I/O?
·
replied to Undi95's post 4 months ago
replied to HeshamHaroon's post 4 months ago
replied to singhsidhukuldeep's post 4 months ago
view reply

Interesting update! They can open-source GPT4 now haha

replied to danielhanchen's post 5 months ago
replied to fdaudens's post 5 months ago
replied to gsarti's post 5 months ago
posted an update 5 months ago
posted an update 5 months ago
view post
Post
2903
Already almost 1,000 llama3 model variations have been shared publicly on HF (many more in private use at companies): https://huggingface.co/models?p=5&sort=trending&search=llama3.

Everyone should fine-tune their own models for their use-cases, languages, industry, infra constraints,...

10,000 llama3 variants by the end of next week?
·
replied to visheratin's post 5 months ago
posted an update 5 months ago
view post
Post
2674
We noticed that all the open-source models and datasets from https://huggingface.co/WizardLM in their personal Hugging Face account & in the Microsoft Hugging Face organization (https://huggingface.co/microsoft) have been made private by the author, which will lead some demos to fail (these models were collectively downloaded over a hundred thousand times a month).

This is the explanation that @WizardLM communicated a few hours ago: https://huggingface.co/posts/WizardLM/329547800484476#661e0d17bca1a6038b60503e

We apologize for the inconvenience & are trying to get in touch with the author & Microsoft in order to try to find a good resolution for community members. Let us know if you have any questions!
  • 1 reply
·
posted an update 5 months ago
posted an update 6 months ago
view post
Post
2525
Introducing gretelai/synthetic_text_to_sql by https://huggingface.co/gretelai

It stands as the largest and most diverse synthetic Text-to-SQL dataset available to-date.

The dataset includes:

- 105,851 records partitioned into 100,000 train and 5,851 test records
~23M total tokens, including ~12M SQL tokens
- Coverage across 100 distinct domains/verticals
- Comprehensive array of SQL tasks: data definition, retrieval, manipulation, analytics & reporting
- Wide range of SQL complexity levels, including subqueries, single joins, multiple joins, aggregations, window functions, set operations
- Database context, including table and view create statements
- Natural language explanations of what the SQL query is doing
- Contextual tags to optimize model training

Blogpost: https://gretel.ai/blog/synthetic-text-to-sql-dataset
Dataset: gretelai/synthetic_text_to_sql
  • 1 reply
·
replied to Smooke's post 6 months ago
replied to julien-c's post 6 months ago
posted an update 7 months ago
view post
Post
Terribly excited about open-source + on-device AI these days! Great to see @qualcomm release 80+ models optimized and curated for their devices and chips on HF: https://huggingface.co/qualcomm

  • 1 reply
·
replied to dvilasuero's post 7 months ago
view reply

Unpopular opinion: this is the most impactful release of the day (because open)!

replied to DmitryRyumin's post 7 months ago
view reply

would be cool to have some integration with the HF hub

replied to trisfromgoogle's post 7 months ago
replied to stas's post 7 months ago
replied to victor's post 7 months ago
replied to manu's post 8 months ago
replied to dvilasuero's post 8 months ago
replied to clefourrier's post 8 months ago
replied to julien-c's post 8 months ago
posted an update 8 months ago
posted an update 8 months ago
view post
Post
With the Google announcement last week, I think we're now officially the only AI startup out there who has commercial collaborations with all the major cloud providers (AWS, GCP, Azure) and hardware providers (Nvidia, AMD, Intel, Qualcomm,...), making our vision of being the independent and agnostic platform for all AI builders truer than ever!

Let's go!
posted an update 8 months ago
replied to jimfan's post 8 months ago
posted an update 8 months ago
replied to abidlabs's post 8 months ago
replied to Norod78's post 8 months ago
posted an update 8 months ago
view post
Post
Most upvoted papers of 2023 on HF. What do you think are going to be the most prominent research topics in AI for 2024 (also, don't forget to add your papers to the hub this year!).

From: hysts/daily-papers
  • 1 reply
·
replied to their post 8 months ago