view article Article All LLMs Write Great Code, But Some Make (A Lot) Fewer Mistakes By onekq β’ 7 days ago β’ 3
π Awesome 3D AIGC Demos Collection Representative 3D AIGC Demos. #Image-to-3D #Text-to-3D β’ 22 items β’ Updated 30 days ago β’ 4
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery Paper β’ 2408.06292 β’ Published Aug 12 β’ 114
Building and better understanding vision-language models: insights and future directions Paper β’ 2408.12637 β’ Published 28 days ago β’ 109
Build-A-Scene: Interactive 3D Layout Control for Diffusion-Based Image Generation Paper β’ 2408.14819 β’ Published 24 days ago β’ 18
MUMU: Bootstrapping Multimodal Image Generation from Text-to-Image Data Paper β’ 2406.18790 β’ Published Jun 26 β’ 33
Text-to-Image History Collection How Text-to-Image evolved on HF and inspired the Community β’ 50 items β’ Updated Aug 1 β’ 11
view article Article Sentiment Classification with Fully Homomorphic Encryption using Concrete ML Nov 17, 2022 β’ 2
view article Article CyberSecEval 2 - A Comprehensive Evaluation Framework for Cybersecurity Risks and Capabilities of Large Language Models May 24 β’ 21
view article Article Powerful ASR + diarization + speculative decoding with Hugging Face Inference Endpoints May 1 β’ 61
MaPa: Text-driven Photorealistic Material Painting for 3D Shapes Paper β’ 2404.17569 β’ Published Apr 26 β’ 12
Edit Your Image! Collection Find all the trending and useful Gradio demos that you can use to edit your images. β’ 21 items β’ Updated Apr 26 β’ 23
β UI is a good thing π β Collection cool spaces with a cool UI, what could be better? β’ 5 items β’ Updated Jun 18 β’ 13
view article Article SVGDreamer: Text Guided Vector Graphics Generation with Diffusion Model By xingxm β’ Apr 19 β’ 5
HQ-Edit: A High-Quality Dataset for Instruction-based Image Editing Paper β’ 2404.09990 β’ Published Apr 15 β’ 12
view article Article Introducing Idefics2: A Powerful 8B Vision-Language Model for the community Apr 15 β’ 160
Idefics2 πΆ Collection Idefics2-8B is a foundation vision-language model. In this collection, you will find the models, datasets and demo related to its creation. β’ 11 items β’ Updated May 6 β’ 88
Multimodal Models Collection Multimodal models with leading performance. β’ 9 items β’ Updated 15 days ago β’ 11
HyperGraph Datasets Collection Collection of HyperGraph Datasets β’ 17 items β’ Updated Apr 4 β’ 7
RadSplat: Radiance Field-Informed Gaussian Splatting for Robust Real-Time Rendering with 900+ FPS Paper β’ 2403.13806 β’ Published Mar 20 β’ 18
Latent Consistency Model Demos Collection Latent Consistency Models for Stable Diffusion β’ 8 items β’ Updated Nov 12, 2023 β’ 25
VLMs for 3D reconstructions and their evaluation Collection List of papers to help with developing a model that reviews a photogrammetry scan and evaluates its quality β’ 11 items β’ Updated Dec 5, 2023 β’ 2
Biomedical NLP papers Collection Papers posted on @ArxivHealthcareNLP@sigmoid.social (Clinical, Healthcare & Biomedical NLP) β’ 150 items β’ Updated 2 days ago β’ 31
Leveraging Biomolecule and Natural Language through Multi-Modal Learning: A Survey Paper β’ 2403.01528 β’ Published Mar 3 β’ 1
TnT-LLM: Text Mining at Scale with Large Language Models Paper β’ 2403.12173 β’ Published Mar 18 β’ 19
LLM Leaderboard best models β€οΈβπ₯ Collection A daily uploaded list of models with best evaluations on the LLM leaderboard: β’ 264 items β’ Updated Jun 22 β’ 392
Pretrained Text-Generation Models Below 250M Parameters Collection Great candidates for fine-tuning targeting Transformers.js, ordered by number of parameters. β’ 8 items β’ Updated Aug 10 β’ 7
Soft Prompts Collection Ordered List of Resources to understand soft prompting while covering the basics of discrete prompting as well. β’ 4 items β’ Updated Mar 22 β’ 2
based Collection These language model checkpoints are trained at the 360M and 1.3Bn parameter scales for up to 50Bn tokens on the Pile corpus, for research purposes. β’ 14 items β’ Updated May 14 β’ 8
FiT: Flexible Vision Transformer for Diffusion Model Paper β’ 2402.12376 β’ Published Feb 19 β’ 48
ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs Paper β’ 2402.11753 β’ Published Feb 19 β’ 5
Zeroshot Classifiers Collection These are my current best zeroshot classifiers. Some of my older models are downloaded more often, but the models in this collection are newer/better. β’ 11 items β’ Updated Apr 3 β’ 103
OWL-series π¦ Collection Models and applications of OWL-ViT and OWLv2. β’ 13 items β’ Updated Mar 11 β’ 5
Music ControlNet: Multiple Time-varying Controls for Music Generation Paper β’ 2311.07069 β’ Published Nov 13, 2023 β’ 43
ChatAnything: Facetime Chat with LLM-Enhanced Personas Paper β’ 2311.06772 β’ Published Nov 12, 2023 β’ 34
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image Paper β’ 2310.17994 β’ Published Oct 27, 2023 β’ 8
LP-MusicCaps: LLM-Based Pseudo Music Captioning Paper β’ 2307.16372 β’ Published Jul 31, 2023 β’ 37
MagicBrush: A Manually Annotated Dataset for Instruction-Guided Image Editing Paper β’ 2306.10012 β’ Published Jun 16, 2023 β’ 35