krishnapraveen (krishna praveen)

upvoted a collection 17 days ago

CogVideo

Collection

7 items • Updated 1 day ago • 18

upvoted 3 papers about 1 month ago

upvoted a collection about 2 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Meta Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Aug 2 • 570

upvoted a paper about 2 months ago

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Paper • 2407.15762 • Published Jul 22 • 8

upvoted an article 2 months ago

Article

Docmatix - a huge dataset for Document Visual Question Answering

Jul 18

• 63

upvoted 3 collections 2 months ago

xLAM models

Collection

xLAM: A Family of Large Action Models to Empower AI Agent Systems • 9 items • Updated 11 days ago • 40

LLaVa-Interleave

Collection

LLaVa models that extends the model capabilities to Multi-image, Multi-frame (videos), Multi-patch (single-image) scenarios. • 3 items • Updated Jul 10 • 14

Navarasa 2.0 Models

Collection

Collection of models Navarasa 2.0 Models finetuned with Gemma on 15 Indian languages • 5 items • Updated Mar 18 • 12

upvoted a paper 2 months ago

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Paper • 2407.11398 • Published Jul 16 • 8

upvoted an article 2 months ago

Article

Faster fine-tuning using TRL & Unsloth

Jan 10

• 35

upvoted a collection 2 months ago

Optimizing diffusion models

Collection

Provides a list of papers focusing on optimizing T2I diffusion models, targeting fewer timesteps, architecture optimization, and more. • 21 items • Updated 29 days ago • 16

upvoted 3 collections 3 months ago

Model Merging

Collection

Model Merging is a very popular technique nowadays in LLM. Here is a chronological list of papers on the space that will help you get started with it! • 30 items • Updated Jun 12 • 211

Meta Llama 3

Collection

This collection hosts the transformers and original repos of the Meta Llama 3 and Llama Guard 2 releases • 5 items • Updated Aug 2 • 673

Florence

Collection

9 items • Updated Jul 11 • 153

upvoted 8 papers 7 months ago

Design2Code: How Far Are We From Automating Front-End Engineering?

Paper • 2403.03163 • Published Mar 5 • 93

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

Paper • 2402.17764 • Published Feb 27 • 590

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Paper • 2401.13919 • Published Jan 25 • 23

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

Paper • 2401.13795 • Published Jan 24 • 64

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

Paper • 2401.15024 • Published Jan 26 • 67

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

Paper • 2401.15947 • Published Jan 29 • 48

OLMo: Accelerating the Science of Language Models

Paper • 2402.00838 • Published Feb 1 • 78

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

Paper • 2402.05930 • Published Feb 8 • 39

upvoted a collection 8 months ago

LLaVA-1.6

Collection

A collection of LLaVA-1.6 checkpoints • 4 items • Updated Jan 31 • 64

krishna praveen

AI & ML interests

Organizations

krishnapraveen's activity

CogVideo

MeshFormer: High-Quality Mesh Generation with 3D-Guided Reconstruction Model

GPUDrive: Data-driven, multi-agent driving simulation at 1 million FPS

MeshAnything V2: Artist-Created Mesh Generation With Adjacent Mesh Tokenization

Llama 3.1

Conditioned Language Policy: A General Framework for Steerable Multi-Objective Finetuning

Docmatix - a huge dataset for Document Visual Question Answering

xLAM models

LLaVa-Interleave

Navarasa 2.0 Models

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Faster fine-tuning using TRL & Unsloth

Optimizing diffusion models

Model Merging

Meta Llama 3

Florence

Design2Code: How Far Are We From Automating Front-End Engineering?

The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits

WebVoyager: Building an End-to-End Web Agent with Large Multimodal Models

Diffuse to Choose: Enriching Image Conditioned Inpainting in Latent Diffusion Models for Virtual Try-All

SliceGPT: Compress Large Language Models by Deleting Rows and Columns

MoE-LLaVA: Mixture of Experts for Large Vision-Language Models

OLMo: Accelerating the Science of Language Models

WebLINX: Real-World Website Navigation with Multi-Turn Dialogue

LLaVA-1.6