Patel

#9 opened 5 months ago by

CouchCommander

New activity in google/gemma-7b-it 8 days ago

Could not find GemmaForCausalLM neither in <module 'transformers.models.gemma'

#44 opened 7 months ago by

chenwei1984

New activity in google/gemma-scope-9b-pt-res 15 days ago

Layer 13 saes raising "zipfile.BadZipFile: File is not a zip file"

#5 opened 18 days ago by

MrGonao

New activity in google/gemma-2-2b-it 19 days ago

running it on cpu using pretrained

#35 opened 26 days ago by

himanshuyadav62

New activity in google/paligemma-3b-pt-224 20 days ago

Using `low_cpu_mem_usage=True` or a `device_map` requires Accelerate: `pip install accelerate`

#19 opened 3 months ago by

mdeniz1

New activity in google/gemma-2-9b-it 20 days ago

Why does this model have powerful text generation capabilities for various countries, and the results are very good, most likely in English?

#24 opened 3 months ago by

windkkk

New activity in google/gemma-2b-it 27 days ago

Issue with loading 4-bit quantized model on Apple M1 pro

#45 opened 4 months ago by

waxsum8

New activity in google/gemma-2-2b about 1 month ago

Weird output based on example code

#18 opened about 2 months ago by

mark100

New activity in google/gemma-2-9b-it about 1 month ago

not work

#12 opened 3 months ago by

sdyy

New activity in google/gemma-2b-it about 1 month ago

Request: access to gated repo

#51 opened about 1 month ago by

iamamofa

New activity in google/gemma-2-2b-it about 1 month ago

RuntimeError: probability tensor contains either `inf`, `nan` or element < 0

#18 opened about 2 months ago by

lcahill

Running Gemma-2b with Torch 2.0.1?

#28 opened about 2 months ago by

insdaguirre

New activity in google/gemma-7b about 1 month ago

Getting EnvironmentError

#107 opened about 2 months ago by

Ninad0109

New activity in google/gemma-2-2b about 1 month ago

Problem with Lora finetuning, Out of memory

#13 opened about 2 months ago by

zokica

New activity in google/gemma-2-27b-it about 1 month ago

Can multiple NVIDIA T4 GPUs be used to deploy Gemma2-27B-IT?

#36 opened about 1 month ago by

armanZhou

New activity in google/gemma-2-9b about 1 month ago

TypeError: arange() received an invalid combination of arguments

4

#12 opened 3 months ago by

darrenbudiman

New activity in google/gemma-2-9b-it about 1 month ago

system role

#15 opened 3 months ago by

wuriyanto

New activity in google/gemma-2-9b about 1 month ago

Model repeating information and "spitting out" random characters

8

#14 opened 3 months ago by

brazilianslib

New activity in google/gemma-7b-it about 1 month ago

add_special_tokens=False results in poor generation

#80 opened 6 months ago by

DMaksimov

New activity in google/gemma-2-9b-it about 1 month ago

nonsense response when bsz>1

#16 opened 3 months ago by

jinjieni

New activity in google/gemma-2b about 1 month ago

gemma 2b inference Endpoints error

4

#46 opened 6 months ago by

gawon16

New activity in google/gemma-7b-it about 1 month ago

Can several different prompts be handled together?

#77 opened 6 months ago by

WENJINLIU

New activity in google/gemma-2-9b-it about 2 months ago

Gemma-2 is a huge step up over previous Google OS models - short feedback

#22 opened 3 months ago by

Dampfinchen

New activity in google/gemma-2-27b-it about 2 months ago

What code was this trained on?

#18 opened 3 months ago by

grothetr

New activity in google/gemma-2-9b-it about 2 months ago

error of ATen\native\cuda\IndexKernel.cu

#14 opened 3 months ago by

koromatsu

New activity in google/gemma-2-2b-it about 2 months ago

Please mention context size for gemma2 in the model card

#19 opened about 2 months ago by

bionicles

New activity in google/gemma-2-27b-it about 2 months ago

Model repeating information and "spitting out" random characters

#12 opened 3 months ago by

brazilianslib

New activity in google/gemma-1.1-7b-it about 2 months ago

Is 1.1 trained from the same SFT model as 1.0?

#18 opened 5 months ago by

chujiezheng

New activity in google/gemma-2b-it about 2 months ago

Generating multiple responses from the same prompt

#50 opened about 2 months ago by

OfriH

New activity in google/gemma-2b about 2 months ago

Strange and limited response

#15 opened 7 months ago by

Squeack

New activity in google/gemma-7b-it about 2 months ago

Bug about number generation?

#30 opened 7 months ago by

myownskyW7

New activity in google/gemma-2-27b-it 2 months ago

gemma-2-27b-it Model Access

#30 opened 2 months ago by

RAGUWING

New activity in google/gemma-2-9b-it 2 months ago

Fails to generate with `inputs_embeds`

#18 opened 3 months ago by

JaronTHU

New activity in google/gemma-2-9b 2 months ago

Gemma2FlashAttention2 missing sliding_window variable

#8 opened 3 months ago by

emozilla

Inference error

8

#20 opened 3 months ago by

gsasikiran

Error

#25 opened 2 months ago by

ImpactInsights

AttributeError: module 'torch._dynamo' has no attribute 'mark_static_address'

#29 opened 2 months ago by

AsirAsir

New activity in google/gemma-2-27b-it 2 months ago

A100 can process only 4k tokens

#27 opened 3 months ago by

KubilayCan

New activity in google/gemma-2-9b 3 months ago

base vs instruct model

#17 opened 3 months ago by

saireddy

New activity in huggingface/HuggingDiscussions 3 months ago

[FEEDBACK] Notifications

129

#6 opened about 2 years ago by

victor

New activity in google/gemma-2-9b 3 months ago

ValueError: Transformers does not recognize this architecture.

#15 opened 3 months ago by

mike202303

New activity in google/paligemma-3b-pt-224 3 months ago

Support for Flash Attention?

#15 opened 3 months ago by

arnaudstiegler

New activity in google/gemma-2b 3 months ago

403 Forbidden: Authorization error

#62 opened 4 months ago by

parkerbotta

New activity in google/gemma-7b 3 months ago

Getting permission issue while trying to access the fine tuned model which is present at rachiteagles/cover_letter

#97 opened 5 months ago by

rachiteagles

New activity in google/gemma-1.1-7b-it 3 months ago

loss padding_side

#12 opened 6 months ago by

NickyNicky

New activity in google/gemma-2b 3 months ago

Gemma tokenizer issue

#37 opened 7 months ago by

Akshayextreme

New activity in google/gemma-7b-it 3 months ago

ValueError with multi A100 GPUS

#28 opened 7 months ago by

saireddy

New activity in google/gemma-7b 3 months ago

8-bit precision error

17

#32 opened 7 months ago by

saireddy

New activity in google/gemma-7b 4 months ago

When to release the 'function call' version

#65 opened 7 months ago by

qijizhuahuli

New activity in google/codegemma-7b 4 months ago

context window size?

#10 opened 6 months ago by

ichigoberry

New activity in google/gemma-2b-it 4 months ago

What do they mean by maj@1 ?

#44 opened 4 months ago by

joserass

New activity in google/gemma-2b 4 months ago

Memory requirements to load the model