Batch Inference

#7 · opened by dfrank

Hi, I've been having problems with batch inference. The only thing that seems to work is setting tokenizer.padding_side = "right", but the results I get are inconsistent with a single inference (by a lot). Any advice?
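For reference, here is a minimal sketch of the kind of setup I mean (the checkpoint name and prompts are placeholders, not my actual code):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2b"  # placeholder checkpoint, not my actual model
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)

tokenizer.padding_side = "right"  # the only setting that runs for me
prompts = ["The capital of France is", "Hi"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True)

with torch.no_grad():
    out = model.generate(**inputs, max_new_tokens=20, do_sample=False)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```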

Google org

Hi @dfrank, this happens because padding changes what the model sees: with right padding, pad tokens sit after each prompt, so batched generation can produce different token sequences than an unpadded single inference. For decoder-only models, left padding is generally recommended for batched generation. Also make sure your model is set to evaluation mode (e.g., model.eval() in PyTorch); this disables any dropout layers, which could otherwise introduce variability in your outputs. If you have any concerns, let us know. Thank you.
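For example, here is a minimal sketch of deterministic batched generation (the checkpoint name and prompts are placeholders; adapt them to your setup):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2b"  # placeholder; use your checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(model_id)
model.eval()  # disable dropout so repeated runs give identical outputs

# Decoder-only models should be padded on the left for generation, so that
# the final position of every row is a real token rather than padding.
tokenizer.padding_side = "left"
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token

prompts = ["The capital of France is", "Hi"]
inputs = tokenizer(prompts, return_tensors="pt", padding=True)

with torch.no_grad():  # no gradients needed at inference time
    out = model.generate(**inputs, max_new_tokens=20, do_sample=False)
print(tokenizer.batch_decode(out, skip_special_tokens=True))
```

Note that do_sample=False makes generation greedy; with sampling enabled, batch and single runs will differ even when everything else is correct.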

Google org

@dfrank, I hope the above clarifies things. Please confirm if you have any further concerns. Thank you.

Thanks @lvk, I had indeed forgotten to set the model to evaluation mode. However, even after doing so, I still can't get consistent results between batch and single inference. I also find it strange to have to set the padding side to the right. When I set the padding to the left (which conceptually makes the most sense to me) I only get NaNs in my logits.
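For reference, this is roughly the consistency check I am running (checkpoint and prompts are placeholders, not my real setup):

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_id = "google/gemma-2b"  # placeholder checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
# float32 rules out fp16 overflow as a source of NaNs
model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype=torch.float32)
model.eval()

tokenizer.padding_side = "left"
prompts = ["The capital of France is", "Hi"]
batch = tokenizer(prompts, return_tensors="pt", padding=True)

# With left padding, a plain forward() assigns positions 0..seq_len-1, which
# shifts the real tokens; deriving position_ids from the attention mask
# restores the positions an unpadded run would use (generate() does this
# internally, forward() does not).
position_ids = batch["attention_mask"].cumsum(-1) - 1
position_ids = position_ids.clamp(min=0)
with torch.no_grad():
    batch_logits = model(**batch, position_ids=position_ids).logits

# Unbatched pass for the short prompt, i.e. the row that actually gets padded.
single = tokenizer(prompts[1], return_tensors="pt")
with torch.no_grad():
    single_logits = model(**single).logits

# Compare only the real (non-pad) positions: the last n slots of row 1.
# Small differences are expected because batched kernels sum in a different
# order; large ones indicate a padding/position problem.
n = single["input_ids"].shape[1]
max_diff = (batch_logits[1, -n:] - single_logits[0]).abs().max()
print(f"max |logit diff| = {max_diff.item():.6f}")
```

From what I've read, NaNs that appear only at the pad positions of the logits tensor can be sliced away and ignored; NaNs at real token positions usually point at the dtype (fp16 overflow) or the attention backend.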
