Make transformers inference code CPU compatible
#6
by MoritzLaurer - opened
Very cool small models!
The custom code in `modeling_internvl_chat.py` currently hardcodes a GPU requirement with `model_inputs['input_ids'].cuda()` etc. To make the code also run on CPUs (especially attractive for the nice small models), using e.g. `model_inputs['input_ids'].to(model.device)` would be better.
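For anyone hitting this before the fix lands, here is a minimal sketch of the device-agnostic pattern. The helper name is made up for illustration; the key point is that `model.device` (a property transformers models expose) resolves to `cpu` when no GPU is in use, so the same code path works everywhere:

```python
import torch

def move_inputs_to_model_device(model_inputs: dict, model) -> dict:
    """Move every tensor in model_inputs onto the model's device.

    A device-agnostic replacement for hardcoded .cuda() calls:
    works unchanged whether the model lives on CPU or GPU.
    (Hypothetical helper, for illustration only.)
    """
    device = model.device  # transformers models expose .device
    return {
        key: value.to(device) if torch.is_tensor(value) else value
        for key, value in model_inputs.items()
    }
```

For the individual call sites, the one-line change is simply `model_inputs['input_ids'].to(model.device)` in place of `model_inputs['input_ids'].cuda()`.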
Thanks for pointing that out! We'll replace `.cuda()` with `.to(model.device)` to support both GPUs and CPUs, making the code more flexible.
czczup changed discussion status to closed
Now this code is CPU compatible.