florentgbelidji
/

vit-gpt2-image-captioning

Inference Endpoints

Model card Files Files and versions Community

vit-gpt2-image-captioning / handler.py

florentgbelidji's picture

florentgbelidji HF staff

Added handler.py

35bf230 over 1 year ago

history blame contribute delete

799 Bytes

	import torch
	from typing import Dict, List, Any
	from transformers import pipeline


	# check for GPU
	device = torch.device("cuda" if torch.cuda.is_available() else "cpu")


	class EndpointHandler():
	def __init__(self, path=""):
	# Preload all the elements you are going to need at inference.
	# pseudo:
	self.pipeline= pipeline("image-to-text", model="nlpconnect/vit-gpt2-image-captioning", device=device)

	def __call__(self, data: Dict[str, Any]) -> List[Dict[str, Any]]:
	"""
	data args:
	inputs (:obj: `str` \| `PIL.Image` \| `np.array`)
	kwargs
	Return:
	A :obj:`list` \| `dict`: will be serialized and returned
	"""

	inputs = data.pop("inputs", data)
	return self.pipeline(inputs)