Posts

Showing posts from April 25, 2024

[Day 115] Exploring HuggingFace's capabilities and submitting 3rd homework from the ML with Graphs course

Image
 Hello :) Today is Day 115! A quick summary of today: submitted 3rd homework of XCS224W:ML with Graphs explored different huggingface capabilities with DeepLearning.AI As for the homework , we are not allowed to share anything from it. But I can happily share I got full marks ^^ As for the huggingface tutorial It showcased the different type of models that were available. Below is a summary.  Building a chat pipeline Text translation Text summarization Zero-shot audio classifier Apparently, the model seens the audio differently -  1 second of high resolution audio appears to the model as if it is 12 seconds of audio. Text to speech Object detection (code before the pic: od_pipe = pipeline("object-detection", "./models/facebook/detr-resnet-50")) We can use gradio as a sample interface I passed a picture of mine to check haha We can also get natural language descriptions Image captioning Example image Using the dog and woman pic again for multimodal QA Zero-shot image