A few days ago, OpenAI added a preview version of GPT4-V to their API, allowing the introduction of images to the chat endpoint. After reading about the David Attenborogh-narration proof of concept, I decided to test out this functionality by mocking up a movie-summarizer that uses exclusively the video component of the movie.