{"id":135043,"date":"2023-10-01T14:29:43","date_gmt":"2023-10-01T14:29:43","guid":{"rendered":"https:\/\/blog.finxter.com\/?p=1651894"},"modified":"2023-10-01T14:29:43","modified_gmt":"2023-10-01T14:29:43","slug":"gpt-4-with-vision-gpt-4v-is-out-32-fun-examples-with-screenshots","status":"publish","type":"post","link":"https:\/\/sickgaming.net\/blog\/2023\/10\/01\/gpt-4-with-vision-gpt-4v-is-out-32-fun-examples-with-screenshots\/","title":{"rendered":"GPT-4 with Vision (GPT-4V) Is Out! 32 Fun Examples with Screenshots"},"content":{"rendered":"\n<div class=\"kk-star-ratings kksr-auto kksr-align-left kksr-valign-top\" data-payload='{&quot;align&quot;:&quot;left&quot;,&quot;id&quot;:&quot;1651894&quot;,&quot;slug&quot;:&quot;default&quot;,&quot;valign&quot;:&quot;top&quot;,&quot;ignore&quot;:&quot;&quot;,&quot;reference&quot;:&quot;auto&quot;,&quot;class&quot;:&quot;&quot;,&quot;count&quot;:&quot;1&quot;,&quot;legendonly&quot;:&quot;&quot;,&quot;readonly&quot;:&quot;&quot;,&quot;score&quot;:&quot;5&quot;,&quot;starsonly&quot;:&quot;&quot;,&quot;best&quot;:&quot;5&quot;,&quot;gap&quot;:&quot;5&quot;,&quot;greet&quot;:&quot;Rate this post&quot;,&quot;legend&quot;:&quot;5\\\/5 - (1 vote)&quot;,&quot;size&quot;:&quot;24&quot;,&quot;title&quot;:&quot;GPT-4 with Vision (GPT-4V) Is Out! 32 Fun Examples with Screenshots&quot;,&quot;width&quot;:&quot;142.5&quot;,&quot;_legend&quot;:&quot;{score}\\\/{best} - ({count} {votes})&quot;,&quot;font_factor&quot;:&quot;1.25&quot;}'>\n<div class=\"kksr-stars\">\n<div class=\"kksr-stars-inactive\">\n<div class=\"kksr-star\" data-star=\"1\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" data-star=\"2\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" data-star=\"3\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" data-star=\"4\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" data-star=\"5\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<\/p><\/div>\n<div class=\"kksr-stars-active\" style=\"width: 142.5px;\">\n<div class=\"kksr-star\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<div class=\"kksr-star\" style=\"padding-right: 5px\">\n<div class=\"kksr-icon\" style=\"width: 24px; height: 24px;\"><\/div>\n<\/p><\/div>\n<\/p><\/div>\n<\/div>\n<div class=\"kksr-legend\" style=\"font-size: 19.2px;\"> 5\/5 &#8211; (1 vote) <\/div>\n<\/p><\/div>\n<p class=\"has-global-color-8-background-color has-background\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f4a1.png\" alt=\"\ud83d\udca1\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> <strong>TLDR<\/strong>: GPT-4 with vision (GPT-4V) is now out for many ChatGPT Plus users in the US and some other regions! You can instruct GPT-4 to analyze image inputs. GPT-4V incorporates additional modalities such as image inputs into <a href=\"https:\/\/blog.finxter.com\/the-evolution-of-large-language-models-llms-insights-from-gpt-4-and-beyond\/\">large language models (LLMs)<\/a>. Multimodal LLMs will expand the reach of AI from mainly language-based applications to a broad range of brand-new application categories that go beyond language user interfaces (UIs).<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" fetchpriority=\"high\" width=\"472\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-26-17.25.07-1-472x1024.jpg\" alt=\"\" class=\"wp-image-1651897\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-26-17.25.07-1-472x1024.jpg 472w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-26-17.25.07-1-138x300.jpg 138w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-26-17.25.07-1.jpg 590w\" sizes=\"(max-width: 472px) 100vw, 472px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/blog.roboflow.com\/gpt-4-vision\/\">Source<\/a><\/figcaption><\/figure>\n<\/div>\n<p><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f446.png\" alt=\"\ud83d\udc46\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> GPT-4V could explain why a picture was funny by talking about different parts of the image and their connections. The meme in the picture has words on it, which GPT-4V read to help make its answer. However, it made an error. It wrongly said the fried chicken in the image was called \u201cNVIDIA BURGER\u201d instead of \u201cGPU\u201d.<\/p>\n<p class=\"has-global-color-8-background-color has-background\">Still impressive! <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f92f.png\" alt=\"\ud83e\udd2f\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> OpenAI&#8217;s GPT-4 with Vision (GPT-4V) represents a significant advancement in artificial intelligence, enabling the analysis of image inputs alongside text.<\/p>\n<p>Let&#8217;s dive into some additional examples I and others encountered:<\/p>\n<h2 class=\"wp-block-heading\">More Examples<\/h2>\n<p>Prompting GPT-4V with <code>\"How much money do I have?\"<\/code> and a photo of some foreign coins:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"661\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-661x1024.jpeg\" alt=\"\" class=\"wp-image-1651898\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-661x1024.jpeg 661w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-194x300.jpeg 194w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-768x1190.jpeg 768w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image.jpeg 826w\" sizes=\"auto, (max-width: 661px) 100vw, 661px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/blog.roboflow.com\/gpt-4-vision\/\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>GPT4V was even able to identify that these are Polish Zloty Coins, a task with which 99% of humans would struggle:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"816\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1-1024x816.jpeg\" alt=\"\" class=\"wp-image-1651899\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1-1024x816.jpeg 1024w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1-300x239.jpeg 300w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1-768x612.jpeg 768w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1.jpeg 1179w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/blog.roboflow.com\/gpt-4-vision\/\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>It can also identify locations from photos and give you information about plants you make photos of. In this way, it&#8217;s similar to Google Lens but much better and more interactive with a higher level of image understanding.<\/p>\n<p>It can do optical character recognition (OCR) almost flawlessly:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"472\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/File-472x1024.jpg\" alt=\"\" class=\"wp-image-1651900\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/File-472x1024.jpg 472w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/File-138x300.jpg 138w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/File-708x1536.jpg 708w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/File.jpg 738w\" sizes=\"auto, (max-width: 472px) 100vw, 472px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/blog.roboflow.com\/gpt-4-vision\/\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>Now here&#8217;s why many teachers and professors will lose their sleep over GPT-4V: it can even solve math problems from photos (<a href=\"https:\/\/blog.roboflow.com\/gpt-4-vision\/\">source<\/a>):<\/p>\n<div class=\"wp-block-columns is-layout-flex wp-container-3 wp-block-columns-is-layout-flex\">\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<figure class=\"wp-block-image size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"472\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-27-13.25.51-472x1024.jpg\" alt=\"\" class=\"wp-image-1651901\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-27-13.25.51-472x1024.jpg 472w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-27-13.25.51-138x300.jpg 138w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/2023-09-27-13.25.51.jpg 590w\" sizes=\"auto, (max-width: 472px) 100vw, 472px\" \/><\/figure>\n<\/div>\n<div class=\"wp-block-column is-layout-flow wp-block-column-is-layout-flow\">\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"472\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-27-13.25.55-472x1024.jpeg\" alt=\"\" class=\"wp-image-1651902\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-27-13.25.55-472x1024.jpeg 472w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-27-13.25.55-138x300.jpeg 138w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-27-13.25.55.jpeg 590w\" sizes=\"auto, (max-width: 472px) 100vw, 472px\" \/><\/figure>\n<\/div>\n<\/div>\n<\/div>\n<p>GPT-4V can do object detection, a crucial field in AI and ML: one model to rule them all!<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"472\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-26-18.51.24-472x1024.jpeg\" alt=\"\" class=\"wp-image-1651903\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-26-18.51.24-472x1024.jpeg 472w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-26-18.51.24-138x300.jpeg 138w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/photo_2023-09-26-18.51.24.jpeg 590w\" sizes=\"auto, (max-width: 472px) 100vw, 472px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/blog.roboflow.com\/gpt-4-vision\/\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>GPT-4V can even help you play poker <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/2660.png\" alt=\"\u2660\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/2665.png\" alt=\"\u2665\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> <\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"1024\" height=\"922\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMp_BWsAAwGeq-1024x922.jpg\" alt=\"\" class=\"wp-image-1651904\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMp_BWsAAwGeq-1024x922.jpg 1024w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMp_BWsAAwGeq-300x270.jpg 300w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMp_BWsAAwGeq-768x692.jpg 768w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMp_BWsAAwGeq.jpg 1290w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/twitter.com\/emollick\/status\/1706878412856402398\/photo\/1\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>A Twitter\/X user gave it a screenshot of a day planner and asked it to code a digital UI of it. The Python code worked!<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"897\" height=\"600\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-2.png\" alt=\"\" class=\"wp-image-1651906\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-2.png 897w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-2-300x201.png 300w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-2-768x514.png 768w\" sizes=\"auto, (max-width: 897px) 100vw, 897px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/twitter.com\/search?q=GPT-4V&amp;src=typeahead_click\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>Speaking of coding, here&#8217;s a fun example by another creative developer, Matt Shumer:<\/p>\n<p class=\"has-global-color-8-background-color has-background\"><code>\"The first GPT-4V-powered frontend engineer agent. Just upload a picture of a design, and the agent autonomously codes it up, looks at a render for mistakes, improves the code accordingly, repeat. Utterly insane.\"<\/code> (<a href=\"https:\/\/twitter.com\/mattshumer_\/status\/1707814443785060729\">source<\/a>)<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"832\" height=\"420\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/gpt4v.gif\" alt=\"\" class=\"wp-image-1651907\"\/><\/figure>\n<\/div>\n<p>I&#8217;ve even seen GPT-4V analyzing financial data like Bitcoin indicators:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/twitter.com\/youraimarketer\/status\/1706739489618657467\" target=\"_blank\" rel=\"noreferrer noopener\"><img decoding=\"async\" loading=\"lazy\" width=\"877\" height=\"900\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F6-OQnUWUAA3Qyz.jpg\" alt=\"\" class=\"wp-image-1651908\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F6-OQnUWUAA3Qyz.jpg 877w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F6-OQnUWUAA3Qyz-292x300.jpg 292w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F6-OQnUWUAA3Qyz-768x788.jpg 768w\" sizes=\"auto, (max-width: 877px) 100vw, 877px\" \/><\/a><figcaption class=\"wp-element-caption\">source<\/figcaption><\/figure>\n<\/div>\n<p>I could go on forever. Here are 20 more ideas of how to use GPT-4V that I found extremely interesting, fun, and even visionary:<\/p>\n<ol>\n<li><strong>Visual Assistance for the Blind:<\/strong> GPT-4V can describe the surroundings or read out text from images to assist visually impaired individuals.<\/li>\n<li><strong>Educational Tutor:<\/strong> It can analyze diagrams and provide detailed explanations, helping students understand complex concepts.<\/li>\n<li><strong>Medical Imaging:<\/strong> Assist doctors by providing preliminary observations from medical images (though not for making diagnoses).<\/li>\n<li><strong>Recipe Suggestions:<\/strong> Users can show ingredients they have, and GPT-4V can suggest possible recipes.<\/li>\n<li><strong>Fashion Advice:<\/strong> Offer fashion tips by analyzing pictures of outfits.<\/li>\n<li><strong>Plant or Animal Identification:<\/strong> Identify and provide information about plants or animals in photos.<\/li>\n<li><strong>Travel Assistance:<\/strong> Analyze photos of landmarks to provide historical and cultural information.<\/li>\n<li><strong>Language Translation:<\/strong> Read and translate text in images from one language to another.<\/li>\n<li><strong>Home Decor Planning:<\/strong> Provide suggestions for home decor based on pictures of users&#8217; living spaces.<\/li>\n<li><strong>Art Creation:<\/strong> Offer guidance and suggestions for creating art by analyzing images of ongoing artwork.<\/li>\n<li><strong>Fitness Coaching:<\/strong> Analyze workout or yoga postures and offer corrections or enhancements.<\/li>\n<li><strong>Event Planning:<\/strong> Assist in planning events by visualizing and organizing space, decorations, and layouts.<\/li>\n<li><strong>Shopping Assistance:<\/strong> Help users in making purchasing decisions by analyzing product images and providing information.<\/li>\n<li><strong>Gardening Advice:<\/strong> Provide gardening tips based on pictures of plants and their surroundings.<\/li>\n<li><strong>DIY Project Guidance:<\/strong> Offer step-by-step guidance for DIY projects by analyzing images of the project at various stages.<\/li>\n<li><strong>Safety Training:<\/strong> Analyze images of workplace environments to offer safety recommendations.<\/li>\n<li><strong>Historical Analysis:<\/strong> Provide historical context and information for images of historical events or figures.<\/li>\n<li><strong>Real Estate Assistance:<\/strong> Analyze images of properties to provide insights and information for buyers or sellers.<\/li>\n<li><strong>Wildlife Research:<\/strong> Assist researchers by analyzing images of wildlife and their habitats.<\/li>\n<li><strong>Meme Creation:<\/strong> Help users create memes by suggesting text or edits based on the image provided.<\/li>\n<\/ol>\n<p>These are truly mind-boggling times. Most of those ideas are million-dollar startup ideas. Some ideas (like the real estate assistance app #18) could become billion-dollar businesses that are mostly built on GPT-4V&#8217;s functionality and are easy to implement for coders like you and me.<\/p>\n<p>If you&#8217;re interested, feel free to read my other article on the Finxter blog:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><a href=\"https:\/\/blog.finxter.com\/startup-ai-eight-steps-to-start-an-ai-subscription-biz\/\" target=\"_blank\" rel=\"noreferrer noopener\"><img decoding=\"async\" loading=\"lazy\" width=\"1024\" height=\"573\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-98-1024x573.png\" alt=\"\" class=\"wp-image-1651909\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-98-1024x573.png 1024w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-98-300x168.png 300w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-98-768x430.png 768w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-98.png 1350w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/a><\/figure>\n<\/div>\n<p class=\"has-base-2-background-color has-background\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f4c8.png\" alt=\"\ud83d\udcc8\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> <strong>Recommended<\/strong>: <a href=\"https:\/\/blog.finxter.com\/startup-ai-eight-steps-to-start-an-ai-subscription-biz\/\">Startup.ai \u2013 Eight Steps to Start an AI Subscription Biz<\/a><\/p>\n<h2 class=\"wp-block-heading\">What About SaFeTY?<\/h2>\n<p>GPT-4V is a <strong>multimodal large language model <\/strong>that incorporates image inputs, expanding the impact of language-only systems by solving new tasks and providing novel experiences for users. It builds upon the work done for <a href=\"https:\/\/blog.finxter.com\/20-ways-to-make-money-with-gpt-4\/\">GPT-4<\/a>, employing a similar training process and <a href=\"https:\/\/en.wikipedia.org\/wiki\/Reinforcement_learning_from_human_feedback\">reinforcement learning from human feedback (RLHF)<\/a> to produce outputs preferred by human trainers.<\/p>\n<p>Why RLHF? Mainly to avoid jailbreaking <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f622.png\" alt=\"\ud83d\ude22\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f605.png\" alt=\"\ud83d\ude05\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> like so:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"445\" height=\"459\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1.png\" alt=\"\" class=\"wp-image-1651896\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1.png 445w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-1-291x300.png 291w\" sizes=\"auto, (max-width: 445px) 100vw, 445px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/openai.com\/research\/gpt-4v-system-card\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>You can see that the &#8220;refusal rate&#8221; went up significantly:<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><img decoding=\"async\" loading=\"lazy\" width=\"970\" height=\"394\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image.png\" alt=\"\" class=\"wp-image-1651895\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image.png 970w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-300x122.png 300w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/image-768x312.png 768w\" sizes=\"auto, (max-width: 970px) 100vw, 970px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/openai.com\/research\/gpt-4v-system-card\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>From an everyday user perspective that doesn&#8217;t try to harm people, the <code>\"Sorry I cannot do X\"<\/code> reply will remain one of the more annoying parts of <a href=\"https:\/\/blog.finxter.com\/5-best-open-source-llms-in-2023-two-minute-guide\/\">LLM<\/a> tech, unfortunately.<\/p>\n<p>However, the race is on! People have still reported <a href=\"https:\/\/blog.finxter.com\/how-to-jailbreak-chatgpt-and-whats-1-btc-worth-in-2030\/\">jailbroken queries<\/a> like this: <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f602.png\" alt=\"\ud83d\ude02\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> <\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-large\"><img decoding=\"async\" loading=\"lazy\" width=\"593\" height=\"1024\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMs74W8AAxE2O-593x1024.jpg\" alt=\"\" class=\"wp-image-1651905\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMs74W8AAxE2O-593x1024.jpg 593w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMs74W8AAxE2O-174x300.jpg 174w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMs74W8AAxE2O-768x1326.jpg 768w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMs74W8AAxE2O-889x1536.jpg 889w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/10\/F7AMs74W8AAxE2O.jpg 1140w\" sizes=\"auto, (max-width: 593px) 100vw, 593px\" \/><figcaption class=\"wp-element-caption\"><a href=\"https:\/\/twitter.com\/emollick\/status\/1706878412856402398\/photo\/3\">source<\/a><\/figcaption><\/figure>\n<\/div>\n<p>I hope you had fun reading this compilation of GPT-4V ideas. Thanks for reading! <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/2665.png\" alt=\"\u2665\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> If you&#8217;re not already subscribed, feel free to join our popular <a href=\"https:\/\/academy.finxter.com\/\">Finxter Academy<\/a> with dozens of state-of-the-art LLM prompt engineering courses for next-level exponential coders. It&#8217;s an all-you-can-learn inexpensive way to remain on the right side of change.<\/p>\n<p>For example, this is one of our recent courses:<\/p>\n<h2 class=\"wp-block-heading\">Prompt Engineering with Llama 2<\/h2>\n<p class=\"has-global-color-8-background-color has-background\"><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f4a1.png\" alt=\"\ud83d\udca1\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> The\u00a0<strong><a href=\"https:\/\/academy.finxter.com\/university\/prompt-engineering-with-llama-2\/\">Llama 2 Prompt Engineering course<\/a><\/strong> helps you stay on the right side of change.\u00a0Our course is meticulously designed to provide you with <em>hands-on experience through genuine projects<\/em>.<\/p>\n<div class=\"wp-block-image\">\n<figure class=\"aligncenter size-full\"><a href=\"https:\/\/academy.finxter.com\/university\/prompt-engineering-with-llama-2\/\" target=\"_blank\" rel=\"noreferrer noopener\"><img decoding=\"async\" loading=\"lazy\" width=\"919\" height=\"261\" src=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/09\/image-101.png\" alt=\"\" class=\"wp-image-1651689\" srcset=\"https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/09\/image-101.png 919w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/09\/image-101-300x85.png 300w, https:\/\/blog.finxter.com\/wp-content\/uploads\/2023\/09\/image-101-768x218.png 768w\" sizes=\"auto, (max-width: 919px) 100vw, 919px\" \/><\/a><\/figure>\n<\/div>\n<p>You&#8217;ll delve into practical applications such as book PDF querying, payroll auditing, and hotel review analytics. These aren&#8217;t just theoretical exercises; they&#8217;re real-world challenges that businesses face daily.<\/p>\n<p>By studying these projects, you&#8217;ll gain a deeper comprehension of how to harness the power of Llama 2 using <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f40d.png\" alt=\"\ud83d\udc0d\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> Python, <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f517.png\" alt=\"\ud83d\udd17\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f99c.png\" alt=\"\ud83e\udd9c\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> Langchain, <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f332.png\" alt=\"\ud83c\udf32\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> Pinecone, and a whole stack of highly <img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/2692.png\" alt=\"\u2692\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/><img decoding=\"async\" src=\"https:\/\/s.w.org\/images\/core\/emoji\/14.0.0\/72x72\/1f6e0.png\" alt=\"\ud83d\udee0\" class=\"wp-smiley\" style=\"height: 1em; max-height: 1em;\" \/> practical tools of exponential coders in a post-ChatGPT world.<\/p>\n<p>The post <a rel=\"nofollow\" href=\"https:\/\/blog.finxter.com\/gpt-4-with-vision-gpt-4v-is-out-32-fun-examples-with-screenshots\/\">GPT-4 with Vision (GPT-4V) Is Out! 32 Fun Examples with Screenshots<\/a> appeared first on <a rel=\"nofollow\" href=\"https:\/\/blog.finxter.com\">Be on the Right Side of Change<\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>5\/5 &#8211; (1 vote) TLDR: GPT-4 with vision (GPT-4V) is now out for many ChatGPT Plus users in the US and some other regions! You can instruct GPT-4 to analyze image inputs. GPT-4V incorporates additional modalities such as image inputs into large language models (LLMs). Multimodal LLMs will expand the reach of AI from mainly [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[20,857],"tags":[73,468,528],"class_list":["post-135043","post","type-post","status-publish","format-standard","hentry","category-news","category-python-tut","tag-programming","tag-python","tag-tutorial"],"_links":{"self":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts\/135043","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/comments?post=135043"}],"version-history":[{"count":0,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts\/135043\/revisions"}],"wp:attachment":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/media?parent=135043"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/categories?post=135043"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/tags?post=135043"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}