Sick Gaming
[Tut] GPT-4 is Out! A New Language Model on Steroids - Printable Version

+- Sick Gaming (https://www.sickgaming.net)
+-- Forum: Programming (https://www.sickgaming.net/forum-76.html)
+--- Forum: Python (https://www.sickgaming.net/forum-83.html)
+--- Thread: [Tut] GPT-4 is Out! A New Language Model on Steroids (/thread-100869.html)



[Tut] GPT-4 is Out! A New Language Model on Steroids - xSicKxBot - 03-15-2023

GPT-4 is Out! A New Language Model on Steroids

<div>
<div class="kk-star-ratings kksr-auto kksr-align-left kksr-valign-top" data-payload='{&quot;align&quot;:&quot;left&quot;,&quot;id&quot;:&quot;1208854&quot;,&quot;slug&quot;:&quot;default&quot;,&quot;valign&quot;:&quot;top&quot;,&quot;ignore&quot;:&quot;&quot;,&quot;reference&quot;:&quot;auto&quot;,&quot;class&quot;:&quot;&quot;,&quot;count&quot;:&quot;3&quot;,&quot;legendonly&quot;:&quot;&quot;,&quot;readonly&quot;:&quot;&quot;,&quot;score&quot;:&quot;5&quot;,&quot;starsonly&quot;:&quot;&quot;,&quot;best&quot;:&quot;5&quot;,&quot;gap&quot;:&quot;5&quot;,&quot;greet&quot;:&quot;Rate this post&quot;,&quot;legend&quot;:&quot;5\/5 - (3 votes)&quot;,&quot;size&quot;:&quot;24&quot;,&quot;width&quot;:&quot;142.5&quot;,&quot;_legend&quot;:&quot;{score}\/{best} - ({count} {votes})&quot;,&quot;font_factor&quot;:&quot;1.25&quot;}'>
<div class="kksr-stars">
<div class="kksr-stars-inactive">
<div class="kksr-star" data-star="1" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" data-star="2" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" data-star="3" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" data-star="4" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" data-star="5" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
</p></div>
<div class="kksr-stars-active" style="width: 142.5px;">
<div class="kksr-star" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
<div class="kksr-star" style="padding-right: 5px">
<div class="kksr-icon" style="width: 24px; height: 24px;"></div>
</p></div>
</p></div>
</div>
<div class="kksr-legend" style="font-size: 19.2px;"> 5/5 – (3 votes) </div>
</p></div>
<p>Hold onto your hats, folks! OpenAI has just introduced <a rel="noreferrer noopener" href="https://openai.com/research/gpt-4" data-type="URL" data-id="https://openai.com/research/gpt-4" target="_blank">GPT-4</a>, a colossal leap forward in deep learning. </p>
<p>This large multimodal model is more than just a text-based smarty-pants; it can process <strong>both images and text</strong>, producing text outputs that are nothing short of impressive. </p>
<p class="has-base-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/2694.png" alt="⚔" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Showdown</strong>: At the end of this article, I’ll let both GPT-3.5 and GPT-4 write a short story. You’ll be mind-blown by the difference!</p>
<p>But how does it stack up against its predecessor, GPT-3.5? </p>
<p>In this beginner-friendly blog post, we’ll break down the key differences, explain why GPT-4 is such a big deal, and give you some mind-blowing statistics to showcase its extraordinary capabilities.</p>
<p class="has-base-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/2705.png" alt="✅" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://cdn.openai.com/papers/gpt-4.pdf" data-type="URL" data-id="https://cdn.openai.com/papers/gpt-4.pdf" target="_blank" rel="noreferrer noopener">Read the full paper here.</a></p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="313" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-153.png" alt="" class="wp-image-1208924" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-153.png 640w, https://blog.finxter.com/wp-content/uploads/2023/03/image-153-300x147.png 300w" sizes="(max-width: 640px) 100vw, 640px" /></figure>
</div>
<h2>GPT-4: The Superstar Student</h2>
<p class="has-global-color-8-background-color has-background">Picture this: GPT-4 just took a simulated bar exam and scored in the top 10% of test-takers! To put that in perspective, GPT-3.5’s score was around the bottom 10%. </p>
<p>That’s a massive improvement, and it’s all thanks to OpenAI’s tireless efforts to fine-tune GPT-4 over six months using lessons from adversarial testing and ChatGPT. The result? Improved factuality, steerability, and adherence to guardrails.</p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="790" height="300" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-154.png" alt="" class="wp-image-1208925" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-154.png 790w, https://blog.finxter.com/wp-content/uploads/2023/03/image-154-300x114.png 300w, https://blog.finxter.com/wp-content/uploads/2023/03/image-154-768x292.png 768w" sizes="(max-width: 790px) 100vw, 790px" /></figure>
</div>
<h2>Supercomputer Stack</h2>
<div class="wp-block-image">
<figure class="aligncenter size-large"><img decoding="async" loading="lazy" width="1024" height="356" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-155-1024x356.png" alt="" class="wp-image-1208927" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-155-1024x356.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/03/image-155-300x104.png 300w, https://blog.finxter.com/wp-content/uploads/2023/03/image-155-768x267.png 768w, https://blog.finxter.com/wp-content/uploads/2023/03/image-155-1536x533.png 1536w, https://blog.finxter.com/wp-content/uploads/2023/03/image-155-2048x711.png 2048w" sizes="(max-width: 1024px) 100vw, 1024px" /><figcaption class="wp-element-caption"><a href="https://azure.microsoft.com/en-us/products/cognitive-services/openai-service" data-type="URL" data-id="https://azure.microsoft.com/en-us/products/cognitive-services/openai-service" target="_blank" rel="noreferrer noopener">source</a></figcaption></figure>
</div>
<p>The brainpower behind GPT-4 isn’t all software – there’s some serious hardware at play too. </p>
<p>Over the past two years, OpenAI rebuilt its entire deep learning stack, partnering with Azure to create a supercomputer specifically designed for their workload. </p>
<p>GPT-3.5 was like a test run — an impressive one nonetheless — allowing the team to work out the kinks and refine their approach. </p>
<p>And boy, did they succeed!</p>
<p>GPT-4’s training was stable and predictable, making it the first large model with performance that could be accurately forecasted ahead of time.</p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="792" height="157" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-156.png" alt="" class="wp-image-1208929" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-156.png 792w, https://blog.finxter.com/wp-content/uploads/2023/03/image-156-300x59.png 300w, https://blog.finxter.com/wp-content/uploads/2023/03/image-156-768x152.png 768w" sizes="(max-width: 792px) 100vw, 792px" /></figure>
</div>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="513" height="320" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-157.png" alt="" class="wp-image-1208930" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-157.png 513w, https://blog.finxter.com/wp-content/uploads/2023/03/image-157-300x187.png 300w" sizes="(max-width: 513px) 100vw, 513px" /></figure>
</div>
<h2>Text Input Capability: Now Available for Everyone!</h2>
<p>GPT-4’s text input capabilities have been released via ChatGPT and the API (with a waitlist), so everyone can enjoy its remarkable advancements. </p>
<p>OpenAI is also working on its image input capability, collaborating with a single partner to prepare it for wider availability. </p>
<p>Plus, they’re open-sourcing <strong>OpenAI Evals</strong>, a framework for evaluating AI model performance, so you can help guide future improvements.</p>
<h2>GPT-4 vs. GPT-3.5: The Showdown</h2>
<div class="wp-block-image">
<figure class="aligncenter size-large"><img decoding="async" loading="lazy" width="1024" height="340" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-152-1024x340.png" alt="" class="wp-image-1208921" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-152-1024x340.png 1024w, https://blog.finxter.com/wp-content/uploads/2023/03/image-152-300x100.png 300w, https://blog.finxter.com/wp-content/uploads/2023/03/image-152-768x255.png 768w, https://blog.finxter.com/wp-content/uploads/2023/03/image-152.png 1379w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>
</div>
<p>When it comes to casual conversation, the differences between GPT-3.5 and GPT-4 might seem subtle. But the magic happens when the task complexity increases. </p>
<p>GPT-4 is more reliable, creative, and capable of handling nuanced instructions than GPT-3.5. </p>
<p>OpenAI tested both models on various benchmarks, including simulated exams designed for humans. GPT-4’s performance was nothing short of astonishing.</p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="770" height="690" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-158.png" alt="" class="wp-image-1208931" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-158.png 770w, https://blog.finxter.com/wp-content/uploads/2023/03/image-158-300x269.png 300w, https://blog.finxter.com/wp-content/uploads/2023/03/image-158-768x688.png 768w" sizes="(max-width: 770px) 100vw, 770px" /></figure>
</div>
<p>Above 50% is super-human average performance! <img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f680.png" alt="?" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<h2>Beyond English: Multilingual Mastery</h2>
<p>GPT-4 doesn’t just excel in English; it shines in other languages too. </p>
<p>OpenAI tested GPT-4 using the MMLU benchmark, a suite of 14,000 multiple-choice problems across 57 subjects, translated into various languages. </p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="816" height="969" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-159.png" alt="" class="wp-image-1208932" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-159.png 816w, https://blog.finxter.com/wp-content/uploads/2023/03/image-159-253x300.png 253w, https://blog.finxter.com/wp-content/uploads/2023/03/image-159-768x912.png 768w" sizes="(max-width: 816px) 100vw, 816px" /></figure>
</div>
<p>In 24 of 26 languages tested, GPT-4 outperformed GPT-3.5 and other large language models—even in low-resource languages like Latvian, Welsh, and Swahili!</p>
<h2>A Handy Helper in the Office</h2>
<p>GPT-4 isn’t just flexing its muscles in the lab; it’s making a real-world impact too. </p>
<p>OpenAI has been using GPT-4 internally for support, sales, content moderation, programming, and even evaluating AI outputs. It’s a versatile and powerful tool that’s ready to revolutionize the way we work.</p>
<p>GPT-4 is significantly better than previous versions in a wide variety of general-purpose skills:</p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="774" height="586" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-160.png" alt="" class="wp-image-1208933" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-160.png 774w, https://blog.finxter.com/wp-content/uploads/2023/03/image-160-300x227.png 300w, https://blog.finxter.com/wp-content/uploads/2023/03/image-160-768x581.png 768w" sizes="(max-width: 774px) 100vw, 774px" /></figure>
</div>
<h2>Visual Inputs: A Sneak Peek into the Future</h2>
<p>But wait, there’s more! <img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f92f.png" alt="?" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<p>GPT-4 can also handle visual inputs, allowing users to prompt the model with both text and images. This means it can generate text outputs based on a variety of image types, like documents with text and photographs, diagrams, or screenshots. </p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="823" height="959" src="https://blog.finxter.com/wp-content/uploads/2023/03/image-151.png" alt="" class="wp-image-1208920" srcset="https://blog.finxter.com/wp-content/uploads/2023/03/image-151.png 823w, https://blog.finxter.com/wp-content/uploads/2023/03/image-151-257x300.png 257w, https://blog.finxter.com/wp-content/uploads/2023/03/image-151-768x895.png 768w" sizes="(max-width: 823px) 100vw, 823px" /></figure>
</div>
<p>GPT-4’s capabilities in this domain are similar to its text-only prowess and can be enhanced with test-time techniques developed for text-only language models, such as few-shot and chain-of-thought prompting. </p>
<p>Keep in mind that image inputs are still in the research preview stage and not yet publicly available.</p>
<h2>Conclusion</h2>
<p>In a nutshell, GPT-4 is the AI game-changer we’ve all been waiting for. </p>
<p>With human-level performance on various professional and academic benchmarks, impressive multilingual capabilities, and the ability to process both text and image inputs, it’s easy to see why GPT-4 is making waves in the AI community. </p>
<p>As OpenAI continues to refine and expand GPT-4’s capabilities, we can expect even more groundbreaking advancements in the near future. So, stay tuned, AI has never been more exciting!</p>
<p class="has-base-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f680.png" alt="?" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a rel="noreferrer noopener" href="https://blog.finxter.com/building-a-movie-recommendation-app-with-chatgpt/" data-type="URL" data-id="https://blog.finxter.com/building-a-movie-recommendation-app-with-chatgpt/" target="_blank">ChatGPT at the Heart – Building a Movie Recommendation Python Web App in 2023</a></p>
<h2>Interesting Facts and Stats GPT-4</h2>
<ol>
<li>GPT-4 is a multimodal model, capable of processing both text and image inputs.</li>
<li>It has shown human-level performance on various professional and academic benchmarks.</li>
<li>On a simulated bar exam, GPT-4 scored in the top 10% of test-takers, while GPT-3.5 scored in the bottom 10%.</li>
<li>OpenAI spent six months fine-tuning GPT-4 using lessons from adversarial testing and ChatGPT.</li>
<li>GPT-4 was tested on the MMLU benchmark, a suite of 14,000 multiple-choice problems across 57 subjects.</li>
<li>In 24 of the 26 languages tested on the MMLU benchmark, GPT-4 outperformed GPT-3.5 and other large language models.</li>
<li>OpenAI used GPT-4 internally for various tasks, including support, sales, content moderation, and programming.</li>
<li>GPT-4 can generate text outputs from inputs consisting of interspersed text and images.</li>
<li>The development of GPT-4 involved rebuilding OpenAI’s entire deep learning stack and co-designing a supercomputer with Azure.</li>
<li>GPT-4 considerably outperforms existing large language models and most state-of-the-art models on traditional benchmarks designed for machine learning models.</li>
</ol>
<h2>Appendix: GPT-3.5 vs GPT-4 Short Story</h2>
<p>To showcase the new capabilities of GPT-4, I let both write a short story with the following prompt:</p>
<pre class="wp-block-preformatted"><strong><code>Write a funny soccer story for kids that teaches the kid that persistence is key for success in the style of Rowling.</code></strong></pre>
<p>Here’s the output of GPT-3.5:</p>
<pre class="wp-block-preformatted"><code>Once upon a time, in a small village in England, there was a young boy named Harry who loved playing soccer more than anything in the world. He dreamed of one day playing for the famous Manchester United team, but he wasn't very good at it. Harry would spend hours every day practicing his soccer skills, but no matter how much he tried, he just couldn't seem to improve. His kicks were weak, his passes were sloppy, and he always seemed to trip over his own feet. One day, Harry heard that the Manchester United team was holding tryouts in his town. He knew that he wasn't good enough to make the team, but he decided to give it a try anyway. He put on his favorite soccer jersey, laced up his cleats, and headed to the field. When he arrived, he saw hundreds of other kids just like him, all vying for a chance to play for Manchester United. Harry watched as the other kids showed off their skills, dribbling the ball with ease, making perfect passes, and scoring goal after goal. Harry knew that he couldn't compete with those kids, but he refused to give up. He stepped up to the ball, took a deep breath, and kicked it with all his might. The ball flew into the air, wobbling and spinning wildly, and Harry watched in horror as it sailed straight into the goalpost and bounced back toward him. But Harry didn't give up. He chased after the ball, ignoring the snickers and jeers of the other kids, and kicked it again. And again. And again. Each time, the ball sailed in a different direction, bouncing off the other kids, the goalposts, and even the referee's head. But Harry didn't care. He was having the time of his life, and he refused to let anyone else bring him down. He kicked the ball with all his might, laughing and shouting and cheering himself on. And then, something amazing happened. The Manchester United coach, who had been watching the tryouts from the sidelines, walked up to Harry and tapped him on the shoulder. "Son," the coach said, "I've never seen anyone as persistent as you. You may not be the best soccer player on this field, but you have heart. And that's something that can't be taught." And with that, the coach offered Harry a spot on the Manchester United team. Harry couldn't believe his ears. He had made it! He had achieved his dream! From that day on, Harry worked harder than ever before, practicing every day and pushing himself to be the best he could be. And in the end, his persistence paid off. He became one of the greatest soccer players in the world, known not just for his skill, but for his determination and his unwavering spirit. So remember, kids: persistence is key. If you want to achieve your dreams, you can't give up. You have to keep pushing, keep trying, and keep kicking that soccer ball with all your might. Who knows? Maybe one day, you'll be the next Harry of the soccer world!</code></pre>
<p>And here’s the impressive output of GPT-4, notice the unique use of words and creative ideas and turns in the storyline! <img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f447.png" alt="?" class="wp-smiley" style="height: 1em; max-height: 1em;" /></p>
<pre class="wp-block-preformatted"><code>Once upon a time in the magical land of Fussballia, there lived a young and spirited soccer player named Jorrick Kickalot. Jorrick was no ordinary soccer player. He had an extraordinary talent of kicking soccer balls with the precision of a master archer. However, his small size made it difficult for him to make the local team, the Fussballia Falcons. Jorrick's best friend, a tiny but wise owl named Hootius Whistlebeak, would always accompany him to the games. One day, Hootius spotted a secret sign in the sky. It was an invitation to the legendary Flying Cleat Cup, a mysterious soccer tournament that only appeared once every 100 years. "Jorrick, this is your chance to prove yourself!" said Hootius excitedly, flapping his wings in anticipation. Together, they embarked on a long and treacherous journey to the secret location of the Flying Cleat Cup. They encountered peculiar creatures, such as the Zigzag Zephyrs, who could change the direction of the wind with a flick of their tails, and the elusive Balliphants, giant soccer balls with elephant trunks. Finally, they reached the hidden stadium, where they found teams of all shapes, sizes, and magical abilities. Jorrick felt nervous and unsure, but he remembered Hootius' words: "Persistence is the key to success!" The tournament began with a grand fanfare. In the first match, Jorrick and his makeshift team of magical creatures faced off against the towering Stomposaurs, a team of dinosaur-like creatures with powerful legs. Despite their intimidating size, Jorrick and his teammates worked together, and he managed to score the winning goal with a perfectly placed kick. In the next round, they faced the clever Whizbang Wizards, a team of magical beings who could make the ball disappear and reappear in the most unexpected places. Jorrick, however, was not to be outwitted. With Hootius' help, he kept a sharp eye on the ball and successfully led his team to another victory. The final match was against the reigning champions, the Enchanted Elves, known for their unmatched speed and agility. Jorrick was exhausted and felt the weight of the pressure on his shoulders. His legs were heavy, and his heart pounded in his chest. As the match began, the Enchanted Elves darted around the field, scoring goal after goal. It seemed like all hope was lost. But then, Hootius flew down to Jorrick and whispered, "Remember, persistence is the key to success!" With renewed determination, Jorrick summoned every ounce of strength he had left. He dodged, weaved, and kicked with all his might. Miraculously, he scored an equalizer in the final minutes of the game, sending it into extra time. In the final moments, Jorrick took a deep breath and prepared for a corner kick. He spotted Hootius in the stands, who gave him a reassuring nod. Jorrick took a step back, and with a powerful swing of his leg, sent the ball soaring through the air. The ball curved around the defenders, flew past the goalie, and nestled perfectly into the corner of the net. The stadium erupted in cheers as Jorrick's team celebrated their unexpected victory. They had won the Flying Cleat Cup, and Jorrick had proven that size did not matter when it came to playing soccer. Jorrick and Hootius returned home to Fussballia, where they were greeted as heroes. And from that day on, Jorrick Kickalot became a legend, not only for his incredible skills but also for his unwavering persistence. Young soccer players from all over the land would come to watch Jorrick play and learn from him. They admired not just his talent, but also his determination and grit. Jorrick always took the time to share his wisdom with the aspiring athletes, reminding them of the important lesson he had learned: "Persistence is the key to success!" As for Hootius Whistlebeak, he became the official mascot of the Fussballia Falcons, always encouraging the players to never give up, no matter the odds. Years later, the story of Jorrick Kickalot and his magical adventure at the Flying Cleat Cup became a beloved tale in Fussballia. Parents would tell their children the story of the young soccer player who, against all odds, achieved greatness through his sheer persistence and determination. And so, the legacy of Jorrick Kickalot lived on, inspiring generations of young soccer players to believe in themselves and never give up, no matter how tough the challenges they faced. They knew that if they could persist like Jorrick, they too could achieve their dreams and become the best soccer players they could be.</code></pre>
</div>


https://www.sickgaming.net/blog/2023/03/14/gpt-4-is-out-a-new-language-model-on-steroids/