[Tut] Choose the Best Open-Source LLM with This Powerful Tool


<div>
<p><a href="https://blog.finxter.com/the-open-source-ecosystem-outruns-tech-giants-a-shift-in-ai-landscape/" data-type="post" data-id="1373436" target="_blank" rel="noreferrer noopener">Open-source LLMs</a> have taken the world by storm in a little over two months, ever since LLaMA’s weights became available for anyone to tinker with. That happened less than two weeks after Meta first released LLaMA.</p>
<p class="has-global-color-8-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> A <strong>model’s weights</strong> are the numeric values its parameters take on after the model is trained on a dataset. How many parameters a model has is determined by design choices such as the number of layers and their width, and more parameters generally allow the model to give more complex answers to what the user inputs.</p>
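<p>As a rough illustration of what "parameters" means here, the sketch below counts the weights and biases of a tiny fully connected network. The layer sizes and the helper name <code>count_parameters</code> are made up for this example; real LLMs use far larger and more elaborate layers, but the idea that each connection holds one trained value carries over.</p>

```python
def count_parameters(layer_sizes):
    """Count weights and biases in a fully connected network.

    layer_sizes is a list such as [4, 8, 2]: 4 inputs, one hidden
    layer of 8 units, and 2 outputs (hypothetical sizes).
    """
    total = 0
    for n_in, n_out in zip(layer_sizes, layer_sizes[1:]):
        total += n_in * n_out  # one weight per input-output connection
        total += n_out         # one bias per output unit
    return total


tiny = count_parameters([4, 8, 2])
wider = count_parameters([4, 16, 2])
print(tiny, wider)
```

<p>Doubling the width of the hidden layer roughly doubles the count, which hints at how a model like the "13B" ones discussed here reaches billions of parameters.</p>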
<p>This led to a flurry of advancements from dedicated open-source community members. Through just the use of their personal hardware, they were able to make leaps and bounds in their quest to place the most powerful AI in the hands of everyday people.</p>
<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f468-200d-1f4bb.png" alt="👨‍💻" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/a-quick-and-dirty-dip-into-cutting-edge-open-source-llm-research/" data-type="URL" data-id="https://blog.finxter.com/a-quick-and-dirty-dip-into-cutting-edge-open-source-llm-research/" target="_blank" rel="noreferrer noopener">A Quick and Dirty Dip Into Cutting-Edge Open-Source LLM Research</a></p>
<p>OpenAI’s leadership seems to have taken notice of these events: according to <a href="https://www.reuters.com/technology/openai-readies-new-open-source-ai-model-information-2023-05-15/" target="_blank" rel="noreferrer noopener">a report by Reuters</a>, the company is preparing to release an open-source LLM. OpenAI’s <a href="https://blog.finxter.com/10-high-iq-things-gpt-4-can-do-that-gpt-3-5-cant/" data-type="post" data-id="1257087" target="_blank" rel="noreferrer noopener">GPT-4</a> is almost universally regarded as the best-performing LLM available, so an open-source model from them would be no small event, even if it is weaker than GPT-4.</p>
<p>Finding out exactly how an OpenAI foundation model is built would give the open-source community a wealth of knowledge that they can apply to their other projects.</p>
<p>It would also show how seriously OpenAI takes open source and the community surrounding it, and that they’re fully aware their only chance of maintaining LLM dominance is to let the world improve and iterate on their designs.</p>
<p>That open source is making such swift and definite progress toward <a href="https://blog.finxter.com/google-says-we-have-no-moat-and-neither-does-openai/" data-type="post" data-id="1339877" target="_blank" rel="noreferrer noopener">taking the crown away from OpenAI can hardly be a surprise</a>. The wisdom of the crowd foretold it: the insight and understanding of a relative few can never match the collective knowledge and experience of tens of millions.</p>
<h2 class="wp-block-heading">The Ultimate Open-Source LLM Battle – Who Wins?</h2>
<p>On the <a rel="noreferrer noopener" href="https://chat.lmsys.org/?arena" target="_blank">chatbot arena site</a> managed by LMSYS, visitors enter a prompt, and two randomly selected models each provide a response.</p>
<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="1024" height="721" src="https://blog.finxter.com/wp-content/uploads/2023/05/image-336-1024x721.png" alt="" class="wp-image-1380746" srcset="https://blog.finxter.com/wp-content/uploads/2023/05/image-336-1024x721.png 1024w, https://blog.finxter.com/wp-content/uplo...00x211.png 300w, https://blog.finxter.com/wp-content/uplo...68x540.png 768w, https://blog.finxter.com/wp-content/uplo...ge-336.png 1485w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>
</div>
<p>The model whose response the user prefers then moves up the leaderboard, while the other moves down.</p>
<p>The following are the three highest-rated open-source models in that arena, ranking just behind GPT-4 (Elo rating of 1274), Anthropic’s Claude (rating of 1224), and GPT-3.5-Turbo (rating of 1155).</p>
<figure class="wp-block-image size-large"><img decoding="async" loading="lazy" width="1024" height="695" src="https://blog.finxter.com/wp-content/uploads/2023/05/image-335-1024x695.png" alt="" class="wp-image-1380744" srcset="https://blog.finxter.com/wp-content/uploads/2023/05/image-335-1024x695.png 1024w, https://blog.finxter.com/wp-content/uplo...00x204.png 300w, https://blog.finxter.com/wp-content/uplo...68x521.png 768w, https://blog.finxter.com/wp-content/uplo...ge-335.png 1093w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>
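<p>The way a single vote nudges two ratings can be sketched with the standard Elo formulas. This is only an illustration: the function names are invented here, and the step size <code>k=32</code> is an assumed value, not necessarily what the arena itself uses.</p>

```python
def expected_score(rating_a, rating_b):
    """Probability that A is preferred over B under the Elo model."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


def update_ratings(rating_a, rating_b, a_won, k=32.0):
    """Return new (rating_a, rating_b) after one head-to-head vote.

    k controls how far a single vote moves the ratings; 32 is an
    assumed value for illustration, not the arena's actual setting.
    """
    expected_a = expected_score(rating_a, rating_b)
    score_a = 1.0 if a_won else 0.0
    delta = k * (score_a - expected_a)
    # Whatever A gains, B loses, so the sum of ratings is preserved.
    return rating_a + delta, rating_b - delta


# Ratings quoted above: GPT-4 at 1274, GPT-3.5-Turbo at 1155.
p = expected_score(1274, 1155)
print(f"GPT-4 expected to be preferred about {p:.0%} of the time")

# An upset (the lower-rated model wins the vote) shifts both ratings.
new_gpt4, new_gpt35 = update_ratings(1274, 1155, a_won=False)
print(round(new_gpt4), round(new_gpt35))
```

<p>Because the update depends on how surprising the result is, an upset moves the ratings more than an expected win does.</p>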
<h2 class="wp-block-heading"><strong>Vicuna-13B</strong></h2>
<p>Trained by LMSYS, an open research organization based at UC Berkeley, this is the most promising model to come out of the LLaMA leak.</p>
<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" loading="lazy" width="731" height="408" src="https://blog.finxter.com/wp-content/uploads/2023/05/image-337.png" alt="" class="wp-image-1380747" srcset="https://blog.finxter.com/wp-content/uploads/2023/05/image-337.png 731w, https://blog.finxter.com/wp-content/uplo...00x167.png 300w" sizes="(max-width: 731px) 100vw, 731px" /></figure>
</div>
<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/11-best-chatgpt-alternatives/" data-type="post" data-id="1341399" target="_blank" rel="noreferrer noopener">11 Best ChatGPT Alternatives</a></p>
<p>It reportedly achieves 90% of the response quality of <a href="https://blog.finxter.com/did-chatgpt-just-kill-freelancing-%f0%9f%98%b5/" data-type="post" data-id="1348498" target="_blank" rel="noreferrer noopener">ChatGPT</a> and Google’s Bard, based on an informal evaluation that used GPT-4 as the judge. The team accomplished this at a training cost of just $300. Its arena rating is 1083.</p>
<div class="wp-block-image">
<figure class="aligncenter size-large"><img decoding="async" loading="lazy" width="1024" height="559" src="https://blog.finxter.com/wp-content/uploads/2023/05/image-334-1024x559.png" alt="" class="wp-image-1380736" srcset="https://blog.finxter.com/wp-content/uploads/2023/05/image-334-1024x559.png 1024w, https://blog.finxter.com/wp-content/uplo...00x164.png 300w, https://blog.finxter.com/wp-content/uplo...68x419.png 768w, https://blog.finxter.com/wp-content/uplo...36x838.png 1536w, https://blog.finxter.com/wp-content/uplo...ge-334.png 1600w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>
</div>
<h2 class="wp-block-heading"><strong>Koala-13B</strong></h2>
<p>Coming from BAIR, another group within UC Berkeley, this is a dialogue model meant for academic research. It aims to answer the question of whether open-source models can overcome the massive scale advantage of closed models through better curation of training data. It comes in with a rating of 1022.</p>
<div class="wp-block-image">
<figure class="aligncenter size-large"><img decoding="async" loading="lazy" width="1024" height="559" src="https://blog.finxter.com/wp-content/uploads/2023/05/image-332-1024x559.png" alt="" class="wp-image-1380734" srcset="https://blog.finxter.com/wp-content/uploads/2023/05/image-332-1024x559.png 1024w, https://blog.finxter.com/wp-content/uplo...00x164.png 300w, https://blog.finxter.com/wp-content/uplo...68x419.png 768w, https://blog.finxter.com/wp-content/uplo...36x838.png 1536w, https://blog.finxter.com/wp-content/uplo...ge-332.png 1600w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>
</div>
<h2 class="wp-block-heading"><strong>RWKV-4-Raven-14B</strong></h2>
<p>Impressively, this model was developed by a single person known by the username BlinkDL. </p>
<p>Even more impressively, it’s an RNN (recurrent neural network) LLM rather than the ubiquitous Transformer <a href="https://blog.finxter.com/the-evolution-of-large-language-models-llms-insights-from-gpt-4-and-beyond/" data-type="post" data-id="1267220" target="_blank" rel="noreferrer noopener">LLM</a>. The advent of the Transformer architecture is what made models as powerful as GPT-4 possible.</p>
<p>If people like BlinkDL keep finding ways to optimize older architectures, the result could soon be a hybrid architecture that overtakes Transformers in both performance and speed. This model’s rating is a respectable 989.</p>
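<p>To make the RNN idea concrete, here is a minimal sketch of a plain recurrent step in Python. It is not RWKV's actual formulation, which is considerably more elaborate; the function names, weights, and toy inputs are all invented for illustration. The point is only that an RNN reads tokens one at a time and carries a fixed-size state forward, whereas a Transformer's attention looks back over all previous tokens at each step.</p>

```python
import math


def rnn_step(hidden, token_vec, w_hh, w_xh):
    """One step of a plain (Elman-style) RNN: h' = tanh(W_hh h + W_xh x)."""
    new_hidden = []
    for i in range(len(hidden)):
        total = sum(w_hh[i][j] * hidden[j] for j in range(len(hidden)))
        total += sum(w_xh[i][j] * token_vec[j] for j in range(len(token_vec)))
        new_hidden.append(math.tanh(total))
    return new_hidden


def run_rnn(token_vecs, w_hh, w_xh, hidden_size):
    """Process a sequence one token at a time, carrying a fixed-size state."""
    hidden = [0.0] * hidden_size
    for vec in token_vecs:
        hidden = rnn_step(hidden, vec, w_hh, w_xh)
    return hidden


# Toy weights and inputs, chosen arbitrarily for the example.
w_hh = [[0.5, -0.1], [0.2, 0.3]]
w_xh = [[1.0, 0.0, -1.0], [0.0, 1.0, 0.5]]
sequence = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]

state = run_rnn(sequence, w_hh, w_xh, hidden_size=2)
# The state keeps the same length however long the input sequence is.
print(len(state))
```

<p>Since the state never grows, the memory needed per generated token stays constant, which is one reason recurrent designs remain attractive for speed.</p>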
<div class="wp-block-image">
<figure class="aligncenter size-large"><img decoding="async" loading="lazy" width="1024" height="559" src="https://blog.finxter.com/wp-content/uploads/2023/05/image-333-1024x559.png" alt="" class="wp-image-1380735" srcset="https://blog.finxter.com/wp-content/uploads/2023/05/image-333-1024x559.png 1024w, https://blog.finxter.com/wp-content/uplo...00x164.png 300w, https://blog.finxter.com/wp-content/uplo...68x419.png 768w, https://blog.finxter.com/wp-content/uplo...36x838.png 1536w, https://blog.finxter.com/wp-content/uplo...ge-333.png 1600w" sizes="(max-width: 1024px) 100vw, 1024px" /></figure>
</div>
<h2 class="wp-block-heading">Civilization-Defining Power Through Artificial General Intelligence</h2>
<p>Open-source is a term that can bring out patronizing feelings in people because, after all, a lot of the best programs we know today are closed-source and are chosen by billions of people each year. But that is only because the wider community has had no pressing reason to develop superior open-source alternatives that the general public would prefer.</p>
<p>It’s a much different case with AI. </p>
<p>A few companies holding such immense and civilization-defining power for themselves is not a future that anyone who truly understands the capabilities of AI would want.</p>
<p><a href="https://blog.finxter.com/what-is-artificial-general-intelligence-a-comprehensive-overview/" data-type="post" data-id="1238791" target="_blank" rel="noreferrer noopener">Artificial general intelligence</a> is just around the corner, and with it, a complete reimagining of society as we know it. It is a tool that every single person should have equal access to. That reality would bring about a golden age that humanity has never before experienced in all its history.</p>
<p>No matter what anyone says, hoarding any AI knowledge for oneself is a complete disservice to the good of humanity.</p>
<p>Rather than being reserved for the privileged few, a world where AI can be developed and iterated upon by any and all is the only way any sort of utopia can be achieved. Through open-source AI, the dreams and optimism of some of our favorite <a href="https://blog.finxter.com/how-i-created-a-sci-fi-story-book-using-chatgpt-and-midjourney/" data-type="post" data-id="1252692" target="_blank" rel="noreferrer noopener">sci-fi stories</a> will finally be brought to life.</p>
<p class="has-base-2-background-color has-background"><img src="https://s.w.org/images/core/emoji/14.0.0/72x72/1f4a1.png" alt="💡" class="wp-smiley" style="height: 1em; max-height: 1em;" /> <strong>Recommended</strong>: <a href="https://blog.finxter.com/minigpt-4-the-latest-breakthrough-in-language-generation-technology/" data-type="URL" data-id="https://blog.finxter.com/minigpt-4-the-latest-breakthrough-in-language-generation-technology/" target="_blank" rel="noreferrer noopener">MiniGPT-4: The Latest Breakthrough in Language Generation Technology</a></p>
</div>


https://www.sickgaming.net/blog/2023/05/...rful-tool/