{"id":25182,"date":"2018-06-13T14:09:39","date_gmt":"2018-06-13T14:09:39","guid":{"rendered":"http:\/\/www.sickgaming.net\/blog\/2018\/06\/13\/ai-is-coming-to-edge-computing-devices\/"},"modified":"2018-06-13T14:09:39","modified_gmt":"2018-06-13T14:09:39","slug":"ai-is-coming-to-edge-computing-devices","status":"publish","type":"post","link":"https:\/\/sickgaming.net\/blog\/2018\/06\/13\/ai-is-coming-to-edge-computing-devices\/","title":{"rendered":"AI Is Coming to Edge Computing Devices"},"content":{"rendered":"<div><img decoding=\"async\" src=\"http:\/\/www.sickgaming.net\/blog\/wp-content\/uploads\/2018\/06\/ai-is-coming-to-edge-computing-devices.jpg\" class=\"ff-og-image-inserted\" \/><\/div>\n<p><span><span>Very few non-server systems run software that could be called machine learning (ML) and artificial intelligence (AI). Yet, server-class \u201cAI on the Edge\u201d applications are coming to embedded devices, and Arm intends to fight with Intel and AMD over every last one of them. <\/span><\/span><\/p>\n<p><span><span>Arm recently <\/span><a href=\"https:\/\/www.arm.com\/news\/2018\/05\/arm-announces-new-suite-of-ip-for-premium-mobile-experiences\"><span>announced<\/span><\/a><span> a new <\/span><a href=\"https:\/\/community.arm.com\/processors\/b\/blog\/posts\/cortex-a76-laptop-class-performance-with-mobile-efficiency\"><span>Cortex-A76<\/span><\/a><span> architecture that is claimed to boost the processing of AI and ML algorithms on edge computing devices by a factor of four. This does not include ML performance gains promised by the new Mali-G76 GPU. There\u2019s also a Mali-V76 VPU designed for high-res video. The Cortex-A76 and two Mali designs are designed to \u201ccomplement\u201d Arm\u2019s Project Trillium Machine Learning processors (see below).<\/span><\/span><\/p>\n<h3><span><span>Improved performance<\/span><\/span><\/h3>\n<p><span><span>The Cortex-A76 differs from the <\/span><a href=\"https:\/\/www.linux.com\/news\/mediateks-10nm-mobile-focused-soc-will-tap-cortex-a73-and-a32\"><span>Cortex-A73<\/span><\/a><span> and <\/span><a href=\"http:\/\/linuxgizmos.com\/arm-debuts-cortex-a75-and-cortex-a55-with-ai-in-mind\/\"><span>Cortex-A75<\/span><\/a><span> IP designs in that it\u2019s designed as much for laptops as for smartphones and high-end embedded devices. Cortex-A76 provides \u201c35 percent more performance year-over-year,\u201d compared to Cortex-A75, claims Arm. The IP, which is expected to arrive in products a year from now, is also said to provide 40 percent improved efficiency.<\/span><\/span><\/p>\n<p><span><span>Like Cortex-A75, which is equivalent to the latest Kyro cores available on Qualcomm\u2019s <\/span><a href=\"http:\/\/linuxgizmos.com\/hot-chips-on-parade-at-mwc-and-embedded-world\/\"><span>Snapdragon 845<\/span><\/a><span>, the Cortex-A76 supports <\/span><a href=\"http:\/\/linuxgizmos.com\/arm-boosts-big-little-with-dynamiq-and-launches-linux-dev-kit\/\"><span>DynamIQ<\/span><\/a><span>, Arm\u2019s more flexible version of its Big.Little multi-core scheme. Unlike Cortex-A75, which was announced with a Cortex-A55 companion chip, Arm had no new DynamIQ companion for the Cortex-A76.<\/span><\/span><\/p>\n<p><span><span>Cortex-A76 enhancements are said to include decoupled branch\u00a0prediction\u00a0and instruction fetch, as well as Arm\u2019s\u00a0first 4-wide decode core,\u00a0which boosts the maximum instruction per cycle capability.\u00a0There\u2019s also higher integer and vector execution throughput, including support for dual-issue native 16B\u00a0(128-bit) vector\u00a0and floating-point\u00a0units. Finally, the new full-cache memory hierarchy is \u201cco-optimized for latency and\u00a0bandwidth,\u201d says Arm.<\/span><\/span><\/p>\n<p><span><span>Unlike the latest high-end Cortex-A releases, Cortex-A76 represents \u201ca\u00a0brand new microarchitecture,\u201d says Arm. This is confirmed by <\/span><a href=\"https:\/\/www.anandtech.com\/show\/12785\/arm-cortex-a76-cpu-unveiled-7nm-powerhouse\"><span>AnandTech\u2019s<\/span><\/a><span> usual deep-dive analysis. Cortex-A73 and -A75 debuted elements of the new \u201cArtemis\u201d architecture, but the Cortex-A76 is built from scratch with Artemis. <\/span><\/span><\/p>\n<p><span><span>The Cortex-A76<\/span><span>\u00a0should arrive on 7nm-fabricated TSMC products running at 3GHz, says AnandTech. The 4x improvements in ML workloads are primarily due to new optimizations in the ASIMD pipelines \u201cand how dot products are handled,\u201d says the story. <\/span><\/span><\/p>\n<p><span><span>Meanwhile, <\/span><a href=\"https:\/\/www.theregister.co.uk\/2018\/05\/31\/arm_cortex_a76\/\"><span>The Register<\/span><\/a><span> noted that Cortex-A76 is Arm\u2019s first design that will exclusively run 64-bit kernel-level code. The cores will support 32-bit code, but only at non-privileged levels, says the story..<\/span><\/span><\/p>\n<h3><span><span>Mali-G76 GPU and Mali-G72 VPU<\/span><\/span><\/h3>\n<p><span><span>The new Mali-<\/span><span>G76<\/span><span> GPU announced with Cortex-A76 targets gaming, VR, AR, and <\/span><span>on-device ML<\/span><span>. The <\/span><span>Mali-G76 is said to provide 30 percent more efficiency and performance density and 1.5x improved performance for mobile gaming. The Bifrost architecture GPU also provides 2.7x ML performance improvements\u00a0compared to the Mali-G72, which was announced last year with the Cortex-A75.<\/span><\/span><\/p>\n<p><span><span>The Mali-V76 VPU supports <\/span><span>UHD 8K viewing experiences. It\u2019s <\/span><span>aimed at <\/span><span>4&#215;4 video walls, which are especially popular in China and is designed to support the 8K video coverage, which Japan is promising for the 2020 Olympics. 8K@60 streams require four times the bandwidth of 4K@60 streams. To achieve this, Arm added an extra AXI bus and doubled the line buffers throughout the video pipeline. The VPU also supports 8K@30 decode.<\/span><\/span><\/p>\n<h3><span><span>Project Trillium\u2019s ML chip detailed<\/span><\/span><\/h3>\n<p><span><span>Arm previously revealed other details about the <\/span><a href=\"https:\/\/developer.arm.com\/products\/processors\/machine-learning\/arm-ml-processor\"><span>Machine Learning<\/span><\/a><span> (ML) <\/span><span>processor, also referred to as MLP. The ML chip will accelerate AI applications including machine translation and face recognition.\u00a0<\/span><\/span><\/p>\n<p><span><span>The new processor architecture is part of the Project Trillium initiative for AI, and follows Arm\u2019s second-gen Object Detection (OD) Processor for optimizing visual processing and people\/object detection.\u00a0The ML design will initially debut as a <\/span><span>co-processor in mobile phones by late 2019.<\/span><\/span><\/p>\n<p><span><span>Join us at <\/span><a href=\"https:\/\/events.linuxfoundation.org\/events\/elc-openiot-europe-2018\/\"><span>Open Source Summit + Embedded Linux Conference Europe<\/span><\/a><span> in Edinburgh, UK on October 22-24, 2018, for 100+ sessions on Linux, Cloud, Containers, AI, Community, and more. <\/span><\/span><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Very few non-server systems run software that could be called machine learning (ML) and artificial intelligence (AI). Yet, server-class \u201cAI on the Edge\u201d applications are coming to embedded devices, and Arm intends to fight with Intel and AMD over every last one of them. Arm recently announced a new Cortex-A76 architecture that is claimed to [&hellip;]<\/p>\n","protected":false},"author":2,"featured_media":25183,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[40],"tags":[],"class_list":["post-25182","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-linux-freebsd-unix"],"_links":{"self":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts\/25182","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/comments?post=25182"}],"version-history":[{"count":0,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/posts\/25182\/revisions"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/media\/25183"}],"wp:attachment":[{"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/media?parent=25182"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/categories?post=25182"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sickgaming.net\/blog\/wp-json\/wp\/v2\/tags?post=25182"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}