{"id":207,"date":"2019-01-14T16:37:06","date_gmt":"2019-01-14T16:37:06","guid":{"rendered":"http:\/\/pepcc.inesctec.pt\/?page_id=207"},"modified":"2020-09-01T13:12:19","modified_gmt":"2020-09-01T13:12:19","slug":"challenges","status":"publish","type":"page","link":"https:\/\/pepcc.inesctec.pt\/?page_id=207","title":{"rendered":"Project Activities"},"content":{"rendered":"\t\t<div data-elementor-type=\"wp-post\" data-elementor-id=\"207\" class=\"elementor elementor-207\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-a123c69 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"a123c69\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-07266f5\" data-id=\"07266f5\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-59467cf elementor-widget elementor-widget-heading\" data-id=\"59467cf\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h1 class=\"elementor-heading-title elementor-size-xl\">Project Activities<\/h1>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-4f92c7e elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"4f92c7e\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fe2578f elementor-widget elementor-widget-heading\" data-id=\"fe2578f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Background<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-33d7d14 elementor-widget elementor-widget-wp-widget-text\" data-id=\"33d7d14\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>Modern embedded systems (ES) and HPC systems must operate under evermore similar requirements or constraints, which has pressured both domains into increasingly heterogeneous systems. While ES are put under increasing load by complex applications, HPC systems are struggling to provide power-efficiency as they scale [10].\u00a0 The use of specialized and potentially self-adaptive hardware which targets specific and very demanding tasks has been identifies as a possible solution to address these issues for both domains [9].<\/p>\n<p>Many applications in both domains have a small number of regular computational kernels that account for most of the execution time and energy consumption. The manual development of accelerators requires significant design time and hardware expertise. However, it is vital to not compromise developer productivity by demanding manual hardware development and source code alterations.<\/p>\n<p>The PEPCC project was <strong>created from the converging scientific works of its team members<\/strong>, which collectively address the issue of how to design systems, in the embedded and high-performance computing (HPC) domains, which provide computation that is both fast and power-efficient, by means of heterogenous computing architectures and respective compilation methods which target them [1-8].<\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-1027b5d elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"1027b5d\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-06bfb2f\" data-id=\"06bfb2f\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-86a5956 elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"86a5956\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-0b13885 elementor-widget elementor-widget-heading\" data-id=\"0b13885\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">Objectives and Research Plan<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-7efdfbe elementor-widget elementor-widget-wp-widget-text\" data-id=\"7efdfbe\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>PEPPC aims to address the highlighted issues regarding efficient computation by furthering exploring the research paths of its constituent <a href=\"http:\/\/pepcc.inesctec.pt\/?page_id=70\">team members<\/a> by structuring said efforts into a <strong>convergent, coherent, and structured<\/strong> overarching research plan.<\/p>\n<p>Given the convergence of requirements between ES and HPC, the mapping of computations to specialized or reconfigurable circuits at runtime (dynamic mapping) has growing importance as a research goal. An efficient implementation of this capability will\u00a0allow for hardware resources of reconfigurable circuits to be exploited by any application running in the system,\u00a0 addressing both power efficiency and performance concerns, and contribute to code portability and performance scalability. If effective and efficient, runtime mapping schemes will increase productivity by offloading demanding computation to specialized hardware with minimal developer intervention, and based on actual application workload. Exploiting runtime information is paramount in order to fulfill this goal.<\/p>\n<p>The project is divided into several activities, whose joint goal is too:<\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-e38d749 elementor-widget elementor-widget-wp-widget-text\" data-id=\"e38d749\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p style=\"margin: auto;text-align: center\"><strong><em>Improve performance and power efficiency of regular computational kernels in ES and HPS systems,\u00a0by efficient runtime mapping of computations to specialized CGRA, while ensuring the most transparency to the application developer.<\/em><\/strong><\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-48f5423 elementor-widget elementor-widget-wp-widget-text\" data-id=\"48f5423\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>This will be accomplished by researching advanced <strong>binary trace analysis<\/strong> methods for Just-in-Time hardware generation and (re-)configuration. Information from this process will also be used to dynamically adapt existing CGRAs by hardware modification, or to select existing specialized CGRAs based on the workload to process. To study solutions which balance specialization and programmability, the project will also further develop the existing Versat architecture [5]. Finally, the binary analysis and hardware generation methods to be developed in the aforementioned activities will require studying methods though which they can be exploited and integrated into ES and HPC systems.<\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-0014745 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"0014745\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-ff6a982\" data-id=\"ff6a982\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-75de36e elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"75de36e\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-a47b9ef\" data-id=\"a47b9ef\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-b1c40a1 elementor-widget elementor-widget-heading\" data-id=\"b1c40a1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">A1.Advanced Trace Analysis for JIT Hardware Generation<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-8e9d6d6\" data-id=\"8e9d6d6\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-f32d071 elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"f32d071\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<div class=\"elementor-element elementor-element-c67d9a5 elementor-widget elementor-widget-wp-widget-text\" data-id=\"c67d9a5\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>Extracting more information from the binary traces will enable the identification and exploitation of more parallelism than currently exposed by static analysis. Binary traces called Megablocks [2,3] will be used as the starting point for the representation of knowledge about repetitive instruction traces; the model will be expanded to include the additional extracted information. The project will address these specific issues: <strong>loop dependency analysis, memory access disambiguation<\/strong> <strong>and detection of streaming access patterns, and data specialization.<\/strong><\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-6b3a263 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"6b3a263\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-ac75030\" data-id=\"ac75030\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-4841b3f elementor-view-default elementor-widget elementor-widget-icon\" data-id=\"4841b3f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-icon-wrapper\">\n\t\t\t<div class=\"elementor-icon\">\n\t\t\t\t\t\t\t<i class=\"fa fa-star-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-60e818e\" data-id=\"60e818e\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-e6c36ba elementor-widget elementor-widget-heading\" data-id=\"e6c36ba\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-small\">Milestones<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-8dd0019 elementor-align-start elementor-icon-list--layout-traditional elementor-list-item-link-full_width elementor-widget elementor-widget-icon-list\" data-id=\"8dd0019\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon-list.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<ul class=\"elementor-icon-list-items\">\n\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M1.1: Algorithm for constrained carried loop dependency detection from binary traces<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M1.2: Algorithm for memory access disambiguation and detection of streaming memory access patterns<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M1.3: Algorithm for data  specialization of Megablocks<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M1.4: Experimental evaluation of proposed methods<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-778ef1e elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"778ef1e\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-bf0c90e\" data-id=\"bf0c90e\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-d4db6de elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"d4db6de\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-e7d1b6b\" data-id=\"e7d1b6b\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-43d1a99 elementor-widget elementor-widget-heading\" data-id=\"43d1a99\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">A2.Customized CGRAs for Dynamic Mapping of Computations<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-858644d\" data-id=\"858644d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-7ab341d elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"7ab341d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<div class=\"elementor-element elementor-element-a02c62c elementor-widget elementor-widget-wp-widget-text\" data-id=\"a02c62c\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>The project will develop CGRA architectures which efficiently support dynamic mapping of computations, as well as the respective scheduling and hardware compilation algorithms and implementations. The point is to develop CGRA architectures which are suitable targets for JIT hardware generation\/(re-)configuration. Two strategies will be employed: 1) designing an architecture for FPGA devices wherein a fixed set of computing resources is complemented by resorting to Dynamic Partial Reconfiguration to add\/modify additional specialized hardware,\u00a0 and 2) improving existing CGRA architectures via multistage interconnection networks to allow for more versatile data flow at an affordable hardware overhead.<\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-5d84ac2 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"5d84ac2\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-2c94370\" data-id=\"2c94370\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-051f550 elementor-view-default elementor-widget elementor-widget-icon\" data-id=\"051f550\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-icon-wrapper\">\n\t\t\t<div class=\"elementor-icon\">\n\t\t\t\t\t\t\t<i class=\"fa fa-star-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-52c1bf3\" data-id=\"52c1bf3\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-aac17a7 elementor-widget elementor-widget-heading\" data-id=\"aac17a7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-small\">Milestones<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-fba95fd elementor-align-start elementor-icon-list--layout-traditional elementor-list-item-link-full_width elementor-widget elementor-widget-icon-list\" data-id=\"fba95fd\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon-list.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<ul class=\"elementor-icon-list-items\">\n\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M2.1: CGRA Generation 1 - Module for dynamic mapping of Megablocks to customized 1D CGRAs<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M2.2: Prototype validation of M2.1<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M2.3: CGRA Generation 2 - Enchancement for memory disambiguation and data resuse<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M2.4: Prototype validation of M2.3<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-db07daf elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"db07daf\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-26cd512\" data-id=\"26cd512\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-cab1209 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"cab1209\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-1c37056\" data-id=\"1c37056\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-173ae18 elementor-widget elementor-widget-heading\" data-id=\"173ae18\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">A3.Programmable CGRA Architecture for Fast Reconfiguration<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-2827426\" data-id=\"2827426\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-d5e4417 elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"d5e4417\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<div class=\"elementor-element elementor-element-c23fc63 elementor-widget elementor-widget-wp-widget-text\" data-id=\"c23fc63\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>We aim to improve existing CGRA architectures, namely Versat [5], by employing the results of <strong>A1<\/strong> regarding runtime binary information. Namely, improving Versat requires: 1) extracting loop parameters, 2) sequencing configurations in order to speed up CGRA reconfiguration time and hide data movement by pre-fetching, and 3) exposing data level parallelism, to enhance acceleration beyond pipelining. Another aspect that can be explored with the Versat controller is Thread Level Parallelism (TLP). Small independent datapaths can be set to run within Versat, since the 4 dual-port embedded memories present in the architecture have independent address generation units for each port.<\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-a360be4 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"a360be4\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-f3cdfaf\" data-id=\"f3cdfaf\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6ffa25f elementor-view-default elementor-widget elementor-widget-icon\" data-id=\"6ffa25f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-icon-wrapper\">\n\t\t\t<div class=\"elementor-icon\">\n\t\t\t\t\t\t\t<i class=\"fa fa-star-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-fcc9e0a\" data-id=\"fcc9e0a\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-86e28ee elementor-widget elementor-widget-heading\" data-id=\"86e28ee\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-small\">Milestones<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-d917bdb elementor-align-start elementor-icon-list--layout-traditional elementor-list-item-link-full_width elementor-widget elementor-widget-icon-list\" data-id=\"d917bdb\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon-list.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<ul class=\"elementor-icon-list-items\">\n\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M3.1: Extracting high-level Intermediate Representations (IRs) from memory access patterns and dependencies<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M3.2: Translation of IR into Versat assembly code<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M3.3: Integration into HPC and ES environments<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-7cf7894 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"7cf7894\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-7cf5563\" data-id=\"7cf5563\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-55ab1e0 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"55ab1e0\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-b2a7264\" data-id=\"b2a7264\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-5f721b1 elementor-widget elementor-widget-heading\" data-id=\"5f721b1\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">A4.Run-time Management for HPC Systems<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-871f092\" data-id=\"871f092\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-94ca7d7 elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"94ca7d7\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<div class=\"elementor-element elementor-element-5dcea77 elementor-widget elementor-widget-wp-widget-text\" data-id=\"5dcea77\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>We will exploit the opportunities offered by the Intermediate Representation&#8217;s (IR) access to an HPC system whose nodes have an experimental Intel chip that combines a 12-core Xeon and a large Intel Arria 10 FPGA, a powerful platform that will enable us to measure the impact of the proposed approaching a live environment. The objective is to implement a hardware accelerator infrastructure, based on tailored CGRAs, and run-time management algorithms to support <em>N<\/em> customized accelerators, dynamically choosing the most appropriate hardware to execute heavy workload,\u00a0 along with mapping the binary representations (Megablocks) of the workload to said chosen hardware. This approach will be compared with the use of the Versat programmable CGRA.<strong><br \/>\n<\/strong><\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-a2513a6 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"a2513a6\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-bd416bb\" data-id=\"bd416bb\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-6083d1d elementor-view-default elementor-widget elementor-widget-icon\" data-id=\"6083d1d\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-icon-wrapper\">\n\t\t\t<div class=\"elementor-icon\">\n\t\t\t\t\t\t\t<i class=\"fa fa-star-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-029545a\" data-id=\"029545a\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-b1a6cee elementor-widget elementor-widget-heading\" data-id=\"b1a6cee\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-small\">Milestones<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-837428a elementor-align-start elementor-icon-list--layout-traditional elementor-list-item-link-full_width elementor-widget elementor-widget-icon-list\" data-id=\"837428a\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon-list.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<ul class=\"elementor-icon-list-items\">\n\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M4.1: Creation of a CGRA library based on binary trace analysis<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M4.2: Runtime management - Use fixed policies for accelerator (i.e., CGRA) selection<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M4.3: Runtime management - Use guided policies for accelerator (i.e., CGRA) selection<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M4.4: Experimental evaluation of performance and power efficiency vs. static accelerators<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-76eed97 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"76eed97\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-b20d316\" data-id=\"b20d316\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-fb38aa0 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"fb38aa0\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-93df879\" data-id=\"93df879\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-5c1035f elementor-widget elementor-widget-heading\" data-id=\"5c1035f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-default\">A5.Run-time Management for Embedded Systems<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-3586a65\" data-id=\"3586a65\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-edd437f elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"edd437f\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<div class=\"elementor-element elementor-element-ca0043b elementor-widget elementor-widget-wp-widget-text\" data-id=\"ca0043b\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"wp-widget-text.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t\t<div class=\"textwidget\"><p>This activity addresses the implementation of the complete runtime mapping infrastructure for embedded systems. We will follow a hardware\/software co-design approach, as we will analyze which components need to be implemented as custom hardware and each ones can be implemented by simple processors (such as a MicroBlaze or an embedded ARM). The extraction of trace information at runtime requires the processing of large amounts of data with limited resources. Therefore, our approach will be centered on stream-based versions of the algorithms developed in <strong>A1<\/strong>, in order to deal with a number of system constraints such as storage of execution traces and intermediate representations. The algorithms will have to work on limited scope using local memories storing windows of execution traces.<\/p>\n<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<section class=\"elementor-section elementor-inner-section elementor-element elementor-element-3a4a0d6 elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"3a4a0d6\" data-element_type=\"section\" data-e-type=\"section\">\n\t\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-9e9bc9d\" data-id=\"9e9bc9d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-b362e20 elementor-view-default elementor-widget elementor-widget-icon\" data-id=\"b362e20\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-icon-wrapper\">\n\t\t\t<div class=\"elementor-icon\">\n\t\t\t\t\t\t\t<i class=\"fa fa-star-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t<div class=\"elementor-column elementor-col-50 elementor-inner-column elementor-element elementor-element-fe61a0e\" data-id=\"fe61a0e\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-418b1d6 elementor-widget elementor-widget-heading\" data-id=\"418b1d6\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"heading.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<h2 class=\"elementor-heading-title elementor-size-small\">Milestones<\/h2>\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-32c0e07 elementor-align-start elementor-icon-list--layout-traditional elementor-list-item-link-full_width elementor-widget elementor-widget-icon-list\" data-id=\"32c0e07\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"icon-list.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<ul class=\"elementor-icon-list-items\">\n\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M5.1: Embedded Runtime Management 1<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M5.2: Embedded Runtime Management 2 - Versat Support<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M5.3: HLS\/GPU Accelerators<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t\t\t<li class=\"elementor-icon-list-item\">\n\t\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-icon\">\n\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t\t<i class=\"fa fa-square-o\" aria-hidden=\"true\"><\/i>\n\t\t\t\t\t\t\t\t\t\t\t\t\t<\/span>\n\t\t\t\t\t\t\t\t\t\t<span class=\"elementor-icon-list-text\">M5.4: Experimental evaluation of accelerators<\/span>\n\t\t\t\t\t\t\t\t\t<\/li>\n\t\t\t\t\t\t<\/ul>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<section class=\"elementor-section elementor-top-section elementor-element elementor-element-c06153b elementor-section-boxed elementor-section-height-default elementor-section-height-default\" data-id=\"c06153b\" data-element_type=\"section\" data-e-type=\"section\" data-settings=\"{&quot;shape_divider_bottom&quot;:&quot;mountains&quot;}\">\n\t\t\t\t\t<div class=\"elementor-shape elementor-shape-bottom\" aria-hidden=\"true\" data-negative=\"false\">\n\t\t\t<svg xmlns=\"http:\/\/www.w3.org\/2000\/svg\" viewBox=\"0 0 1000 100\" preserveAspectRatio=\"none\">\n\t<path class=\"elementor-shape-fill\" opacity=\"0.33\" d=\"M473,67.3c-203.9,88.3-263.1-34-320.3,0C66,119.1,0,59.7,0,59.7V0h1000v59.7 c0,0-62.1,26.1-94.9,29.3c-32.8,3.3-62.8-12.3-75.8-22.1C806,49.6,745.3,8.7,694.9,4.7S492.4,59,473,67.3z\"\/>\n\t<path class=\"elementor-shape-fill\" opacity=\"0.66\" d=\"M734,67.3c-45.5,0-77.2-23.2-129.1-39.1c-28.6-8.7-150.3-10.1-254,39.1 s-91.7-34.4-149.2,0C115.7,118.3,0,39.8,0,39.8V0h1000v36.5c0,0-28.2-18.5-92.1-18.5C810.2,18.1,775.7,67.3,734,67.3z\"\/>\n\t<path class=\"elementor-shape-fill\" d=\"M766.1,28.9c-200-57.5-266,65.5-395.1,19.5C242,1.8,242,5.4,184.8,20.6C128,35.8,132.3,44.9,89.9,52.5C28.6,63.7,0,0,0,0 h1000c0,0-9.9,40.9-83.6,48.1S829.6,47,766.1,28.9z\"\/>\n<\/svg>\t\t<\/div>\n\t\t\t\t\t<div class=\"elementor-container elementor-column-gap-default\">\n\t\t\t\t\t<div class=\"elementor-column elementor-col-100 elementor-top-column elementor-element elementor-element-a71ba8d\" data-id=\"a71ba8d\" data-element_type=\"column\" data-e-type=\"column\">\n\t\t\t<div class=\"elementor-widget-wrap elementor-element-populated\">\n\t\t\t\t\t\t<div class=\"elementor-element elementor-element-376b083 elementor-widget-divider--view-line elementor-widget elementor-widget-divider\" data-id=\"376b083\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"divider.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t\t\t<div class=\"elementor-divider\">\n\t\t\t<span class=\"elementor-divider-separator\">\n\t\t\t\t\t\t<\/span>\n\t\t<\/div>\n\t\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t<div class=\"elementor-element elementor-element-ccfc2fa elementor-widget elementor-widget-html\" data-id=\"ccfc2fa\" data-element_type=\"widget\" data-e-type=\"widget\" data-widget_type=\"html.default\">\n\t\t\t\t<div class=\"elementor-widget-container\">\n\t\t\t\t\t<ul class=\"papercite_bibliography\">\n<li> [1] <a href=\"http:\/\/dx.doi.org\/10.1109\/TVLSI.2018.2874079\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        N. Paulino, J. C. Ferreira, and J. M. P. Cardoso, \u201cDynamic Partial Reconfiguration of Customized Single-Row Accelerators,\u201d <span style=\"font-style: italic\">IEEE Transactions on Very Large Scale Integration (VLSI) Systems<\/span>, vol. 27, iss. 1, pp. 116-125, 2019. <a href=\"javascript:void(0)\" id=\"papercite_0\" class=\"papercite_toggle\">[Bibtex]<\/a> \n<div class=\"papercite_bibtex\" id=\"papercite_0_block\">\n<pre><code class=\"tex bibtex\">@ARTICLE{8502926,\nauthor={Paulino, Nuno and C. Ferreira, Jo\u00e3o and M. P. Cardoso, Jo\u00e3o},\njournal={{IEEE Transactions on Very Large Scale Integration (VLSI) Systems}},\ntitle={{Dynamic Partial Reconfiguration of Customized Single-Row Accelerators}},\nyear={2019},\nvolume={27},\nnumber={1},\npages={116-125},\ndoi={10.1109\/TVLSI.2018.2874079},\nISSN={1063-8210},\nmonth={Jan}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [2] <a href=\"http:\/\/dx.doi.org\/10.1109\/FPT.2010.5681454\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        J. Bispo and J. M. P. Cardoso, \u201cOn Identifying and Optimizing Instruction Sequences for Dynamic Compilation,\u201d in <span style=\"font-style: italic\">2010 International Conference on Field-Programmable Technology (FPT)<\/span>,  2010, pp. 437-440. <a href=\"javascript:void(0)\" id=\"papercite_1\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_1_block\">\n<pre><code class=\"tex bibtex\">@INPROCEEDINGS{5681454,\nauthor={Bispo, Jo\u00e3o and M. P. Cardoso, Jo\u00e3o},\nbooktitle={{2010 International Conference on Field-Programmable Technology (FPT)}},\ntitle={{On Identifying and Optimizing Instruction Sequences for Dynamic Compilation}},\nyear={2010},\npages={437-440},\ndoi={10.1109\/FPT.2010.5681454},\nmonth={Dec}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [3] <a href=\"http:\/\/dx.doi.org\/10.1109\/FPL.2010.61\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        J. Bispo and J. M. P. Cardoso, \u201cOn Identifying Segments of Traces for Dynamic Compilation,\u201d in <span style=\"font-style: italic\">2010 International Conference on Field Programmable Logic and Applications (FPL)<\/span>,  2010, pp. 263-266.  <a href=\"javascript:void(0)\" id=\"papercite_2\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_2_block\">\n<pre><code class=\"tex bibtex\">@INPROCEEDINGS{5694260,\nauthor={Bispo, Jo\u00e3o and M. P. Cardoso, Jo\u00e3o},\nbooktitle={{2010 International Conference on Field Programmable Logic and Applications (FPL)}},\ntitle={{On Identifying Segments of Traces for Dynamic Compilation}},\nyear={2010},\npages={263-266},\ndoi={10.1109\/FPL.2010.61},\nmonth={Aug}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [4] <a href=\"http:\/\/dx.doi.org\/10.1109\/TVLSI.2016.2573640\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        N. Paulino, J. C. Ferreira, and J. M. P. Cardoso, \u201cGeneration of Customized Accelerators for Loop Pipelining of Binary Instruction Traces,\u201d <span style=\"font-style: italic\">IEEE Transactions on Very Large Scale Integration (VLSI) Systems<\/span>, vol. 25, iss. 1, pp. 21-34, 2017.  <a href=\"javascript:void(0)\" id=\"papercite_3\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_3_block\">\n<pre><code class=\"tex bibtex\">@ARTICLE{7506263,\nauthor={Paulino, Nuno and C. Ferreira, Jo\u00e3o and M. P. Cardoso, Jo\u00e3o},\njournal={{IEEE Transactions on Very Large Scale Integration (VLSI) Systems}},\ntitle={{Generation of Customized Accelerators for Loop Pipelining of Binary Instruction Traces}},\nyear={2017},\nvolume={25},\nnumber={1},\npages={21-34},\ndoi={10.1109\/TVLSI.2016.2573640},\nmonth={Jan}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [5] <a href=\"http:\/\/dx.doi.org\/10.1007\/978-3-319-61982-8_17\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        J. D. Lopes and J. T. de Sousa, \u201cVersat, a Minimal Coarse-Grain Reconfigurable Array,\u201d in <span style=\"font-style: italic\">High Performance Computing for Computational Science \u2013 VECPAR 2016<\/span>, Cham,  2017, p. 174\u2013187.     <a href=\"javascript:void(0)\" id=\"papercite_4\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_4_block\">\n<pre><code class=\"tex bibtex\">@InProceedings{10.1007\/978-3-319-61982-8_17,\nauthor={D. Lopes, Jo\u00e3o and T. de Sousa, Jos\u00e9},\neditor={Dutra, In\u00eas and Camacho, Rui and Barbosa, Jorge and Marques, Osni},\ntitle={{Versat, a Minimal Coarse-Grain Reconfigurable Array}},\nbooktitle={{High Performance Computing for Computational Science -- VECPAR 2016}},\nyear={2017},\npublisher={{Springer International Publishing}},\naddress={Cham},\npages={174--187},\ndoi={10.1007\/978-3-319-61982-8_17},\nisbn={978-3-319-61982-8}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [6] <a href=\"http:\/\/dx.doi.org\/10.1145\/2629468\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        N. Paulino, J. C. Ferreira, and J. M. P. Cardoso, \u201cA Reconfigurable Architecture for Binary Acceleration of Loops with Memory Accesses,\u201d <span style=\"font-style: italic\">ACM Transactions on Reconfigurable Technology and Systems (TRETS)<\/span>, vol. 7, iss. 4, p. 29:1\u201329:20, 2014.    <a href=\"javascript:void(0)\" id=\"papercite_5\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_5_block\">\n<pre><code class=\"tex bibtex\">@article{Paulino:2014:RAB:2699137.2629468,\nauthor = {Paulino, Nuno and C. Ferreira, Jo\u00e3o and M. P. Cardoso, Jo\u00e3o},\ntitle = {{A Reconfigurable Architecture for Binary Acceleration of Loops with Memory Accesses}},\njournal = {{ACM Transactions on Reconfigurable Technology and Systems (TRETS)}},\nissue_date = {January 2015},\nvolume = {7},\nnumber = {4},\nmonth = dec,\nyear = {2014},\nissn = {1936-7406},\npages = {29:1--29:20},\narticleno = {29},\nnumpages = {20},\nurl = {http:\/\/doi.acm.org\/10.1145\/2629468},\ndoi = {10.1145\/2629468},\nacmid = {2629468},\npublisher = {ACM},\naddress = {New York, NY, USA}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [7] <a href=\"http:\/\/dx.doi.org\/10.1109\/TII.2012.2235844\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        J. Bispo, N. Paulino, J. M. P. Cardoso, and J. C. Ferreira, \u201cTransparent Trace-Based Binary Acceleration for Reconfigurable HW\/SW Systems,\u201d <span style=\"font-style: italic\">IEEE Transactions on Industrial Informatics<\/span>, vol. 9, iss. 3, pp. 1625-1634, 2013.   <a href=\"javascript:void(0)\" id=\"papercite_6\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_6_block\">\n<pre><code class=\"tex bibtex\">@ARTICLE{6392266,\nauthor={Bispo, Jo\u00e3o and Paulino, Nuno and M. P. Cardoso, Jo\u00e3o and C. Ferreira, Jo\u00e3o},\njournal={{IEEE Transactions on Industrial Informatics}},\ntitle={{Transparent Trace-Based Binary Acceleration for Reconfigurable HW\/SW Systems}},\nyear={2013},\nvolume={9},\nnumber={3},\npages={1625-1634},\ndoi={10.1109\/TII.2012.2235844},\nISSN={1551-3203},\nmonth={Aug}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [8] <a href=\"http:\/\/dx.doi.org\/10.1007\/978-3-642-36812-7_12\" class=\"papercite_doi\" title=\"View document on publisher site\"><img decoding=\"async\" src=\"http:\/\/pepcc.inesctec.pt\/wp-content\/plugins\/papercite\/img\/external.png\" alt=\"[DOI]\" width=\"10\" height=\"10\"><\/a>        N. Paulino, J. C. Ferreira, and J. M. P. Cardoso, \u201cArchitecture for Transparent Binary Acceleration of Loops with Memory Accesses,\u201d in <span style=\"font-style: italic\">Reconfigurable Computing: Architectures, Tools and Applications<\/span>, Berlin, Heidelberg,  2013, p. 122\u2013133.   <a href=\"javascript:void(0)\" id=\"papercite_7\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_7_block\">\n<pre><code class=\"tex bibtex\">@InProceedings{10.1007\/978-3-642-36812-7_12,\nauthor={Paulino, Nuno and C. Ferreira, Jo\u00e3o and M. P. Cardoso, Jo\u00e3o},\neditor={{Philip Brisk, Jos\u00e9 Gabriel de Figueiredo Coutinho, and Pedro C. Diniz}},\ntitle={{Architecture for Transparent Binary Acceleration of Loops with Memory Accesses}},\nbooktitle={{Reconfigurable Computing: Architectures, Tools and Applications}},\nyear={2013},\npublisher={{Springer Berlin Heidelberg}},\naddress={Berlin, Heidelberg},\npages={122--133},\ndoi={10.1007\/978-3-642-36812-7_12},\nisbn={978-3-642-36812-7}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [9] M. Duranton, K. De Bosschere, C. Gamrat, J. Maebe, H. Munk, and O. Zendra, <span style=\"font-style: italic\">HiPEAC Vision 2017 \u2013 HiPEAC High-Performance Embedded Architecture and Compilation<\/span>, 2017.    <a href=\"javascript:void(0)\" id=\"papercite_8\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_8_block\">\n<pre><code class=\"tex bibtex\">@misc{hipeac1,\nauthor = {Duranton, Marc and De Bosschere, Koen and Gamrat, Christian and Maebe, Jonas and Munk, Harm and Zendra, Olivier},\nisbn = {978-90-9030182-2},\npages = {1--167},\ntitle = {{HiPEAC Vision 2017 - HiPEAC High-Performance Embedded Architecture and Compilation}},\nyear = {2017},\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<li> [10] M. Malms, J. Nomin\u00e9, and M. Ostasz, <span style=\"font-style: italic\">ETP4HPC Computing Strategic Research Agenda 2015 Update \u2013 HiPEAC High-Performance Embedded Architecture and Compilation<\/span>, 2015.    <a href=\"javascript:void(0)\" id=\"papercite_9\" class=\"papercite_toggle\">[Bibtex]<\/a>\n<div class=\"papercite_bibtex\" id=\"papercite_9_block\">\n<pre><code class=\"tex bibtex\">@misc{hipeac2,\nauthor={Malms, Michael and Nomin\u00e9, Jean-Philippe and Ostasz, Marcin},\ntitle = {{ETP4HPC Computing Strategic Research Agenda 2015 Update - HiPEAC High-Performance Embedded Architecture and Compilation}},\nyear={2015},\nisbn={}\n}<\/code><\/pre>\n<\/div>\n<\/li>\n<\/ul>\n\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t<\/section>\n\t\t\t\t<\/div>\n\t\t","protected":false},"excerpt":{"rendered":"<p>Project Activities Background Objectives and Research Plan A1.Advanced Trace Analysis for JIT Hardware Generation Milestones M1.1: Algorithm for constrained carried loop dependency detection from binary traces M1.2: Algorithm for memory access disambiguation and detection of streaming memory access patterns M1.3: Algorithm for data specialization of Megablocks M1.4: Experimental evaluation of proposed methods A2.Customized CGRAs for [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"parent":0,"menu_order":0,"comment_status":"closed","ping_status":"closed","template":"","meta":{"footnotes":""},"class_list":["post-207","page","type-page","status-publish","hentry"],"_links":{"self":[{"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=\/wp\/v2\/pages\/207","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=\/wp\/v2\/pages"}],"about":[{"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=\/wp\/v2\/types\/page"}],"author":[{"embeddable":true,"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=207"}],"version-history":[{"count":41,"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=\/wp\/v2\/pages\/207\/revisions"}],"predecessor-version":[{"id":621,"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=\/wp\/v2\/pages\/207\/revisions\/621"}],"wp:attachment":[{"href":"https:\/\/pepcc.inesctec.pt\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=207"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}