Fine-grained overhead characterisation of cross-ISA DBTO for multicore processor
The emergence of modern portable software, start to behaved hybrid shortlong running combined applications, in which an active apps may invoked others to fulfill task requirements. Thus the implementation of Dynamic Translation and Optimisation (DBTO) into heterogeneous multicore system-on-chip (SoC...
Saved in:
Main Authors: | , , |
---|---|
Format: | Article |
Published: |
Institute of Advanced Engineering and Science
2018
|
Online Access: | https://www.scopus.com/inward/record.uri?eid=2-s2.0-85044576410&doi=10.11591%2fijeecs.v10.i3.pp1036-1044&partnerID=40&md5=b1095cd4f2ee40b0762fd29af0a787fa http://eprints.utp.edu.my/20881/ |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Summary: | The emergence of modern portable software, start to behaved hybrid shortlong running combined applications, in which an active apps may invoked others to fulfill task requirements. Thus the implementation of Dynamic Translation and Optimisation (DBTO) into heterogeneous multicore system-on-chip (SoC) will require careful re-study, to ensure efficient usage of most available cores. In order to improve efficiency in supporting this Instruction Set Architecture (ISA) diversity of computing platforms, mix modes of statically and dynamically Binary Translation and Optimization system, or DBTO, need to utilize concurrent compilation techniques, to better service the combined applications processing. This research deep dived into finer-grained DBTO overhead analysis, to provide categorization and characterization of overhead sources in breakdown stages during concurrent instruction processing. A dual-engine of translation and optimization is constructed for finer managemnt of start-up overheads. Helper functions, i.e. LoadLink/StoreCondition (LL/SC) are derived from atomic instructions, to create multiple helper thread supported by multiple host cores, for better instruction translation and optimization operation concurrently. Our experiment platform, evaluated through PARSEC-3.0 benchmark suite, shows performance improvement approaching 2.0x for apps based programs and 1.25x for kernel based programs, for x86 to X86-64 emulation. This technique possess great potential and serve as research based platform for future binary translation technique development, including adaptive method. © 2018 Institute of Advanced Engineering and Science. All rights reserved. |
---|