Open Access
ARTICLE
An Enhanced Task Migration Technique Based on Convolutional Neural Network in Machine Learning Framework
1 Department of Computer Science, Faculty of Computer Science & IT, Superior University, Lahore, 54000, Pakistan
2 School of Electrical, Computer and Telecommunications Engineering, University of Wollongong, Wollongong, NSW 2522, Australia
3 School of Mathematics and Applied Statistics, University of Wollongong, Wollongong, NSW 2522, Australia
4 School of Information Technology, King’s Own Institute, Sydney, NSW 2000, Australia
5 Faculty of Computer and Information Systems, Islamic University of Madinah, Al Madinah Al Munawarah, 42351, Saudi Arabia
6 Department of Information Technology, Wentworth Institute of Higher Education, Sydney, NSW 2000, Australia
7 Faculty of Electrical Engineering and Technology, Superior University, Lahore, 54000, Pakistan
* Corresponding Author: Hamayun Khan. Email:
Computer Systems Science and Engineering 2025, 49, 317-331. https://doi.org/10.32604/csse.2025.061118
Received 18 November 2024; Accepted 17 January 2025; Issue published 19 March 2025
Abstract
The migration of tasks aided by machine learning (ML) predictions IN (DPM) is a system-level design technique that is used to reduce energy by enhancing the overall performance of the processor. In this paper, we address the issue of system-level higher task dissipation during the execution of parallel workloads with common deadlines by introducing a machine learning-based framework that includes task migration using energy-efficient earliest deadline first scheduling (EA-EDF). ML-based EA-EDF enhances the overall throughput and optimizes the energy to avoid delay and performance degradation in a multiprocessor system. The proposed system model allocates processors to the ready task set in such a way that their deadlines are guaranteed. A full task migration policy is also integrated to ensure proper task mapping that ensures inter-process linkage among the arrived tasks with the same deadlines. The execution of a task can halt on one CPU and reschedule the execution on a different processor to avoid delay and ensure meeting the deadline. Our approach shows promising potential for machine-learning-based schedulability analysis enables a comparison between different ML models and shows a promising reduction in energy as compared with other ML-aware task migration techniques for SoC like Multi-Layer Feed-Forward Neural Networks (MLFNN) based on convolutional neural network (CNN), Random Forest (RF) and Deep learning (DL) algorithm. The Simulations are conducted using super pipelined microarchitecture of advanced micro devices (AMD) XScale PXA270 using instruction and data cache per core 32 Kbyte I-cache and 32 Kbyte D-cache on various utilization factors 12%, 31% and 50%. The proposed approach consumes 5.3% less energy when almost half of the CPU is running and on a lower workload consumes 1.04% less energy. The proposed design accumulatively gives significant improvements by reducing the energy dissipation on three clock rates by 4.41%, on 624 MHz by 5.4% and 5.9% on applications operating on 416 and 312 MHz standard operating frequencies.Keywords
Energy-aware multiprocessor systems-on-chips (MPSoCs) are widely used for multimedia and gaming embedded systems. Multi-core processors are rapidly turning into the norm for embedded real-time systems because of the demand for multimedia and gaming systems. These processors take into consideration greater adaptability while making undertakings, giving a substantially more productive environment for task assignments [1]. Task dispatching is a significant part of multiprocessors this article presents a dividing technique for reducing the energy of CPU in multi-core frameworks that reduces the worst-case execution time (WCET) and the absolute energy utilization of the framework. Our calculation is contrasted with others and we exhibit its prevalence [2].
MPSoC consists of multiple processors and all the components on the same chip consume low energy and require less power because of the tightly integrated architecture [3]. The consumption of MPSoC occurs in many abstractions from the logic level to the circuit. It’s an integrated circuit and is designed for application with specific special software and hardware [4]. Proposed a scheduling technique for execution and assessment of CPU energy and memory of the central framework that is associated with multimedia applications. The proposed scheduler is dependent on a constant bandwidth server (CBS) [5]. Introduce an energy-efficient CPU scheduler that maps data using the earliest deadline first-based window constraint migration (EDF-WM) for multiprocessors in which memory reservation instrument uses paging calculations called shared anonymous private pages (PSAP) [6]. Introduce an energy-aware technique for the smooth allocation of the task. Task allocation expands the addition recurrence (usefulness) of its appointed tasks, which brings about expanded CPU clock recurrence [7]. Introduced an efficient processor with chip-packaged-based multi-core processing units, in which at least two processors are utilized to complete execution in equal intervals. This advancement in CPUs gives fast response and quicker execution times [8]. Introduce a Tasks scheduling based on logical link control (LLC) using Convolutional Neural Network (CNN) as shown in Fig. 1 based on the m processor scheduling algorithm (MPSA), L1-D, LLC and NOC Route showing multi-task mapping that creates an ideal schedule that expands the throughput of programs while fulfilling the processor’s mapping requirements [9].
Figure 1: Tasks scheduling based on LLC using CNN
Energy dissipation and Energy-Efficient Scheduling play a vital role in the Migration of tasks while using a Homogeneous platform to meet deadlines of tasks during execution [10]. An energy-efficient dissipation method is proposed to enhance the task migration abilities of MPSoC. Introduce an efficient energy dissipation strategy. Shrinkage of the chip and certainly increasing the electronic element transistor on a chip the frequency and power densities are gradually increasing causing many problems like power consumption and thermal issues as well as higher dissipation of energy [11]. Efficient multi-core systems are introduced but due to the increase in energy thermal issues arise that are the major problems [12].
Introduces a mechanism that energy consumption (Ecc) of executing a task on an MPSoC mainly by switching resources [13] as shown below in using Eq. (1):
Ecc=Pswitching.t=Ceff.V2.fCbf=Ceff.V2.Cb(Vdd−Vths)/.Vdd2(1)
In CMOS chips, the consumption of dynamic power due to the transistor’s switching can be calculated using Eq. (2):
Pdy=Cl∗Nc∗Vd∗f(2)
f=M∗(Vd−Vt)/Vd2∗f(3)
Slave and Master thread Migration ratio in AI-based MPSoC that shows core performance using AMD hops in Fig. 2. The load capacitance is denoted as Cl, Vd, Nc is voltage and no of state transitions. The clock frequency is represented as f while Vt represents the threshold voltage, Lg shows the logic gates used in the CMOS chip. The reduction constant is denoted as M as shown in Eq. (3). The accumulative power of the CMOS is shown in Eq. (4):
Pt=(C∗Vd2f+Lg∗(Vd∗M3)(4)
Figure 2: Slave and master thread migration ratio in AI-based MPSoC
Task migration based on GPU memory for power efficient MPSoC is a mechanism that is used to move an executing task from one host CPU in a distributed architecture to another CPU. DVFS based selection of a task and movement to the host CPU for a new time can task and the creation of the task on that host increase the performance, reliability and processing speed. The lack of task migration on affect the overall system performance [14]. Most of the current techniques improve energy and power management-related problems. The mapping of ready task allocation to CPU is another approach to scheduling, there are three main classes of task scheduling multiprocessor partitioned scheduling, restricted-migration scheduling, and full-migration scheduling.
1.3 Partitioned Migration Scheduling
Introduced an energy aware partitioned migration scheduling technique that is used for the optimization of energy-aware distributed multiprocessor. The system mapped each ready task to a single processor π at the system-design time [15]. Statically assignment of the task to the CPU is preferable for the uniprocessor schedule as illustrated in Fig. 3.
Figure 3: Tasks scheduling using partitioned migration
Introduced full migration task scheduling technique based on dynamic voltage and frequency scaling DVFS that is widely used as the least restrictive method widely used in scheduling. An energy and performance efficient DVFS scheme can handle task that halt on one CPU and reschedule the execution on a different processor to avoid delay and meet the deadline. Job-level parallelism is restricted because tasks cannot execute parallel on two or more different CPU cores [16]. Each arrived task is placed into a priority queue. The scheduler can have the option to decide what task requires execution on each CPU at the current time interval as shown in Fig. 4.
Figure 4: Tasks scheduling of tasks using partitioned migration
1.5 Restricted Migration Scheduling
Introduce a restricted migration scheduling based on a Resource-aware load balancing model for a batch of tasks (BoT) considering the deadline and best-fit migration policy for homogenious and heterogeneous distributed computing systems that migrate tasks between CPUs, each arrived task can execute on only one CPU core at a time [17]. Proposed a Dynamic scheduling technique for real-time tasks in heterogeneous multicore systems in which the arrivals of the task are not prioritized. In restricted migration, a scheduler can adapt to two levels. Generated tasks are placed on the global priority queue [18]. Reference [19] introduced an energy optimization for real-time multiprocessor that runs only periodic tasks with uncertain execution time by considering the migration policy to reduce the effects of the increase in temperature on the chip that can smoothly perform and achieve a normal working condition. A uniprocessor scheduling algorithm is used to schedule each processor’s assigned task as shown in Fig. 5.
Figure 5: Tasks scheduling using partitioned migration
System-level application-aware dynamic power management (DPM) in adaptive pipelined MPSoCs for software-based energy and power reduction schemes are implemented at the consumer level and the consumption of energy is reduced using the software. Instruction reordering and energy-efficient code are widely used software-based energy optimization techniques. Energy optimization using software-based methods includes instruction level, operating system (OS) level and compiler level implementation [20]. Introduce an Online dynamic power management (DPM) for efficient execution of interactive workloads AND instruction-level energy optimization technique. The instruction level analysis is used to assign a defined energy budget to any ready task. Proposed a compiler-level mechanism that plays an important role in the behavior of an application during execution and determines the no of instructions being executed, order as well as type and has a significant impact on the reduction of energy consumption [21]. Dynamic power management (DPM) for multidomain system-on-chip is introduced with an optimal control approach to reduce the consumption of energy and power. Introduces a memory latency optimization technique that is used for the latency optimizations at the compiler level That modifies the memory layout and program access pattern operating system (OS) level [22]. The implementation of Optimal Dynamic power management for multidomain is used for the optimization of software-level approach and also used in hardware as pointer synthesis [23].
Proposed a real-time model that is based on finite elements and used for the optimization and reduction of memory by considering dynamic memory and dynamic power approach as high-speed on-chip memory is widely used to reduce the consumption of energy during the run time. Introduce an instruction rescheduling technique that is highly used compiler-level energy and power reduction technique that is used to enhance the performance as well as also remove the pipeline stalls that cause a delay in processing [24]. Proposed an analysis and optimization of MPSoCs reliability test that utilize the operating system according to the requirement and demand of tasks related to OS processes. The OS level optimization technique is implemented on dynamic power management (DPM), CPU clock, scheduling algorithms, Input-output (I/O) and speed management software-based DPM technique using an operating system level is implemented to reduce energy for performance enhancement as shown in Eq. (5).
ET=Es+Ep+EC(5)
The energy consumed during the transmission is represented as ETransmission(ni,d) bits can be calculated using Eq. (6).
ETransmission(ni,d)≥{ni∗Ess+ni∗effs∗(d2)d≤doni∗Ess+ni∗effp∗(d4)d≥do(6)
Whereas Ess represents the energy dissipation per bit in the transmitter circuit effp and effs are the amplification energy [25]. Due to advancements in embedded technology, periodic activities show more computational demand in a multi-core system these activities are categorized as periodic task sets is continuously executed at definite rates with non-elastic deadlines. when the CPU is not working and is in an idle state then the maximum duration can be computed using λ so that ti with the earliest arrival αk. When another task with the nearest deadline is executed, the ti can be delayed using Eq. (7).
n∑i={1.,.,.n}/kciτi+ck+αkτk=1(7)
While another task t2 with higher priority arrive with the earliest deadline before the end of the execution task t1 then the length of the idle interval is denoted as λj and max time duration for the idle period is represented as αj can be measured using Eq. (8).
n∑i={1.,.,.n}/(k,j)ciτi+ck+αkτk+cj+αjτj=1(8)
The utilization bound ∝ is a function of 1umax(A) scheduling of n tasks on m identical CPUs. If n≤∝m then the CPU will be allocated to the task set while in case ≥∝m then ∝ the utilization bound for P-EDF on the CPU can be calculated using Eq. (9) is considered as:
usum≤∝m+1/∝+1(9)
Introduce a P-EDF-based utilization bound for identical MPSoC first fit decreasing (FFD), best fit (BF) and random fit (RF) are widely used for assigning tasks to CPU and it’s observed that all these variations have the same utilization bound. Introduce a mechanism to find the utilization bound that determines the least number of processors that are required to schedule n tasks with usum as shown below in Eq. (10).
m≥{1min{[n∝]ifusum≤1(10)
Marvel X-Scale AMD PXA-270 is widely used for symmetric multiprocessing when a real-time task load arrives and all the processors in a multi-core embedded system are of the same type. Simulation tools for real-time multiprocessors (STORM) implement various scheduling techniques on MPSoC-based architecture using both homogeneous and heterogeneous CPUs. STORM generates the energy profiles and evaluates performance and task scheduling on the MPSoC [26].
3 An Improved Machine Learning-Based EA-EDF Scheduling Model
The scheduling algorithm applies migration of task ti∈τ when a multiprocessor system allows a ti∈τ migrates if it starts execution on one core of the processor and later switches due to high utilization of the task and migrates the task to the other core.
Using the proposed full task migration strategy in which all the task ti∈τ are allowed to migrate at any point as per the condition of ui implemented in the proposed EA-EDF during their execution. However, the ti∈τ cannot execute parallel on multiple cores, it can execute on any single core selected from the core configuration at a time. This strategy is the most flexible and increases performance by allowing migration. Fig. 6 illustrates the proposed machine Learning-based task migration Model by integrating CNN into EA-EDF full migration of tasks using EA-EDF. The task set contains a ready task set that is allocated to the scheduler the proposed scheduler monitors the utilization factor and migrates the ti according to the configurations.
Figure 6: Machine learning-based task migration model using integrating CNN into EA-EDF
Considering the concurrent arrival of tasks having the same deadlines and priority and the task required the CPU cycle for mapping and allocation of resources access to CPU. Improper task scheduling consumes more energy and causes a delay in the CMOS chip. High task utilization without proper allocation of CPU degrades the reliability and performance of MPSoC, missing task (τ) deadlines particularly periodic ready task set τ=(τ1,τ2,τ3,…,τn) and scheduling cause a 10%–15% delay in CMOS circuits due to improper task migration between symmetric and asymmetric cores over m cores multiprocessor. When two processes require a CPU cycle at the same interval increases the CPU load and energy dissipation. Completion of a task by meeting the task deadlines and by allowing the suitable task migration policy and DPM-based core switching technique can reduce the consumption of energy and enhance the overall performance. The design issue of a predictable and energy-efficient migration of processes in shared-memory MPSoCs has not received enough attention. The objective of our research work presented in this article is to propose a more mature and improved task migration solution that supports the processor to reduce the dissipation of energy and increase the overall performance.
Dataset Description and Preprocessing
The PTB Suggestive ECG Data set, which comes from Physionet, is the dataset that was used in this research for processing over the CPU as a ready task the dataset is publically open and available for use as referred to in [27]. The 14,552 ECG accounts in this dataset are isolated into two packs are used during the testing and training process because using more than 14,552 samples can cause a very high upsurge and requires to make atlest three packs for the ML system in our upcoming research with more advanced ML system we will be able to use 21,837 ECG samples together. Considering two uniform multiprocessors denoted by P1 and P2 and a (R-T) system instance, I={t1,t2,…,tn}. Let sp(1) and sp(2) represent the schedule of instance t1,t2,…,tn on P1 and P2 respectively when the workload increases the tasks migrate to the core in the selected core configuration that is the least used then for instance I and time instant t. Consider a task set ti∈τ (t1,t2,….tn) and sort ready task queue ti∈τ. The migration of tasks occurs instantly when ui≥ max allowable workload. We have used a work function for MPSoC to elaborate the total average work done by all the tasks t of instance I over-scheduled time intervals t>0. The system work function is represented as W(sp(1),P,I,τ)≥W(sp(2),P,I,τ) migrates tasks on the core when ui≥ threshold workload without any delay by meeting all the deadlines.
Let’s assume the parameters (rk,ck,dk,pk,prk) of tasks ti∈τ be a task in RT instance {tk,…,tn} at counter time to Set the power parameter Psleep(β),Pidle(Υ)and Prunning(α) of AMD PXA-270 multiprocessor. Consider a task tk∈τ with the earliest arrival is ready for allocation to core configuration (λ1,λ2,…λn) if (ui) < max allowed τ load than the work function is represented as shown below in Eq. (11).
W(sp(1),P1,I,tk,to)W(sp(2),P2,I,tk,to)(11)
Eq. (11) states that initial. ui< max allowable workload (T). Considering the utilization for the whole instant I as (tk∈τ) ∈ I to calculate the overall response as shown below in Eq. (12).
W(sp(1),P1,I,to)<W(sp(2),P2,P,I,to)(12)
Eq. (13) illustrates if to=rk, then the work done on P1 for instance, I will increase as the arrival of the new task tk increases the overall utilization of the selected configuration. If (ui) ≥ max allowed τ load, rk, is the peak task load period.
W(sp(1),P1,I,rk,)≥W(sp(2),P2,I,rk,)(13)
Therefore, the completion of I={tk,…,tn} of RT instance schedule in sp(1) during the time interval [to,rk,) is greater than the schedule sp(2) during the same time interval. In processor schedule sp(1), the task tk is active during the entire interval [to,rk,) so at least two processors will be running during that time interval and the remaining will be in an idle state as shown below in Eq. (14).
to−rk=a1+a2+…+am(p)(14)
If tk schedules on P2 at [to,rk,) and P1 is already at its threshold utilization than using the proposed task migration framework strategy the arrival of any new task tk∈τ schedules and starts executing on the next available processor or core in the selected core configuration (λ1,λ2,…λn) that is in the least use according to the ui therefore it can be expressed as shown below in Eq. (15).
W(So,P2,I,rk,to)≤sp(1)(P2) * (to−rk)(15)
Task tk can schedule and start executing when the ui reaches its threshold If (ui) ≥ max allowed τ load enables the τ migration using the proposed EA-EDF. A minimum of one processor in the core configuration (λ1,λ2,…λn) must be in an idle state to schedule the task. In case all the processor m(p) of P are busy then the task tk has to wait till the availability of one processor. Therefore tk is guaranteed to be scheduled without missing its deadline for the set of unit time intervals (∑m(P)−1k=1ak).
Moreover in the ck WCET, tk Executes on the idle or slowest processor. When a task tk∈τ are executing some jobs on sk(Pm) Therefore Select the core configuration (λn) as per the (ui) requirement of arrived ti∈τ, either least used (λn) as shown below in Eq. (16).
W(So,P2,I,tk,to)≥m(P)−1∑k=1aksk(P)(16)
CPU core at the speed of kth the processor requires the switching of the processor’s Psleep(β) state to Pidle(Υ) in core configuration (λn) Switching (λn) from Psleep(β) to Pidle(Υ) to achieve optimal utilization of the core as the instant (∑m(P)k=1aksk(Pm)=δ is a higher utilization task than δ(Pm) must be less than the load for proper scheduling of tasks. Based on the above equation it’s confirmed that using unrestricted full migration permits all the tasks to migrate among processors in the configuration if required.
The proposed framework migrates tasks to the core when ui≥ threshold workload without any delay by meeting all the deadlines and satisfying the average work done by all the tasks t of instance I over scheduled time intervals t>0: The below mentioned are the various m cores distribution models selected during the ti∈τ workload on ui 12%, 31%, and 50% using the proposed task scheduling method. CPU in the sleep state is represented as Psleep(β) While the CPU in idle and executions state is denoted as, Pidle(Υ) and Prunning(α) at various standard frequencies, 624, 208 and 104 MHz of MARVEL AMD PXA-270 embedded MPSoC whereas the no of the core is represented as mn.
Eq. (17) represents the integration of a full task migration model for various utilization factors (ui) = 12% into EA-EDF scheduling algorithm for mapping and migration of arrived task under minimal utilization when the CPU α=2,β=6 considering the mapping core distribution as λ2: (m1, m2), (m4, m7), (m3, m5), (m6, m8).
uτ=n∑K=2capa=12%Uτ≤m(p)+2(17)
Eq. (18) represents the integration of a full task migration model for various utilization factors (ui) = 31% into EA-EDF scheduling algorithm for mapping and migration of arrived task under minimal utilization when the CPU α=4,β=4 considering the mapping core distribution as λ4: (m1, m2, m3, m4), (m5,m6, m7, m8).
uτ=n∑K=4capa=31%Uτ≤m(p)+4(18)
uτ=n∑K=6capa=50%Uτ≤m(p)+6(19)
Eq. (19) represents the integration of a full task migration model for various utilization factors (ui) = 50% into EA-EDF scheduling algorithm for mapping and migration of arrived task under minimal utilization when the CPU α=6,β=2 considering the mapping core distribution as λ6: (m1, m2, m3, m4,m5,m7), (m1, m2, m3, m6, m7, m8). Fig. 7 illustrates the various core configuration states of the proposed m cores distribution model using a full task migration policy for ti∈τ on MARVEL AMD PXA-270 embedded MPSoC based on utilization factor (ui).
Figure 7: Various proposed stages for the core distribution mode (λn) based on (ui)
The experimental evaluation of uniform multiprocessor is considered using STORM for the multithreaded application that executes an XML file containing the task (τ) parameters for ti∈τ, n = 4, n = 10 and n = 17. Fig. 8 represents the core distribution model for the X264 CPU core in which the period ranges between [4–25 ms]. The task set in Tables 1–3 on ui = 12, ui = 31, ui = 50% were evaluated on sd = 1000 ms simulation duration for embedded architecture supporting symmetric as well asymmetric CPU arrangement with m CPUs. Task ti∈τ, with ui=12\% ,ui = 31% and ui = 50% are allocated and mapped on the CPU cores (λ2,λ4,λ6∈λn).
Figure 8: Core distribution model for X264 CPU Core
All the tasks are periodic and denoted as τ=(t1,t2,…,tn) using the hardware and software architectural parameters of Marvel X-Scale AMD PXA-270 MPSoC based on the proposed full migration and scheduling policy with dynamic power management (DPM) integration based on the utilization factor (ui) of the CPU core with the same energy and power for all cores are the same. Table 1 represents the parameters of system timing requirements.
Table 2 represents the parameters of system timing requirements at 31% utilization factor considering 4 core CPU.
Fig. 9 represents the analysis that compares the energy dissipation of the proposed approach on numerous frequencies using AMD PXA-270. Table 3 represents the comparison of energy dissipation on various clock frequencies underutilization. ui = 6%, 10%, 20%, 36%, 55%, 60%, 62.5% and ui ≥ 63% when all the cores are running. Simulation shows that EA-EDF gives more energy-efficient results at lower (ui) utilization.
Figure 9: Comparison of energy consumption at ui = 6%, 20%, 31% and 53% at 624, 520 and 416 MHz
Figs. 10 and 11 represent the analysis that compares the performance of our proposed approach EA-EDF using AMD PXA-270 MPSoC in terms of dynamic energy dissipation for a different ready task set at various ui utilization factors with other conventional techniques like Multi-Layer Feed-Forward Neural Network (MLFNN) based on convolutional neural network (CNN), Random Forest (RF) and Deep learning (DL) algorithm at 624 and 416 MHz. The X-axis is representing the proposed EA-EDF ui utilization factor in comparison to other heuristic techniques increasing with the arrival of a new task ti∈τ; Y-axis represents the improvement % due to efficient task migration and DPM-enabled policy improving the performance by reducing energy dissipation.
Figure 10: Proposed EA-EDF energy consumption comparison with G-EDF, PDTM U-RT-DPM, TBP at 624 MHz
Figure 11: Proposed EA-EDF energy consumption comparison with EA-MLFNN, EA-CNN, EA-DL-EA-RF at 416 MHz
DPM is more convenient when tasks ti∈τ increases, the task starts migration and the shorter slack time intervals are optimized and move to an idle state due to the DPM approach once the heuristic starts executing the average energy is computed by considering that the release time of the task is its first arrival and each processor m with running states need to be allocated. The Simulation results show the maximum improvement. The average and the minimum improvement of our proposed approach compared to the GEDF approach are 4.3%, 2.25% and 0.48%, respectively. Proposed ML-based EA-EDF gives max improvement at (ui = 38%) on m = 5 running processor, while on average improvement occurs at (ui = 50%) on m = 6 running processor and the least improvement occurs at (ui = 6%) on m = 1 running the processor in an 8-processor platform. So we can generally state that when the ui utilization factor is higher due to a higher number of task executions making the dynamic power management technique more effective that's why we can see a significant improvement at (ui = 62%) The overall improvement is 4.3%, 1.27%, 0.26% and 1.02% on (ui = 50%) 3.37%, 1.46%, 0.58% and 0.35% in comparison with G-EDF, PDTM U-RT-DPM, and TBP utilization.
This article effectively demonstrates the viability of utilizing Convolutional Neural Network (CNN) based Multiprocessing architectures that are used for high-dimensional datasets, especially with available electrocardiography datasets to evaluate the task scheduling and migration that play a vital role in energy reduction, particularly in multiprocessor design. The rapid increase in the demand for multithreaded applications enhances the thermal heat dissipation in MPSoC. A full task migration policy based on Dynamic power management techniques (DPM) for efficient task allocation and scheduling is the finest integration used for decreasing the consumption of energy. MARVEL AMD PXA-270, PXA-271 and PXA-250, are using system-level DPM for switching CPU modes as per the need of the energy-power model. Various parameters like WCET, BCET, deadline, period and priority are the main characteristics of multithreaded real-time (R-T) applications. In this article, we have designed a system-level task migration policy used to tackle improper task allocation to the CPU core and inefficient task scheduling for real-time applications with deadline and priority constraints that ultimately reduce energy dissipation and improve overall performance.
Acknowledgement: The authors sincerely thank Prof. Son Lam Phung from the University of Wollongong, Australia for important suggestions and feedback on this paper.
Funding Statement: The authors received no specific funding for this study.
Author Contributions: Hamayun Khan is the corresponding author of this article, he worked on data analysis, the introduction and methodology of research paper by explaining the scope, context, research from write-up to analysis contributed critically to the research paper. Muhammad Atif Imtiaz worked on the development of the algorithm and results analysis in depth. Hira Siddique worked on the development of mathematical analysis of the research work and aligned the algorithm and flow chart analysis from implementation to results initiation. Arshad Ali worked on the proofreading and implemenation of the dataset, testing, and validation of the proposed model. Muhammad Zeeshan Baig worked on literature and data collection processes that helped in the implementation of the proposed model and migration policy using three-stage migration techniques in the research. Muhammad Tausif Afzal Rana validated the methodology and mathematical analysis by evaluating the model on STORM tool and generated high-resolution results for migration policies. Saif ur Rehman developed all the tables and calculated the utilization factor at various frequencies, especially implementation at 512 higher frequencies. Yazed Alsaawy’s role is to enhance the overall structure of the research paper by removing the typo mistakes supported in proofreading and English grammar enhancing the overall quality of work. All authors reviewed the results and approved the final version of the manuscript.
Availability of Data and Materials: All data are available from the corresponding author on reasonable request.
Ethics Approval: Not applicable.
Conflicts of Interest: The authors declare no conflicts of interest to report regarding the present study.
References
1. Irwin MJ, Benini L, Vijaykrishnan N, Kandemir M. Techniques for designing energy-aware MPSoCs. Multiprocess Syst-on-Chips. 2005;13(5):21–47. doi:10.1016/B978-012385251-9/50016-5. [Google Scholar] [CrossRef]
2. Burns A, Andy W. Dispatching domains for multiprocessor platforms and their representation in ada. In: International Conference on Reliable Software Technologies; 2010; Berlin/Heidelberg, Germany: Springer. p. 41–53. [Google Scholar]
3. Gonzalez-Martinez G, Sandoval-Arechiga R, Solis-Sanchez LO, Garcia-Luciano L, Ibarra-Delgado S, Solis-Escobedo JR, et al. A survey of MPSoC management toward self-awareness. Micromachines. 2024;15(5):577. doi:10.3390/mi15050577. [Google Scholar] [PubMed] [CrossRef]
4. Liu J, Mao M, Gao J, Bai J, Sun D. Hardware-accelerated YOLOv5 based on MPSoC. J Phys: Conf Ser. 2024;2732(1):012013. doi:10.1088/1742-6596/2732/1/012013. [Google Scholar] [CrossRef]
5. Verma P, Maurya AK, Yadav RS. A survey on energy-efficient workflow scheduling algorithms in cloud computing. Softw Pract Exp. 2024;54(5):637–82. doi:10.1002/spe.3292. [Google Scholar] [CrossRef]
6. Yu T, Zhong R, Janjic V, Petoumenos P, Zhai J, Leather H, et al. Collaborative heterogeneity-aware OS scheduler for asymmetric multicore processors. IEEE Trans Parallel Distrib Syst. 2021;32(5):1224–37. doi:10.1109/TPDS.2020.3045279. [Google Scholar] [CrossRef]
7. Rao W, Li H. Energy-aware scheduling algorithm for microservices in Kubernetes clouds. J Grid Comput. 2024;23(1):2. doi:10.1007/s10723-024-09788-w. [Google Scholar] [CrossRef]
8. Gaffour K, Benhaoua MK, Benyamina AEH, Singh AK. A new efficient multi-task applications mapping for three-dimensional network-on-chip based MPSoC. Concurr Comput Pract Exp. 2021;33(10):1–20. doi:10.1002/cpe.6194. [Google Scholar] [CrossRef]
9. Hu Y, Liu Y, Liu Z. A survey on convolutional neural network accelerators: GPU, FPGA and ASIC. In: 2022 14th International Conference on Computer Research and Development (ICCRD); 2022 Jan 7–9; Shenzhen, China: IEEE; 2022. p. 100–7. doi:10.1109/ICCRD54409.2022.9730377 [Google Scholar] [CrossRef]
10. Gonzalez R, Horowitz M. Energy dissipation in general purpose microprocessors. IEEE J Solid State Circuits. 1996;31(9):1277–84. doi:10.1109/4.535411. [Google Scholar] [CrossRef]
11. Choi S, Prasanna VK, Jang JW. Minimizing energy dissipation of matrix multiplication kernel on Virtex-II. In: Reconfigurable technology: FPGAs and reconfigurable processors for computing and communications IV. Boston, MA, USA: SPIE; 2002. doi:10.1117/12.455487. [Google Scholar] [CrossRef]
12. Ali H, Tariq U, Hardy J, Zhai X. A survey on system level energy optimisation for MPSoCs in IoT and consumer electronics. Comput Sci Rev. 2021;41(1):100–16. doi:10.1016/j.cosrev.2021.100416. [Google Scholar] [CrossRef]
13. Feliu J, Sahuquillo J, Petit S, Eeckhout L. Thread isolation to improve symbiotic scheduling on SMT multicore processors. IEEE Trans Parallel Distrib Syst. 2020;31(2):359–73. doi:10.1109/TPDS.2019.2934955. [Google Scholar] [CrossRef]
14. Somdip D, Isuwa S, Saha S, Singh AK, Maier KM. CPU-GPU-memory DVFS for power-efficient MPSoC in mobile cyber physical systems. Future Internet. 2022;14(3):91–103. doi:10.3390/fi14030091. [Google Scholar] [CrossRef]
15. Jiang E, Wang L, Wang J. Decomposition-based multi-objective optimization for energy-aware distributed hybrid flow shop scheduling with multiprocessor tasks. Tsinghua Sci Technol. 2021;26(5):646–63. doi:10.26599/TST.2021.9010007. [Google Scholar] [CrossRef]
16. Liang CY, Liu S, Chung EY, Gaudiot JL. An energy and performance efficient dvfs scheme for irregular parallel divide-and-conquer algorithms on the AMD scc. IEEE Comput Archit Lett. 2013;13(1):13–6. [Google Scholar]
17. Alam M, Haidri RA, Shahid M. Resource-aware load balancing model for batch of tasks (BoT) with best fit migration policy on heterogeneous distributed computing systems. Int J Pervasive Comput Commun. 2020;16(2):113–41. doi:10.1108/IJPCC-10-2019-0081. [Google Scholar] [CrossRef]
18. Baital K, Chakrabarti A. Dynamic scheduling of real-time tasks in heterogeneous multicore systems. IEEE Embedd Syst Lett. 2018;11(1):29–32. doi:10.1109/LES.2018.2846666. [Google Scholar] [CrossRef]
19. Huang K, Wang K, Zheng D, Jiang X, Zhang X, Yan R, et al. Expected energy optimization for real-time multiprocessor socs running periodic tasks with uncertain execution time. IEEE Trans Sustain Comput. 2018;6(3):398–411. doi:10.1109/TSUSC.2018.2853621. [Google Scholar] [CrossRef]
20. Haris J, Shafique M, Henkel J, Parameswaran S. System-level application-aware dynamic power management in adaptive pipelined MPSoCs for multimedia. In: IEEE/ACM International Conference on Computer-Aided Design (ICCAD); 2011; Washington, DC, USA. p. 616–23. [Google Scholar]
21. James B, Tenentes V, AlHashimi BM, Merrett GV. Online tuning of dynamic power management for efficient execution of interactive workloads. In: IEEE/ACM International Symposium on Low Power Electronics and Design (ISLPED); 2017; Boston, MA, USA. p. 1–6. [Google Scholar]
22. Bogdan P, Marculescu R, Jain S. Dynamic power management for multidomain system-on-chip platforms: an optimal control approach. ACM Trans Design Autom Electron Syst. 2013;18:1–20. doi:10.1145/2504904. [Google Scholar] [CrossRef]
23. Khan H, Din IU, Ali A, Husain M. An optimal DPM based energy-aware task scheduling for performance enhancement in embedded MPSoC. Comput Mater Contin. 2023;74(1):2097–113. doi:10.32604/cmc.2023.032999. [Google Scholar] [CrossRef]
24. Zhang X, Zhang W, Sun W, Wu H, Song A, Jha SK. A real-time cutting model based on finite element and order reduction. Comput Syst Sci Eng. 2022;43(1):1–15. doi:10.32604/csse.2022.024950. [Google Scholar] [CrossRef]
25. Kivilcim CA, Rosing TS, Mihic K, Leblebici Y. Analysis and optimization of MPSoC reliability. J Low Power Electron. 2006;2(1):56–69. doi:10.1166/jolpe.2006.007. [Google Scholar] [CrossRef]
26. Richard U, Deplanche AM, Trinquet Y. Storm a simulation tool for real-time multiprocessor scheduling evaluation. In: IEEE 15th Conference on Emerging Technologies & Factory Automation (ETFA); 2010; Bilbao, Spain. p. 1–8. [Google Scholar]
27. Wagner P, Strodthoff N. PTB-XL, a large publicly available electrocardiography dataset. Sci Data. 2020;7(1):1–15. doi:10.1038/s41597-020-0495-6. [Google Scholar] [PubMed] [CrossRef]
Cite This Article

This work is licensed under a Creative Commons Attribution 4.0 International License , which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.