Xiao et al., 2024 - Google Patents
CRS: A cost-aware resource scheduling framework for deep learning task orchestration in mobile cloudsXiao et al., 2024
- Document ID
- 14764058195750413975
- Author
- Xiao L
- Xiao Z
- Wu D
- Hu M
- Zhou Y
- Publication year
- Publication venue
- IEEE Transactions on Mobile Computing
External Links
Snippet
Deep learning (DL) has found extensive application in supporting various mobile applications. The efficient execution of DL tasks is paramount for ensuring the effectiveness of AI-driven mobile applications. While previous research has predominantly focused on …
- 238000013135 deep learning 0 title abstract description 78
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/48—Programme initiating; Programme switching, e.g. by interrupt
- G06F9/4806—Task transfer initiation or dispatching
- G06F9/4843—Task transfer initiation or dispatching by program, e.g. task dispatcher, supervisor, operating system
- G06F9/4881—Scheduling strategies for dispatcher, e.g. round robin, multi-level priority queues
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5027—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals
- G06F9/505—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resource being a machine, e.g. CPUs, Servers, Terminals considering the load
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5005—Allocation of resources, e.g. of the central processing unit [CPU] to service a request
- G06F9/5011—Allocation of resources, e.g. of the central processing unit [CPU] to service a request the resources being hardware resources other than CPUs, Servers and Terminals
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5083—Techniques for rebalancing the load in a distributed system
- G06F9/5088—Techniques for rebalancing the load in a distributed system involving task migration
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F9/00—Arrangements for programme control, e.g. control unit
- G06F9/06—Arrangements for programme control, e.g. control unit using stored programme, i.e. using internal store of processing equipment to receive and retain programme
- G06F9/46—Multiprogramming arrangements
- G06F9/50—Allocation of resources, e.g. of the central processing unit [CPU]
- G06F9/5061—Partitioning or combining of resources
- G06F9/5072—Grid computing
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06Q—DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTING PURPOSES, NOT OTHERWISE PROVIDED FOR
- G06Q10/00—Administration; Management
- G06Q10/06—Resources, workflows, human or project management, e.g. organising, planning, scheduling or allocating time, human or machine resources; Enterprise planning; Organisational models
- G06Q10/063—Operations research or analysis
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F2209/00—Indexing scheme relating to G06F9/00
- G06F2209/50—Indexing scheme relating to G06F9/50
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F15/00—Digital computers in general; Data processing equipment in general
- G06F15/16—Combinations of two or more digital computers each having at least an arithmetic unit, a programme unit and a register, e.g. for a simultaneous processing of several programmes
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06F—ELECTRICAL DIGITAL DATA PROCESSING
- G06F11/00—Error detection; Error correction; Monitoring
- G06F11/30—Monitoring
- G06F11/34—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment
- G06F11/3409—Recording or statistical evaluation of computer activity, e.g. of down time, of input/output operation; Recording or statistical evaluation of user activity, e.g. usability assessment for performance assessment
Similar Documents
| Publication | Publication Date | Title |
|---|---|---|
| Peng et al. | A multi-objective trade-off framework for cloud resource scheduling based on the deep Q-network algorithm | |
| Ye et al. | Deep learning workload scheduling in gpu datacenters: A survey | |
| Chaurasia et al. | Comprehensive survey on energy-aware server consolidation techniques in cloud computing | |
| Rekha et al. | Efficient task allocation approach using genetic algorithm for cloud environment | |
| Sun et al. | ET2FA: A hybrid heuristic algorithm for deadline-constrained workflow scheduling in cloud | |
| Gao et al. | Deep learning workload scheduling in gpu datacenters: Taxonomy, challenges and vision | |
| Dong et al. | Greedy scheduling of tasks with time constraints for energy-efficient cloud-computing data centers | |
| Zhu et al. | Scheduling stochastic multi-stage jobs to elastic hybrid cloud resources | |
| Yuan et al. | Temporal task scheduling of multiple delay-constrained applications in green hybrid cloud | |
| Çağlar et al. | Look-ahead energy efficient VM allocation approach for data centers | |
| Hashemi et al. | Gwo-sa: Gray wolf optimization algorithm for service activation management in fog computing | |
| Xiao et al. | CRS: A cost-aware resource scheduling framework for deep learning task orchestration in mobile clouds | |
| Liu et al. | KubFBS: A fine‐grained and balance‐aware scheduling system for deep learning tasks based on kubernetes | |
| Wang et al. | Resource scheduling techniques in cloud from a view of coordination: a holistic survey | |
| Syed et al. | Systematic review: particle swarm optimization (PSO) based load balancing for Cloud Computing | |
| Mukherjee et al. | Cloud computing resource management | |
| Sugumar | An Intelligent Predictive GPU Scheduling Framework for Deep Learning Workloads in Large-Scale Cloud Environments | |
| Pandya et al. | Dynamic resource allocation techniques in cloud computing | |
| Zhao et al. | Reducing the upfront cost of private clouds with clairvoyant virtual machine placement: Y. Zhao et al. | |
| Ravikumar et al. | Preemptive min max optimal cost based scheduling for improving the load balancing in virtualized cloud environment | |
| Mukherjee et al. | Task scheduling algorithm based on multi criteria decision making method for cloud computing environment: TSABMCDMCCE | |
| Patni et al. | Heuristic models for optimal host selection | |
| Tesfatsion et al. | Power and performance optimization in FPGA‐accelerated clouds | |
| Deepa et al. | Scheduling model for task loading in cloud data centres | |
| Tian et al. | Sophisticated orchestrating concurrent dlrm training on cpu/gpu platform |