W. T. C. Kramer, PERCU: A Holistic Method for Evaluating High Performance Computing Systems, 2008.

H. Li, D. L. Groep, and L. Wolters, Workload Characteristics of a Multi-cluster Supercomputer, Lecture Notes in Computer Science, vol.3277, pp.176-193, 2004.
DOI : 10.1007/11407522_10

C. Anglano and M. Canonico, Scheduling algorithms for multiple Bag-of-Task applications on Desktop Grids: A knowledge-free approach, 2008 IEEE International Symposium on Parallel and Distributed Processing, pp.1-8, 2008.
DOI : 10.1109/IPDPS.2008.4536445

D. P. Anderson, BOINC: A System for Public-Resource Computing and Storage, GRID, R. Buyya (ed.), pp.4-10, 2004.

F. Cappello, S. Djilali, G. Fedak, T. Hérault, F. Magniette et al., Computing on large-scale distributed systems: XtremWeb architecture, programming models, security, tests and convergence with grid, Future Generation Computer Systems, vol.21, issue.3, pp.417-437, 2005.
DOI : 10.1016/j.future.2004.04.011

URL : https://hal.archives-ouvertes.fr/in2p3-00163490

W. Cirne, F. Vilar-brasileiro, N. Andrade, L. Costa, A. Andrade et al., Labs of the World, Unite!!!, Journal of Grid Computing, vol.17, issue.2-4, pp.225-246, 2006.
DOI : 10.1007/s10723-006-9040-x

A. R. Butt, R. Zhang, and Y. C. Hu, A Self-Organizing Flock of Condors, Supercomputing, p.42, 2003.

D. G. Feitelson and L. Rudolph, Toward Convergence in Job Schedulers for Parallel Supercomputers, Job Scheduling Strategies for Parallel Processing, pp.1-26, 1996.

R. Lepère, D. Trystram, and G. J. Woeginger, Approximation Algorithms for Scheduling Malleable Tasks Under Precedence Constraints, Int. J. Found. Comput

S. K. Ghafoor, Modeling of an adaptive parallel system with malleable applications in a distributed computing environment, 2007.

X. Fan, W. Weber, and L. A. Barroso, Power provisioning for a warehouse-sized computer, ISCA '07: Proceedings of the 34th annual international symposium on Computer architecture, pp.13-23, 2007.
DOI : 10.1145/1250662.1250665

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.130.1279

J. Ebergen, J. Gainsley, and P. Cunningham, Transistor sizing: how to control the speed and energy consumption of a circuit, 10th International Symposium on Asynchronous Circuits and Systems, 2004. Proceedings., pp.51-61, 2004.
DOI : 10.1109/ASYNC.2004.1299287

W. Feng and T. Scogland, The Green500 List: Year one, 2009 IEEE International Symposium on Parallel & Distributed Processing, pp.1-7, 2009.
DOI : 10.1109/IPDPS.2009.5160978

A. Orgerie, L. Lefèvre, and J. Gelas, Save Watts in Your Grid: Green Strategies for Energy-Aware Framework in Large Scale Distributed Systems, 2008 14th IEEE International Conference on Parallel and Distributed Systems, 2008.
DOI : 10.1109/ICPADS.2008.97

URL : https://hal.archives-ouvertes.fr/ensl-00474726

A. Orgerie, L. Lefèvre, and J. Gelas, Chasing Gaps between Bursts: Towards Energy Efficient Large Scale Experimental Grids, 2008 Ninth International Conference on Parallel and Distributed Computing, Applications and Technologies, 2008.
DOI : 10.1109/PDCAT.2008.80

URL : https://hal.archives-ouvertes.fr/ensl-00469221

V. W. Freeh, D. K. Lowenthal, F. Pan, N. Kappiah, R. Springer et al., Analyzing the Energy-Time Trade-Off in High-Performance Computing Applications, IEEE Transactions on Parallel and Distributed Systems, vol.18, issue.6, pp.835-848, 2007.

B. Rountree, D. K. Lowenthal, B. R. de Supinski, M. Schulz, V. W. Freeh et al., Adagio: making DVS practical for complex HPC applications, ICS '09: Proceedings of the 23rd international conference on Supercomputing, pp.460-469, 2009.

E. Frachtenberg and U. Schwiegelshohn, New Challenges of Parallel Job Scheduling, Job Scheduling Strategies for Parallel Processing, pp.1-23, 2007.
DOI : 10.1007/978-3-540-78699-3_1

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.157.3770

S. J. Chapin, W. Cirne, D. G. Feitelson, J. P. Jones, S. T. Leutenegger et al., Benchmarks and Standards for the Evaluation of Parallel Job Schedulers, Job Scheduling Strategies for Parallel Processing 13th IPPS/10th SPDP'99 Workshop (5th JSSPP'99) of Lecture Notes in Computer Science (LNCS), pp.67-90, 1999.
DOI : 10.1007/3-540-47954-6_4

Y. Georgiou, O. Richard, and N. Capit, Evaluations of the Lightweight Grid CIGRI upon the Grid5000 Platform, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007), pp.279-286, 2007.
DOI : 10.1109/E-SCIENCE.2007.32

URL : https://hal.archives-ouvertes.fr/hal-00687520

Y. Georgiou, N. Capit, B. Bzeznik, and O. Richard, Simple, fault tolerant, lightweight grid computing approach for bag-of-tasks applications, 2008.

M. C. Cera, Y. Georgiou, O. Richard, N. Maillard, and P. O. A. Navaux, Supporting MPI Malleable Applications upon the OAR Resource Manager, COLIBRI Informatics Conference, 2009.
URL : https://hal.archives-ouvertes.fr/hal-00691414

M. C. Cera, Y. Georgiou, O. Richard, N. Maillard, and P. O. A. Navaux, Supporting Malleability in Parallel Architectures with Dynamic CPUSETs Mapping and Dynamic MPI, ICDCN, pp.242-257, 2010.

G. Da-costa, M. Dias-de-assunção, J. Gelas, Y. Georgiou, L. Lefèvre et al., Multi-facet approach to reduce energy consumption in clouds and grids, Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking, e-Energy '10, pp.95-104, 2010.
DOI : 10.1145/1791314.1791329

URL : https://hal.archives-ouvertes.fr/ensl-00517185

G. Da-costa, J. Gelas, Y. Georgiou, L. Lefèvre, A. Orgerie et al., The GREEN-NET framework: Energy efficiency in large scale distributed systems, 2009 IEEE International Symposium on Parallel & Distributed Processing, pp.1-8, 2009.
DOI : 10.1109/IPDPS.2009.5160975

URL : https://hal.archives-ouvertes.fr/ensl-00474732

G. Da-costa, J. Gelas, Y. Georgiou, L. Lefèvre, A. Orgerie et al., The GREEN-NET approach for supporting energy efficient solutions in Grids, 2009.

A. Sulistio, C. Shin-yeo, and R. Buyya, A taxonomy of computer-based simulations and its mapping to parallel and distributed systems simulation tools, Software: Practice and Experience, vol.34, issue.7, pp.653-673, 2004.
DOI : 10.1002/spe.585

J. Pujol-Ahulló and P. García-López, PlanetSim: An Extensible Simulation Tool for Peer-to-Peer Networks and Services, Peer-to-Peer Computing, pp.85-86, 2009.

A. Montresor and M. Jelasity, PeerSim: A Scalable P2P Simulator, Peer-to-Peer Computing, pp.99-100, 2009.

R. Buyya and M. M. Murshed, GridSim: a toolkit for the modeling and simulation of distributed resource management and scheduling for Grid computing, Concurrency and Computation: Practice and Experience, vol.14, issue.13-15, pp.1175-1220, 2002.
DOI : 10.1002/cpe.710

H. Casanova, A. Legrand, and M. Quinson, SimGrid: A Generic Framework for Large-Scale Distributed Experiments, Tenth International Conference on Computer Modeling and Simulation (uksim 2008), 2008.
DOI : 10.1109/UKSIM.2008.28

URL : https://hal.archives-ouvertes.fr/inria-00260697

K. V. Vishwanath, A. Vahdat, K. Yocum, and D. Gupta, ModelNet: Towards a DataCenter Emulation Environment, Peer-to-Peer Computing, pp.81-82, 2009.

H. Xia, H. Dail, H. Casanova, and A. A. Chien, The MicroGrid: Using Online Simulation to Predict Application Performance in Diverse Grid Network Environments, CLADE, p.52, 2004.

L.-C. Canon and E. Jeannot, Wrekavoc: a tool for emulating heterogeneity, IPDPS, 2006.

F. Cappello and H. Bal, Toward an International "Computer Science Grid", Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07), pp.3-12, 2007.
DOI : 10.1109/ccgrid.2007.115

B. Videau, C. Touati, and O. Richard, Toward an Experiment Engine for Lightweight Grids, Proceedings of the 1st International ICST Conference on Networks for Grid Applications, 2008.
DOI : 10.4108/gridnets.2007.2245

URL : https://hal.archives-ouvertes.fr/hal-01398153

W. Cirne and F. Berman, A comprehensive model of the supercomputer workload, Proceedings of the Fourth Annual IEEE International Workshop on Workload Characterization. WWC-4 (Cat. No.01EX538), pp.140-148, 2001.
DOI : 10.1109/WWC.2001.990753

W. Cirne and F. Berman, A model for moldable supercomputer jobs, Proceedings 15th International Parallel and Distributed Processing Symposium. IPDPS 2001, pp.59-59, 2001.
DOI : 10.1109/IPDPS.2001.925004

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.26.6080

E. Frachtenberg and D. G. Feitelson, Pitfalls in Parallel Job Scheduling Evaluation, Lecture Notes in Computer Science, vol.3834, pp.257-282, 2005.
DOI : 10.1007/11605300_13

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.157.5325

A. B. Downey and D. G. Feitelson, The elusive goal of workload characterization, ACM SIGMETRICS Performance Evaluation Review, vol.26, issue.4, pp.14-29, 1999.
DOI : 10.1145/309746.309750

A. T. Wong, L. Oliker, W. T. Kramer, T. L. Kaltz, and D. H. Bailey, ESP: A System Utilization Benchmark, ACM/IEEE SC 2000 Conference (SC'00), pp.52-52, 2000.
DOI : 10.1109/SC.2000.10056

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.104.9804

A. T. Wong, L. Oliker, W. T. Kramer, T. L. Kaltz, and D. H. Bailey, System Utilization Benchmark on the Cray T3E and IBM SP, Job Scheduling Strategies for Parallel Processing, Lecture Notes in Computer Science, vol.1911, pp.56-67, 2000.
DOI : 10.1007/3-540-39997-6_4

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.25.7594

D. G. Feitelson, A Critique of ESP, Lecture Notes in Computer Science, vol.1911, pp.68-73, 2000.

S. Prakas and B. Spielberg, Effective System Performance Suite on IBM e-server pSeries

D. G. Feitelson, Metric and Workload Effects on Computer Systems Evaluation, IEEE Computer, vol.36, issue.9, pp.18-25, 2003.

D. G. Feitelson, Experimental Computer Science: The Need for a Cultural Change

N. Capit, G. D. Costa, Y. Georgiou, G. Huard, C. Martin et al., A batch scheduler with high level components, CCGrid 2005. IEEE International Symposium on Cluster Computing and the Grid, 2005., pp.776-783, 2005.
DOI : 10.1109/CCGRID.2005.1558641

URL : https://hal.archives-ouvertes.fr/hal-00005106

Y. Georgiou, J. Leduc, B. Videau, J. Peyrard, and O. Richard, A tool for environment deployment in clusters and light grids, Proceedings 20th IEEE International Parallel & Distributed Processing Symposium, 2006.
DOI : 10.1109/IPDPS.2006.1639691

URL : https://hal.archives-ouvertes.fr/hal-00688748

D. Bailey, T. Harris, W. Saphir, R. Van-der-wijngaart, A. Woo et al., The NAS Parallel Benchmarks 2.0, 1995.
DOI : 10.1177/109434209100500306

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.104.3829

J. Subhlok, S. Venkataramaiah, and A. Singh, Characterizing NAS benchmark performance on shared heterogeneous networks, Proceedings 16th International Parallel and Distributed Processing Symposium, p.91, 2002.
DOI : 10.1109/IPDPS.2002.1015659

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.9.826

. Wong, Complex version of high performance computing LINPACK benchmark (HPL), Concurrency and Computation: Practice and Experience, pp.573-587, 2010.

C. Pinte, F. Menard, G. Duchene, and P. Bastien, Monte Carlo radiative transfer in protoplanetary disks, Astronomy and Astrophysics, vol.459, issue.3, 2006.
DOI : 10.1051/0004-6361:20053275

URL : https://hal.archives-ouvertes.fr/hal-00398574

N. Fallenbeck, H.-J. Picht, M. Smith, and B. Freisleben, Xen and the Art of Cluster Scheduling, First International Workshop on Virtualization Technology in Distributed Computing (VTDC 2006), pp.237-244, 2006.
DOI : 10.1109/VTDC.2006.18

J. A. Kaplan and M. L. Nelson, A Comparison of Queueing, Cluster and Distributed Computing Systems, NASA TM-109025 (Revision 1), 1994.

J. P. Jones and C. Brickell, Second Evaluation of Job Queuing/Scheduling Software: Phase 1 Report, Tech. Rep, 1997.

Y. Yan and B. Chapman, Comparative Study of Distributed Resource Management Systems: SGE, LSF, PBS Pro, and LoadLeveler, 2004.

T. A. El-Ghazawi, K. Gaj, N. A. Alexandridis, F. Vroman, N. Nguyen et al., A performance study of job management systems, Concurrency - Practice and Experience, pp.1229-1246, 2004.

S. M. Balle and D. J. Palermo, Enhancing an open source resource manager with multi-core/multi-threaded support, JSSPP'07: Proceedings of the 13th international conference on Job scheduling strategies for parallel processing, pp.37-50, 2008.

F. Broquedis, J. Clet-ortega, S. Moreaud, N. Furmento, B. Goglin et al., hwloc: A Generic Framework for Managing Hardware Affinities in HPC Applications, 2010 18th Euromicro Conference on Parallel, Distributed and Network-based Processing, pp.180-186, 2010.
DOI : 10.1109/PDP.2010.67

URL : https://hal.archives-ouvertes.fr/inria-00429889

A. Bhatele and L. V. Kalé, An evaluative study on the effect of contention on message latencies in large supercomputers, 2009 IEEE International Symposium on Parallel & Distributed Processing, pp.1-8, 2009.
DOI : 10.1109/IPDPS.2009.5161094

A. Bhatele, E. J. Bohm, and L. V. Kalé, Topology aware task mapping techniques: an api and case study, PPOPP, pp.301-302, 2009.

C. E. Leiserson, Fat-trees: Universal networks for hardware-efficient supercomputing, IEEE Transactions on Computers, vol.34, issue.10, 1985.
DOI : 10.1109/TC.1985.6312192

J. Navaridas, J. A. Pascual, and J. Miguel-alonso, Effects of Job and Task Placement on Parallel Scientific Applications Performance, 2009 17th Euromicro International Conference on Parallel, Distributed and Network-based Processing, pp.55-61, 2009.
DOI : 10.1109/PDP.2009.53

A. Bhatele and L. V. Kalé, Application-specific topology-aware mapping for three dimensional topologies, 2008 IEEE International Symposium on Parallel and Distributed Processing, pp.1-8, 2008.
DOI : 10.1109/IPDPS.2008.4536348

J. Navaridas, J. Miguel-alonso, F. J. Ridruejo, and W. Denzel, Reducing complexity in tree-like computer interconnection networks, Parallel Computing, vol.36, issue.2-3, pp.71-85, 2010.
DOI : 10.1016/j.parco.2009.12.004

G. Bilardi and P. Bay, An area lower bound for a class of fat-trees, ESA: Annual European Symposium on Algorithms, 1994.
DOI : 10.1007/BFb0049427

P. Bay and G. Bilardi, Deterministic On-Line Routing on Area-Universal Networks, JACM: Journal of the ACM, vol.42, 1995.
DOI : 10.1109/fscs.1990.89548

V. Subramani, R. Kettimuthu, S. Srinivasan, J. Johnston, and P. Sadayappan, Selective buddy allocation for scheduling parallel jobs on clusters, Proceedings. IEEE International Conference on Cluster Computing, p.107, 2002.
DOI : 10.1109/CLUSTR.2002.1137735

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.107.2086

Y. Aridor, T. Domany, O. Goldshmidt, E. Shmueli, J. E. Moreira et al., Multi-toroidal Interconnects: Using Additional Communication Links to Improve Utilization of Parallel Computers, Lecture Notes in Computer Science, vol.3277, pp.144-159, 2004.
DOI : 10.1007/11407522_8

. Citeseerx, On Enhancing the Reliability of Job Schedulers, 2008.

C. Engelmann, S. L. Scott, C. Leangsuksun, and X. He, Symmetric Active/Active High Availability for High-Performance Computing System Services: Accomplishments and Limitations, 2008 Eighth IEEE International Symposium on Cluster Computing and the Grid (CCGRID), pp.813-818, 2008.
DOI : 10.1109/CCGRID.2008.78

C. Engelmann, S. L. Scott, C. Leangsuksun, and X. He, Transparent Symmetric Active/Active Replication for Service-Level High Availability, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07), pp.755-760, 2007.
DOI : 10.1109/CCGRID.2007.116

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.75.7315

K. Uhlemann, C. Engelmann, and S. L. Scott, JOSHUA: Symmetric Active/Active Replication for Highly Available HPC Job and Resource Management, 2006 IEEE International Conference on Cluster Computing, 2006.
DOI : 10.1109/CLUSTR.2006.311855

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.115.3518

W. Yu, J. Wu, and D. K. Panda, Fast and Scalable Startup of MPI Programs in InfiniBand Clusters, HiPC, pp.440-449, 2004.
DOI : 10.1007/978-3-540-30474-6_47

A. B. Yoo, M. A. Jette, and M. Grondona, SLURM: Simple Linux Utility for Resource Management, Job Scheduling Strategies for Parallel Processing, pp.44-60, 2003.
DOI : 10.1007/10968987_3

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.10.6834

. Taktuk, Adaptive Execution Deployment

B. Claudel, G. Huard, and O. Richard, TakTuk, adaptive deployment of remote executions, HPDC '09: Proceedings of the 18th ACM international symposium on High performance distributed computing, pp.91-100, 2009.

E. Frachtenberg, F. Petrini, J. Fernández, S. Pakin, and S. Coll, STORM: Lightning-Fast Resource Management, ACM/IEEE SC 2002 Conference (SC'02), pp.1-26, 2002.
DOI : 10.1109/SC.2002.10057

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.153.8667

E. Frachtenberg, F. Petrini, J. Fernández, and S. Pakin, STORM: Scalable Resource Management for Large-Scale Parallel Computers, IEEE Transactions on Computers, vol.55, issue.12, pp.1572-1587, 2006.
DOI : 10.1109/TC.2006.206

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.156.7558

D. G. Feitelson and L. Rudolph, Parallel Job Scheduling: Issues and Approaches, Lecture Notes in Computer Science, vol.949, pp.1-18, 1995.

D. G. Feitelson, L. Rudolph, U. Schwiegelshohn, K. C. Sevcik, and P. Wong, Theory and Practice in Parallel Job Scheduling, Job Scheduling Strategies for Parallel Processing IPPS'97 Workshop of Lecture Notes in Computer Science (LNCS), pp.1-34, 1997.

M. W. Margo, K. Yoshimoto, P. Kovatch, and P. Andrews, Impact of Reservations on Production Job Scheduling, JSSPP'07: Proceedings of the 13th international conference on Job scheduling strategies for parallel processing, pp.116-131, 2008.
DOI : 10.1007/978-3-540-78699-3_7

J. Skovira, W. Chan, H. Zhou, and D. A. Lifka, The EASY - LoadLeveler API project, IPPS '96: Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, pp.41-47, 1996.
DOI : 10.1007/BFb0022286

D. Talby and D. G. Feitelson, Supporting priorities and improving utilization of the IBM SP scheduler using slack-based backfilling, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999, p.513, 1999.
DOI : 10.1109/IPPS.1999.760525

A. W. Mu'alem and D. G. Feitelson, Utilization, Predictability, Workloads, and User Runtime Estimates in Scheduling the IBM SP2 with Backfilling, IEEE Trans. Parallel Distrib. Syst, vol.12, issue.6, pp.529-543, 2001.

A. Nissimov and D. G. Feitelson, Probabilistic Backfilling, Lecture Notes in Computer Science, vol.4942, pp.102-115, 2007.
DOI : 10.1007/978-3-540-78699-3_6

J. K. Ousterhout, Scheduling Techniques for concurrent systems, 3rd International Conference on Distributed Computing Systems, pp.22-30, 1982.

D. G. Feitelson and M. A. Jette, Improved Utilization and Responsiveness with Gang Scheduling, Lecture Notes in Computer Science, vol.1291, 1997.

A. Bouteiller, H. Bouziane, T. Hérault, P. Lemarinier, and F. Cappello, Hybrid Preemptive Scheduling of Message Passing Interface Applications on Grids, International Journal of High Performance Computing Applications, vol.20, issue.1, pp.77-90, 2006.
DOI : 10.1177/1094342006062526

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.86.347

G. Berry, Preemption in concurrent systems, Proceedings of Foundations of Software Technology and Theoretical Computer Science, pp.72-93, 1993.
DOI : 10.1007/3-540-57529-4_44

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.42.1557

Q. Snell, M. J. Clement, and D. B. Jackson, Preemption Based Backfill, Job Scheduling Strategies for Parallel Processing, 8th International Workshop
DOI : 10.1007/3-540-36180-4_2

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.186.7780

E. Shmueli and D. G. Feitelson, Uncovering the Effect of System Performance on User Behavior from Traces of Parallel Systems, 2007 15th International Symposium on Modeling, Analysis, and Simulation of Computer and Telecommunication Systems, pp.274-280, 2007.
DOI : 10.1109/MASCOTS.2007.67

A. Streit, A Self-Tuning Job Scheduler Family with Dynamic Policy Switching, Lecture Notes in Computer Science, vol.2537, 2002.
DOI : 10.1007/3-540-36180-4_1

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.18.8759

E. Frachtenberg, D. G. Feitelson, F. Petrini, and J. Fernandez, Adaptive parallel job scheduling with flexible coscheduling, IEEE Transactions on Parallel and Distributed Systems, vol.16, issue.11, pp.1066-1077, 2005.
DOI : 10.1109/TPDS.2005.130

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.76.6333

F. Wang, H. Franke, M. C. Papaefthymiou, P. Pattnaik, L. Rudolph et al., A gang scheduling design for multiprogrammed parallel computing environments, IPPS '96: Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, pp.111-125, 1996.
DOI : 10.1007/BFb0022290

S. Zhou, X. Zheng, J. Wang, and P. Delisle, Utopia: A load sharing facility for large, heterogeneous distributed computer systems, Software: Practice and Experience, vol.23, issue.12, 1993.
DOI : 10.1002/spe.4380231203

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.121.1434

P. M. Papadopoulos, M. J. Katz, and G. Bruno, NPACI Rocks: Tools and Techniques for Easily Deploying Manageable Linux Clusters, Cluster Computing, IEEE International Conference on, p.258, 2001.

D. B. Jackson, Q. Snell, and M. J. Clement, Core Algorithms of the Maui Scheduler, JSSPP '01: Revised Papers from the 7th International Workshop on Job Scheduling Strategies for Parallel Processing, pp.87-102, 2001.
DOI : 10.1007/3-540-45540-X_6

D. B. Jackson, New Issues and New Capabilities in HPC Scheduling with the Maui Scheduler

R. L. Henderson, Job scheduling under the Portable Batch System, IPPS '95: Proceedings of the Workshop on Job Scheduling Strategies for Parallel Processing, pp.279-294, 1995.
DOI : 10.1007/3-540-60153-8_34

D. Thain, T. Tannenbaum, and M. Livny, Distributed computing in practice: the Condor experience, Concurrency - Practice and Experience, pp.323-356, 2005.
DOI : 10.1002/cpe.938

X. Evers, R. Boontje, D. H. Epema, and R. Van-dantzig, Condor Flocking: Load Sharing Between Pools of Workstations, Tech. Rep, 1993.

M. J. Litzkow, M. Livny, and M. W. Mutka, Condor - a hunter of idle workstations, Proceedings of the 8th International Conference on Distributed Computing Systems, pp.104-111, 1988.
DOI : 10.1109/DCS.1988.12507

T. Tannenbaum, D. Wright, K. Miller, and M. Livny, Condor: a distributed job scheduler, pp.307-350, 2002.

W. Gentzsch, Sun Grid Engine: towards creating a compute power grid, Proceedings First IEEE/ACM International Symposium on Cluster Computing and the Grid
DOI : 10.1109/CCGRID.2001.923173

D. Templeton, Beginner's Guide to Sun Grid Engine 6.2 Installation and Configuration

B. Bode, D. M. Halstead, R. Kendall, Z. Lei, and D. Jackson, The Portable Batch Scheduler and the Maui Scheduler on Linux Clusters, Proceedings of the 4th

J. Bremond, High Availability Documentation on OAR - Admin Guide, Tech. Rep, 2009.

J. Duell, The design and implementation of Berkeley Lab's linux checkpoint/restart, 2005.

I. Foster and C. Kesselman (eds.), The Grid: Blueprint for a New Computing Infrastructure, 1999.

I. Foster and C. Kesselman, Globus: a Metacomputing Infrastructure Toolkit, International Journal of High Performance Computing Applications, vol.11, issue.2, pp.115-128, 1997.
DOI : 10.1177/109434209701100205

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.25.8254

R. Bolze, F. Cappello, E. Caron, M. Daydé, F. Desprez et al., Grid'5000: a large scale and highly reconfigurable experimental Grid testbed, International Journal of High Performance Computing Applications, vol.20, issue.4, pp.481-494, 2006.

J. Frey, T. Tannenbaum, M. Livny, I. T. Foster, and S. Tuecke, Condor-G: a computation management agent for multi-institutional grids, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing, pp.237-246, 2002.
DOI : 10.1109/HPDC.2001.945176

I. Foster, Globus Toolkit Version 4: Software for Service-Oriented Systems, 2006.
DOI : 10.1007/11577188_2

D. P. Anderson, J. Cobb, E. Korpela, M. Lebofsky, and D. Werthimer, SETI@home: an experiment in public-resource computing, Communications of the ACM, vol.45, issue.11, pp.56-61, 2002.
DOI : 10.1145/581571.581573

D. Zhou and V. M. Lo, WaveGrid: a scalable fast-turnaround heterogeneous peer-based desktop grid system, IPDPS, 2006.

M. A. S. Netto, R. N. Calheiros, R. K. S. Silva, C. A. F. De Rose, C. Northfleet et al., Transparent Resource Allocation to Exploit Idle Cluster Nodes in Computational Grids, pp.238-245, 2005.
DOI : 10.1109/e-science.2005.83

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.72.6724

R. Medeiros, W. Cirne, F. Vilar-brasileiro, and J. P. Sauvé, Faults in grids: why are they so bad and what can be done about it?, Proceedings. First Latin American Web Congress, pp.18-24, 2003.
DOI : 10.1109/GRID.2003.1261694

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.71.9362

G. Kola, T. Kosar, and M. Livny, Phoenix: Making Data-Intensive Grid Applications Fault-Tolerant, Fifth IEEE/ACM International Workshop on Grid Computing, pp.251-258, 2004.
DOI : 10.1109/GRID.2004.51

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.129.7615

Y. Liang, Y. Zhang, A. Sivasubramaniam, R. K. Sahoo, J. Moreira et al., Filtering Failure Logs for a BlueGene/L Prototype, DSN '05: Proceedings of the 2005 International Conference on Dependable Systems and Networks (DSN'05), pp.476-485, 2005.

M. Litzkow, T. Tannenbaum, J. Basney, and M. Livny, Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System, 1997.

F. Heine, M. Hovestadt, O. Kao, and A. Keller, Provision of Fault Tolerance with Grid-enabled and SLA-aware Resource Management Systems, Parallel Computing: Current & Future Issues of High-End Computing, pp.113-120, 2005.

P. Domingues, A. Andrzejak, and L. Silva, Using Checkpointing to Enhance Turnaround Time on Institutional Desktop Grids, 2006 Second IEEE International Conference on e-Science and Grid Computing (e-Science'06), p.73, 2006.
DOI : 10.1109/E-SCIENCE.2006.261157

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.124.6176

C. Anglano, J. Brevik, M. Canonico, D. Nurmi, and R. Wolski, Fault-aware scheduling for Bag-of-Tasks applications on Desktop Grids, 2006 7th IEEE/ACM International Conference on Grid Computing, pp.28-2956, 2006.
DOI : 10.1109/ICGRID.2006.310998

D. Kondo, A. A. Chien, and H. Casanova, Resource Management for Rapid Application Turnaround on Enterprise Desktop Grids, Proceedings of the ACM/IEEE SC2004 Conference, p.17, 2004.
DOI : 10.1109/SC.2004.50

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.1.5066

C. Weng and X. Lu, Heuristic scheduling for bag-of-tasks applications in combination with QoS in the computational grid, Future Generation Computer Systems, vol.21, issue.2, pp.271-280, 2005.
DOI : 10.1016/j.future.2003.10.004

S. Sankaran, J. M. Squyres, B. Barrett, and A. Lumsdaine, The LAM/MPI Checkpoint/Restart Framework: System-Initiated Checkpointing, Proceedings, LACSI Symposium, pp.479-493, 2003.
DOI : 10.1177/1094342005056139

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.110.9068

B. O. Yenke, J.-F. Méhaut, and M. Tchuente, Scheduling Deadline-Constrained Checkpointing on Virtual Clusters, APSCC, pp.257-264, 2008.

C. R. Anderson, An Implementation of the Fast Multipole Method without Multipoles, SIAM Journal on Scientific and Statistical Computing, vol.13, issue.4, pp.923-947, 1992.
DOI : 10.1137/0913055

S. Clark, S. J. Yalda, . Graves, J. A. Ramachandran, E. Rushing et al., Service-Oriented Environments for Dynamically Interacting with Mesoscale Weather, Computing in Science and Engineering (CiSE), vol.7, issue.6, pp.12-29, 2005.

L. V. Kalé, S. Kumar, and J. DeSouza, A Malleable-Job System for Timeshared Parallel Machines, Cluster Computing and the Grid, p.230, 2002.

J. Hungershöfer, On the Combined Scheduling of Malleable and Rigid Jobs, 16th Symposium on Computer Architecture and High Performance Computing, pp.206-213, 2004.
DOI : 10.1109/SBAC-PAD.2004.27

G. Utrera, J. Corbalán, and J. Labarta, Implementing malleability on MPI jobs, Proceedings. 13th International Conference on Parallel Architecture and Compilation Techniques, 2004. PACT 2004., pp.215-224, 2004.
DOI : 10.1109/PACT.2004.1342555

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.104.85

J. Hungershöfer and A. Streit, Efficient Resource Management for Malleable Applications

K. El Maghraoui, T. J. Desell, B. K. Szymanski, and C. A. Varela, Malleable iterative MPI applications, Concurrency and Computation: Practice and Experience, pp.393-413, 2009.
DOI : 10.1002/cpe.1362

K. El Maghraoui, T. J. Desell, B. K. Szymanski, and C. A. Varela, Dynamic Malleability in Iterative MPI Applications, Seventh IEEE International Symposium on Cluster Computing and the Grid (CCGrid '07), pp.591-598, 2007.
DOI : 10.1109/CCGRID.2007.45

T. Desell, K. El Maghraoui, and C. A. Varela, Malleable applications for scalable high performance computing, Cluster Computing, vol.15, issue.5-6, pp.323-337, 2007.
DOI : 10.1007/s10586-007-0032-9

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.135.1753

J. Buisson, O. O. Sonmez, H. H. Mohamed, and D. H. Epema, Scheduling Malleable Applications in Multicluster Systems, Int. Conference on Cluster Computing, pp.372-381, 2007.
URL : https://hal.archives-ouvertes.fr/hal-00498790

L. V. Kalé, S. Kumar, J. DeSouza, M. Potnuru, and S. Bandhakavi, Faucets: Efficient Resource Allocation on the Computational Grid, Proceedings of the 2004 International Conference on Parallel Processing, 2004.

R. H. Castain, T. S. Woodall, D. J. Daniel, J. M. Squyres, B. Barrett et al., The Open Run-Time Environment (OpenRTE): A Transparent Multi-cluster Environment for High-Performance Computing, Proceedings, 12th European PVM/MPI Users' Group Meeting, 2005.
DOI : 10.1007/11557265_31

R. L. Graham, S. Choi, D. J. Daniel, N. N. Desai, R. G. Minnich et al., A network-failure-tolerant message-passing system for terascale clusters, International Journal of Parallel Programming, vol.31, issue.4, pp.285-303, 2003.
DOI : 10.1023/A:1024504726988

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.102.7415

D. Buntinas, G. Bosilca, R. L. Graham, G. Vallée, and G. R. Watson, A Scalable Tools Communications Infrastructure, HPCS, pp.33-39, 2008.

J. M. Squyres and A. Lumsdaine, A Component Architecture for LAM/MPI, Proceedings, 10th European PVM/MPI Users' Group Meeting, number 2840 in Lecture Notes in Computer Science, pp.379-387, 2003.

W. Gropp, E. Lusk, and R. Thakur, Using MPI-2: Advanced Features of the Message-Passing Interface, 1999.

M. Cera, G. Pezzi, E. Mathias, N. Maillard, and P. Navaux, Improving the Dynamic Creation of Processes in MPI-2, 13th European PVM/MPI Users' Group Meeting, pp.247-255, 2006.
DOI : 10.1007/11846802_37

X. Feng, R. Ge, and K. W. Cameron, Power and Energy Profiling of Scientific Applications on Distributed Systems, IPDPS '05: Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers, p.34, 2005.

S. Huang and W. Feng, Energy-Efficient Cluster Computing via Accurate Workload Characterization, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.68-75, 2009.
DOI : 10.1109/CCGRID.2009.88

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.172.2224

J. Ebergen, J. Gainsley, and P. Cunningham, Transistor sizing: how to control the speed and energy consumption of a circuit, 10th International Symposium on Asynchronous Circuits and Systems, 2004, Proceedings, pp.51-61, 2004.
DOI : 10.1109/ASYNC.2004.1299287

J. H. Ahn, J. Leverich, R. Schreiber, and N. P. Jouppi, Multicore DIMM: an Energy Efficient Memory Module with Independently Controlled DRAMs, IEEE Computer Architecture Letters, vol.8, issue.1, pp.5-8, 2009.
DOI : 10.1109/L-CA.2008.13

T. Kgil, S. D'Souza, A. G. Saidi, N. L. Binkert, R. G. Dreslinski et al., PicoServer: using 3D stacking technology to enable a compact energy efficient chip multiprocessor, ASPLOS, pp.117-128, 2006.
DOI : 10.1007/978-1-4419-0784-4_9

R. Bergamaschi, G. Han, A. Buyuktosunoglu, H. Patel, I. Nair et al., Exploring power management in multi-core systems, 2008 Asia and South Pacific Design Automation Conference, pp.708-713, 2008.
DOI : 10.1109/ASPDAC.2008.4484043

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.581.8956

R. Nagpal and Y. N. Srikant, Exploring Energy-Performance Trade-Offs for Heterogeneous Interconnect Clustered VLIW Processors, Proc. of Intl. Conf. on High Performance Computing, 2005.
DOI : 10.1007/11945918_48

URL : http://citeseerx.ist.psu.edu/viewdoc/summary?doi=10.1.1.105.7642

M. S. Warren, E. H. Weigle, and W. Feng, High-Density Computing: A 240-Processor Beowulf in One Cubic Meter, ACM/IEEE SC 2002 Conference (SC'02), 2002.
DOI : 10.1109/SC.2002.10010

M. Lim, V. W. Freeh, and D. K. Lowenthal, Adaptive, Transparent Frequency and Voltage Scaling of Communication Phases in MPI Programs, ACM/IEEE SC 2006 Conference (SC'06), p.107, 2006.
DOI : 10.1109/SC.2006.11

Y. Lu and G. De Micheli, Adaptive Hard Disk Power Management on Personal Computers, GLS '99: Proceedings of the Ninth Great Lakes Symposium on VLSI, p.50, 1999.

R. K. Sharma, C. E. Bash, C. D. Patel, R. J. Friedrich et al., Balance of Power: Dynamic Thermal Management for Internet Data Centers, IEEE Internet Computing, vol.9, issue.1, pp.42-49, 2005.

J. D. Moore, J. S. Chase, P. Ranganathan, and R. K. Sharma, Making Scheduling "Cool": Temperature-Aware Workload Placement in Data Centers, USENIX Annual Technical Conference, pp.61-75, 2005.

G. Da Costa, J. Gelas, Y. Georgiou, L. Lefèvre, A. Orgerie et al., The GREEN-NET framework: Energy efficiency in large scale distributed systems, 2009 IEEE International Symposium on Parallel & Distributed Processing, 2009.
DOI : 10.1109/IPDPS.2009.5160975

URL : https://hal.archives-ouvertes.fr/ensl-00474732

G. Da Costa, M. Dias de Assunção, J. Gelas, Y. Georgiou, L. Lefèvre et al., Multi-facet approach to reduce energy consumption in clouds and grids, Proceedings of the 1st International Conference on Energy-Efficient Computing and Networking, e-Energy '10, 2010.
DOI : 10.1145/1791314.1791329

URL : https://hal.archives-ouvertes.fr/ensl-00517185

Y. Georgiou, O. Richard, and N. Capit, Evaluations of the Lightweight Grid CIGRI upon the Grid5000 Platform, Third IEEE International Conference on e-Science and Grid Computing (e-Science 2007), pp.279-286, 2007.
DOI : 10.1109/E-SCIENCE.2007.32

URL : https://hal.archives-ouvertes.fr/hal-00687520

F. Cappello, Grid'5000: A Large Scale, Reconfigurable, Controlable and Monitorable Grid Platform, 6th IEEE/ACM International Workshop on Grid Computing, 2005.
URL : https://hal.archives-ouvertes.fr/inria-00000284

C. McCurdy, A. L. Cox, and J. Vetter, Investigating the TLB Behavior of High-end Scientific Applications on Commodity Microprocessors, ISPASS 2008, IEEE International Symposium on Performance Analysis of Systems and Software, pp.95-104, 2008.
DOI : 10.1109/ISPASS.2008.4510742

R. Garg, S. W. Son, M. Kandemir, P. Raghavan, and R. Prabhakar, Markov Model Based Disk Power Management for Data Intensive Workloads, 2009 9th IEEE/ACM International Symposium on Cluster Computing and the Grid, pp.76-83, 2009.
DOI : 10.1109/CCGRID.2009.67