Enreal: An energyaware resource allocation method for scientific workflow executions in cloud environment, IEEE Transactions on Cloud Computing, vol.4, issue.2, pp.166-179, 2016. ,
Leaky buffer: A novel abstraction for relieving memory pressure from cluster data processing frameworks, IEEE Transactions on Parallel Distributed System, vol.28, issue.1, pp.128-140, 2017. ,
Blotter: Low latency transactions for geo-replicated storage, Proceedings of WWW, pp.3-7, 2017. ,
Efficient and secure auditing scheme for outsourced big data with dynamicity in cloud, SCIENCE CHINA Information Sciences, vol.61, issue.12, p.15, 2018. ,
Robust big data analytics for electricity price forecasting in the smart grid, IEEE Transactions on Big Data, vol.5, issue.1, pp.34-45, 2019. ,
Mapreduce: simplified data processing on large clusters, Communications of the ACM, vol.51, issue.1, pp.107-113, 2008. ,
Maptask scheduling in mapreduce with data locality: Throughput and heavy-traffic optimality, IEEE/ACM Transactions on Networking, vol.24, issue.1, pp.190-203, 2016. ,
The memory challenge in reduce phase of mapreduce applications, IEEE Transactions on Big Data, vol.2, issue.4, pp.380-386, 2016. ,
Minimizing makespan and total completion time in mapreduce-like systems, Proceedings of INFOCOM, 2014. ,
Efficient coflow scheduling with varys, Proceedings of SIGCOMM, pp.17-22, 2014. ,
VL2: a scalable and flexible data center network, Proceedings of SIGCOMM, pp.16-21, 2009. ,
Leveraging endpoint flexibility in data-intensive clusters, Proceedings of SIGCOMM, pp.12-16, 2013. ,
Delay scheduling: a simple technique for achieving locality and fairness in cluster scheduling, Proceedings of EuroSys, pp.13-16 ,
Shufflewatcher: Shuffle-aware scheduling in multi-tenant mapreduce clusters, Proceedings of USENIX ATC, 2014. ,
The power of choice in data-aware cluster scheduling, Proceedings of OSDI, 2014. ,
The case for evaluating mapreduce performance using workload suites, Proceedings of MASCOTS, pp.25-27 ,
Approxhadoop: Bringing approximations to mapreduce frameworks, Proceedings of ASPLOS, 2015. ,
The power of two choices in randomized load balancing, IEEE Transactions on Parallel and Distributed Systems, vol.12, issue.10, pp.1094-1104, 2001. ,
Adaptdb: Adaptive partitioning for distributed joins, PVLDB, vol.10, issue.5, pp.589-600, 2017. ,
The google file system, Proceedings of SOSP, 2003. ,
Arrayudf: User-defined scientific data analysis on arrays, Proceedings of HPDC, 2017. ,
Parallelizing sequential graph computations, Proceedings of SIGMOD, 2017. ,
Managing data transfers in computer clusters with orchestra, Proceedings of SIGCOMM, 2011. ,
Network-aware scheduling for dataparallel jobs: Plan when you can, Proceedings of SIG-COMM, pp.17-21, 2015. ,
Decentralized task-aware scheduling for data center networks, Proceedings of SIGCOMM, pp.17-22, 2014. ,
Optimizing energy, locality and priority in a mapreduce cluster, Proceedings of International Conference on Autonomic Computing, 2015. ,
vlocality: Revisiting data locality for mapreduce in virtualized clouds, IEEE Network, vol.31, issue.1, pp.28-35, 2017. ,
Locality-aware reduce task scheduling for mapreduce, Proceedings of IEEE CloudCom, 2011. ,
Bashuffler: Maximizing network bandwidth utilization in the shuffle of YARN, Proceedings of HPDC, 2016. ,
Optimizing mapreduce framework through joint scheduling of overlapping phases, Proceedings of ICCCN, pp.1-4 ,
CODA: toward automatically identifying and scheduling coflows in the dark, Proceedings of SIG-COMM, pp.22-26, 2016. ,
Reining in the outliers in map-reduce clusters using mantri, Proceedings of OSDI, pp.4-6, 2010. ,
Sailfish: a framework for large scale data processing, Proceedings of SOCC, pp.14-17, 2012. ,
Themis: an i/o-efficient mapreduce, Proceedings of SOCC, pp.14-17, 2012. ,
Riffle: optimized shuffle service for large-scale data analytics, Proceedings of EuroSys, pp.23-26 ,
, , 2018.
Efficient shuffle management with scache for DAG computing frameworks, Proceedings of PPoPP, pp.24-28, 2018. ,
Making sense of performance in data analytics frameworks, Proceedings of NSDI, pp.4-6, 2015. ,
Network requirements for resource disaggregation, Proceedings of OSDI, 2016. ,
On the [ir]relevance of network performance for data processing, Proceedings of HotCloud, pp.20-21, 2016. ,
Scale-out networking in the data center, IEEE Micro, vol.30, issue.4, pp.29-41, 2010. ,
, Proceedings of SIGCOMM, 2010.
Inside the social network's (datacenter) network, Proceedings of SIGCOMM, pp.17-21, 2015. ,
Surviving failures in bandwidthconstrained datacenters, Proceedings of SIGCOMM, pp.13-17, 2012. ,
Flow stealer: lightweight load balancing by stealing flows in distributed SDN controllers, SCIENCE CHINA Information Sciences, vol.60, issue.3, p.32202, 2017. ,
Ernest: Efficient performance prediction for large-scale advanced analytics, Proceedings of NSDI, pp.16-18, 2016. ,
Mapreduce with communication overlap (marco), Journal of Parallel and Distributed Computing, vol.73, pp.608-620, 2013. ,
Hopper: Decentralized speculation-aware cluster scheduling at scale, Proceedings of SIGCOMM, pp.17-21, 2015. ,
On the diversity of cluster workloads and its impact on research results, Proceedings of USENIX ATC, pp.11-13, 2018. ,
Effective straggler mitigation: Attack of the clones, Proceedings of NSDI, 2013. ,
Energyefficient speculative execution using advanced reservation for heterogeneous clusters, Proceedings of ICPP, pp.13-16, 2018. ,
URL : https://hal.archives-ouvertes.fr/hal-01807496
Interactive analytical processing in big data systems: A cross-industry study of mapreduce workloads, PVLDB, vol.5, issue.12, pp.1802-1813, 2012. ,