Juan Tourifio’s scientific contributions

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (1)


Performance evaluation of big data frameworks for large-scale data analytics
  • Conference Paper
  • Full-text available

December 2016

·

2,709 Reads

·

81 Citations

·

·

·

[...]

·

Juan Tourifio

The increasing adoption of Big Data analytics has led to a high demand for efficient technologies in order to manage and process large datasets. Popular MapReduce frameworks such as Hadoop are being replaced by emerging ones like Spark or Flink, which improve both the programming APIs and performance. However, few works have focused on comparing these frameworks. This paper addresses this issue by performing a comparative evaluation of Hadoop, Spark and Flink using representative Big Data workloads and considering factors like performance and scalability. Moreover, the behavior of these frameworks has been characterized by modifying some of the main parameters of the workloads such as HDFS block size, input data size, interconnect network or thread configuration. The analysis of the results has shown that replacing Hadoop with Spark or Flink can lead to a reduction in execution times by 77% and 70% on average, respectively, for non-sort benchmarks.

Download

Citations (1)


... Despite this, human behaviour has been a focal point in environmental sustainability [43], but this research prioritized the system's characteristics like flexibility, modifiability, and time behaviour to effectively measure its environmental sustainability. Maintainability [23], [27], [34], [47], [52], [55]- [57] Predictability [38], [55], [58], [59] Dependability [38], [59]- [61] Fault tolerance [10], [15], [16], [23], [26], [62] Perdurability [23], [34], [49], [63] Understandability [57], [64], [65] Throughput [5], [16], [17], [20], [64], [66] Modularity [7], [23], [32], [52], [56], [63] Environmental Reusability [16], [22], [23], [38], [39], [47], [52], [55] Flexibility [20], [27], [47], [48], [64] Modifiability [17], [23], [27], [39], [47], [48], [52], [56], [59], [67] Time behavior [27], [32], [27] Availability [7], [10], [16], [17], [23], [27], [47], [64] Economic ...

Reference:

Sustainability dimensions in enhancing the energy and resource efficiency of big data systems
Performance evaluation of big data frameworks for large-scale data analytics