  1. How does the MapReduce sort algorithm work? - Stack Overflow

    MapReduce's use of input files and lack of schema support prevents the performance improvements enabled by common database system features such as B-trees and hash …

  2. frameworks - Simple explanation of MapReduce? - Stack Overflow

    Aug 26, 2008 · MapReduce is a method to process vast sums of data in parallel without requiring the developer to write any code other than the mapper and reduce functions. The map function …

  3. Setting the number of map tasks and reduce tasks - Stack Overflow

    Jul 31, 2011 · For each input split a map task is spawned. So, over the lifetime of a mapreduce job the number of map tasks is equal to the number of input splits. mapred.map.tasks is just a …

  4. mapreduce - How to optimize shuffling/sorting phase in a hadoop …

    Dec 10, 2015 · mapreduce.reduce.shuffle.input.buffer.percent: How much of heap should be used for storing the map output, during the shuffle phase in the reducer. This setting determines the …

  5. what are the disadvantages of mapreduce? - Stack Overflow

    Sep 3, 2013 · What are the disadvantages of MapReduce? There are lots of advantages of MapReduce, but I would like to know the disadvantages too.

  6. Good MapReduce examples - Stack Overflow

    Sep 12, 2012 · MapReduce is a framework originally developed at Google that allows for easy large scale distributed computing across a number of domains. Apache Hadoop is an open …

  7. python - How to change the version of ... - Stack Overflow

    Apr 22, 2020 · By default, Spark (2.4.4) uses mapreduce.fileoutputcommitter.algorithm.version 1. I am trying to change it to version 2. The Spark UI and sparkCtx._conf.getAll() show version 2 but …

  8. c# - Map and Reduce in .NET - Stack Overflow

    Jan 9, 2009 · What scenarios would warrant the use of the "Map and Reduce" algorithm? Is there a .NET implementation of this algorithm?

  9. python - How can I use the Map_Reduce chain instead of the "stuff ...

    Jun 3, 2023 · How can I use mapreduce with this chain? Or is there any other way that can help me reduce the tokens being sent at a time? Secondly, I have noticed that sometimes the …

  10. mapreduce - Hadoop error in shuffle in fetcher#1 - Stack Overflow

    Nov 7, 2013 · I'm running a parsing job in Hadoop. The source is an 11GB map file with about 900,000 binary records, each representing an HTML file; the map extracts links and writes them …