因?yàn)?/c>谷歌發(fā)明了平行MapReduce范式。
As a result of this paper, many open source implementations of MapReduce emerged between 2004 to the present.
這篇文章導(dǎo)致的結(jié)果是,從2004年到現(xiàn)在出現(xiàn)了許多開(kāi)放源碼的MapReduce實(shí)現(xiàn)。
In a cloud environment, the MapReduce structure increases the efficiency of throughput for large data sets.
在云環(huán)境中,MapReduce結(jié)構(gòu)提高了大型數(shù)據(jù)集的吞吐效率。
Google introduced the idea of MapReduce as a programming model for processing or generating large sets of data.
Google引用MapReduce的概念作為處理或生成大型數(shù)據(jù)集的編程模型。
Next in the MapReduce life cycle: All of the intermediate results get boiled down and summarized.
在MapReduce生命周期中,下一步是濃縮和匯總所有中間結(jié)果。
In this example, you see the MapReduce process on a small set of data.
在這個(gè)示例中,我們使用MapReduce處理一個(gè)小數(shù)據(jù)集。
Different sets of input files were provided, each of different size, and executed the MapReduce tasks in both single- and two-node clusters.
提供了不同的輸入文件集,每個(gè)文件集的大小不同,并在單節(jié)點(diǎn)和雙節(jié)點(diǎn)集群中執(zhí)行MapReduce任務(wù)。
First, MapReduce does have associated projects for supporting declarative languages.
首先,MapReduce有相關(guān)計(jì)劃支持的聲明的語(yǔ)言。
In the overall context of your cluster, this is a very small amount of data, but it is critical to running a MapReduce job.
在你集群的整個(gè)環(huán)境下,這些雖然都是很少量的數(shù)據(jù),但是對(duì)運(yùn)行MapReduce工作來(lái)說(shuō)非常重要。
HBase is a database representation over Hadoop's HDFS, permitting MapReduce to operate on database tables over simple files.
HBase是數(shù)據(jù)庫(kù)在Hadoop的HDFS上的表現(xiàn),在簡(jiǎn)單文件上執(zhí)行MapReduce以操作數(shù)據(jù)庫(kù)表。
I'm looking at performance modeling of Pig programs, which in essence represent dags of MapReduce jobs.
我正在建立Pig程序的性能模型,實(shí)際上它代表著分布式計(jì)算系統(tǒng)(mapreduce)工作量的多少。
There is no shortage of cloud-based MapReduce options available both as open source and commercial offerings.
基于云的MapReduce系統(tǒng)既有開(kāi)放源碼的,也有商用產(chǎn)品。
The advantage of MapReduce is that it allows for the distributed processing of the map and reduction operations.
MapReduce的優(yōu)點(diǎn)是它允許對(duì)映射和縮減操作進(jìn)行分布式處理。
After successful completion, the output of the MapReduce execution is available in the R output files.
完成后,MapReduce的執(zhí)行結(jié)果被保存在R個(gè)輸出文件。
MapReduce. The software is free and runs on cheap clusters of computers, rather than expensive servers and hardware.
軟件是免費(fèi)的,在廉價(jià)的計(jì)算機(jī)集群運(yùn)行,而不是昂貴的服務(wù)器和硬件。
MapReduce programs are designed to compute large volumes of data in a parallel fashion.
MapReduce程序用于以并行方式計(jì)算大量數(shù)據(jù)。
The third parameter is the name of the object that holds the results of MapReduce.
第三個(gè)參數(shù)是持有MapReduce結(jié)果的對(duì)象的名稱。
When working with MapReduce, the map function's job is to pick some value to work off of, along with some key.
當(dāng)使用MapReduce進(jìn)行處理時(shí),map函數(shù)的任務(wù)是選擇一些要處理的值,以及一些鍵。
This time you are extending the org. apache. hadoop. mapreduce. Reducer class and implementing its reduce method.
這一次,您將擴(kuò)展org.apache.hadoop.mapreduce.Reducer類并實(shí)現(xiàn)其reduce方法。
Instead of traditional file processing, HBase makes database tables the input and output form for MapReduce processing.
代替傳統(tǒng)的文件處理,HBase使數(shù)據(jù)庫(kù)將MapReduce處理的輸入和輸出格式列表。
The scenario is one where a startup company wishes to offer a MapReduce service to their clients.
這個(gè)情景是關(guān)于一個(gè)新起步的公司,他們希望為他們的客戶提供MapReduce服務(wù)。
The Google operational model is based on deploying MapReduce applications across large clusters of commodity systems, or white boxes.
Google操作模型是基于跨越大量的廉價(jià)硬件設(shè)備上組成的集群或者白盒子上面部署MapReduce應(yīng)用。
The combination of cloud computing and MapReduce seems tailored for big data jobs.
云計(jì)算和MapReduce的組合看起來(lái)非常適合處理BigData作業(yè)。