top of page

Map Reduce Programming Help | Map-Reduce Example in Python | Realcode4you

realcode4you

Word

Count Suppose we have a text file consisting of multiple lines and we wish to find the count of each word appearing in that file. We will use the MapReduce framework to do that, as follows:

  • First, randomly select some text content and save them into a text file. Here, I copied the definition of MapReduce in wiki (https://en.wikipedia.org/wiki/MapReduce) and saved it into ‘MapReduce_wiki.txt’.


  • Then, define the mapper function to split each input line to a list of words, and output (word, 1) for each word found;

  • Next, define the reducer function to sum over the number of 1s for each word, and outputthe count of each word.



from mrjob.job import MRJob
import re
WORD_REGEX = re.compile(r"[\w]+")
class WordCount(MRJob):
	def mapper(self, _, line):
		for word in WORD_REGEX.findall(line):
			yield word.lower(), 1
	def reducer(self, word, counts):
		yield word, sum(counts)
if __name__ == "__main__":
	WordCount.run()

  • Finally, testing on your computer: ‘python WordCount.py

MapReduce_wiki.txt >output_wordcount.txt’ (open command prompt/interpreter (cmd.exe) and change the current working directory/folder to the one in which your python document ‘WordCount.py’ and input file ‘MapReduce_wiki.txt’ are stored. For example, my python document and input file are stored in 'D:\COMP6210\Example').






















Big data technologies became the popular in the recent time due to large demand of data analysis firm. Their are many other big data technologies rather than map reduce like; PySpark, Hive and Hadoop and more others.


We are group of experienced Big data experts and professionals that will help you to do your all big data related project with an reasonable price. For more details you can contact us or send your project requirement details at below mail id:


realcode4you@gmail.com

1 Comment


Nikhil Ji
Nikhil Ji
Nov 04, 2023

"Statistics Assignment Help" is an invaluable resource for students and professionals seeking assistance with their statistical assignments. The service provides expert guidance and support in understanding complex statistical concepts, analyzing data, and completing assignments accurately. With a team of experienced statisticians and educators, it ensures that students receive the help they need to excel in their studies. Whether you're struggling with hypothesis testing, regression analysis, or any other statistical topic, "Statistics Assignment Help" is a reliable and efficient solution to boost your understanding and academic performance.


Like

REALCODE4YOU

Realcode4you is the one of the best website where you can get all computer science and mathematics related help, we are offering python project help, java project help, Machine learning project help, and other programming language help i.e., C, C++, Data Structure, PHP, ReactJs, NodeJs, React Native and also providing all databases related help.

Hire Us to get Instant help from realcode4you expert with an affordable price.

USEFUL LINKS

Discount

ADDRESS

Noida, Sector 63, India 201301

Follows Us!

  • Facebook
  • Twitter
  • Instagram
  • LinkedIn

OUR CLIENTS BELONGS TO

  • india
  • australia
  • canada
  • hong-kong
  • ireland
  • jordan
  • malaysia
  • new-zealand
  • oman
  • qatar
  • saudi-arabia
  • singapore
  • south-africa
  • uae
  • uk
  • usa

© 2023 IT Services provided by Realcode4you.com

bottom of page