代写Data Science-Computational Thinking代写-INF-549代写
代写Data Science

代写Data Science-Computational Thinking代写-INF-549代写

INF-549: Introduction to Computational Thinking and Data Science

Term: Spring 2019

代写Data Science You can complete this assignment in wings or jupyter notebook. Follow instructions for ONE of these two below.

Homework 6

Goals

The purpose of this homework is to see how datasets can be processed in parallel.

Assignment Options  代写Data Science

You can complete this assignment in wings or jupyter notebook. Follow instructions for ONE of these two below.

Assignment in Wings

You should have access to the Wings workflow system at: http://datascience4all.org/wings-portal/.

1.Parallelcomputations

a.Select the CaesarCypherParallel domain. Choose 2 or 3 poems or the lyrics of several songs. Upload them to Wings.

b.Run the CaesarCypherIndependent on those files (use the shift key to select more than one file from the drop-downmenu).

i.Include the workflow diagram before you run the workflow.

ii.How many output files did the workflowgenerate?

c.Run the CaesarCypherMap on one of the files, use the numberOfChunks parameter to dividethe processing.  代写Data Science

i.How many output files did the workflowgenerate?

d.Which workflow would you use to process a collection ofdocuments?

2.MapReduceParallelism

a.Run the CaesarCypherMapReduce on one of your poems.

i.Include the workflow diagram before you run the workflow.

ii.How many output files did the workflowgenerate?

b.Explain why the CaesarCypherMapReduce uses a MapReduce approach while CaesarCypherMap does not.  代写Data Science

c.Discuss why or why not encrypting files is an embarrassingly parallel problem.

3.Parallelism and criticalpaths

a.Describe a problem where a MapReduce approach would make processing more efficient.

i.Sketch (draw) a workflow for thatproblem.

b.Describe a problem where parallel processing would only help in somesteps

i.Sketch (draw) a workflow for that problem.

代写Data Science
代写Data Science

Assignment in Notebook  代写Data Science

You can access the materials for the homework on jupyter notebook here: https://github.com/KnowledgeCaptureAndDiscovery/INF549/.

Within that folder, the instructions for using the notebooks be found in this file: Instructions about Installing Jupyter Notebook.pdf

This assignment is to complete folder Assignment5_ParallelProcessing. Turn in your results for each part below as per the instructions on notebooks.

  1. Complete the8_Parallel Processing of Data.ipynb notebook
  2. Complete the9_Parallel Processing of Data Using MapReduce.ipynb notebook
  3. Complete the10_Processing Datasets Independently.ipynb notebook

IMPORTANT NOTES  代写Data Science

Plagiarism – presenting someone else’s ideas as your own, either verbatim or recast in your own words – is a serious academic offense with serious consequences. Please familiarize yourself with the discussion of plagiarism in SCampus in Section 11, Behavior Violating University Standards https://scampus.usc.edu/1100-behavior-violating-university-standards-and-appropriate-

sanctions. Other forms of academic dishonesty are equally unacceptable. See additional information in SCampus and university policies on scientific misconduct, http://policy.usc.edu/scientific-misconduct.

A number of USC’s schools provide support for students who need help with scholarly writing. Check with your advisor or program staff to find out more. Students whose primary language is not English should check with the American Language Institute http://dornsife.usc.edu/ali, which sponsors courses and workshops specifically for international graduate students.

For more information, see the class syllabus and the USC web site.

 

更多代写:北美统计网课代上看  托业成绩作弊  编程代写多少钱  北美演讲稿代写  澳大利亚文科paper代写  枪手代考英文

合作平台:essay代写 论文代写 写手招聘 英国留学生代写

代写Data Science
代写Data Science

发表回复