Skip to main content
Carpentries Incubator

Carpentries Incubator
Big Data Engineering
  • Big Data Engineering
  • Key Points
  • Glossary
  • Learner Profiles
    • Reference
Big Data Engineering
%

Instructor View

Summary and Setup
1. Syllabus
2. Introduction
3. MapReduce Programming Paradigm
4. Spark Computing Environment
5. Data Parallel Computing with Spark
6. Page Rank
7. Locality Sensitive Hashing
8. Frequent Itemsets
9. Clustering
10. Recommendation Systems
11. Distributed Machine Learning with Spark

  • Key Points
  • Glossary
  • Learner Profiles
  • Reference

See all in one page

Syllabus


Introduction


MapReduce Programming Paradigm


Spark Computing EnvironmentSpark computing environment


Data Parallel Computing with SparkData parallel computing with Spark


Page RankPage Rank


Locality Sensitive HashingLocality Sensitive Hashing


Frequent ItemsetsFrequent Itemsets


ClusteringClustering


Recommendation SystemsRecommendation Systems


Distributed Machine Learning with SparkDistributed machine learning with Spark



This lesson is subject to the Code of Conduct

Edit on GitHub | Contributing | Source

Cite | Contact | About

Materials licensed under CC-BY 4.0 by the authors

Template licensed under CC-BY 4.0 by The Carpentries

Built with sandpaper (0.14.0), pegboard (0.7.1), and varnish (0.3.1).


Back To Top