Skip to main content
Carpentries Incubator

Carpentries Incubator
Big Data Engineering
  • Big Data Engineering
  • Key Points
  • Instructor Notes
  • Extract All Images
    Big Data Engineering
    %

    Learner View

    Summary and Schedule
    1. Introduction
    2. MapReduce Programming Paradigm
    3. Spark Computing Environment
    4. Data Parallel Computing with Spark
    5. Page Rank
    6. Locality Sensitive Hashing
    7. Frequent Itemsets
    8. Clustering
    9. Recommendation Systems
    10. Distributed Machine Learning with Spark

    • Key Points
    • Instructor Notes
    • Extract All Images

    See all in one page

    Instructor Notes

    This is a placeholder file. Please add content here.

    Syllabus


    Introduction


    MapReduce Programming Paradigm


    Spark Computing EnvironmentSpark computing environment


    Data Parallel Computing with SparkData parallel computing with Spark


    Page RankPage Rank


    Locality Sensitive HashingLocality Sensitive Hashing


    Frequent ItemsetsFrequent Itemsets


    ClusteringClustering


    Recommendation SystemsRecommendation Systems


    Distributed Machine Learning with SparkDistributed machine learning with Spark



    This lesson is subject to the Code of Conduct

    Edit on GitHub | Contributing | Source

    Cite | Contact | About

    Materials licensed under CC-BY 4.0 by the authors

    Template licensed under CC-BY 4.0 by The Carpentries

    Built with sandpaper (0.14.0), pegboard (0.7.1), and varnish (0.3.1).


    Back To Top