Carpentries Incubator

Big Data Engineering


    Summary and Schedule
    1. Syllabus
    2. Introduction
    3. MapReduce Programming Paradigm
    4. Spark Computing Environment
    5. Data Parallel Computing with Spark
    6. Page Rank
    7. Locality Sensitive Hashing
    8. Frequent Itemsets
    9. Clustering
    10. Recommendation Systems
    11. Distributed Machine Learning with Spark





    This lesson is subject to the Code of Conduct


    Materials licensed under CC-BY 4.0 by the authors

    Template licensed under CC-BY 4.0 by The Carpentries

    Built with sandpaper (0.14.0), pegboard (0.7.1), and varnish (0.3.1).

