Advertisement Remove all ads

Big Data Analytics Semester 8 (BE Fourth Year) BE Computer Engineering University of Mumbai Topics and Syllabus

Advertisement Remove all ads

University of Mumbai Syllabus For Semester 8 (BE Fourth Year) Big Data Analytics: Knowing the Syllabus is very important for the students of Semester 8 (BE Fourth Year). Shaalaa has also provided a list of topics that every student needs to understand.

The University of Mumbai Semester 8 (BE Fourth Year) Big Data Analytics syllabus for the academic year 2021-2022 is based on the Board's guidelines. Students should read the Semester 8 (BE Fourth Year) Big Data Analytics Syllabus to learn about the subject's subjects and subtopics.

Students will discover the unit names, chapters under each unit, and subtopics under each chapter in the University of Mumbai Semester 8 (BE Fourth Year) Big Data Analytics Syllabus pdf 2021-2022. They will also receive a complete practical syllabus for Semester 8 (BE Fourth Year) Big Data Analytics in addition to this.

CBCGS [2019 - current]
CBGS [2015 - 2018]
Old [2000 - 2014]

University of Mumbai Semester 8 (BE Fourth Year) Big Data Analytics Revised Syllabus

University of Mumbai Semester 8 (BE Fourth Year) Big Data Analytics and their Unit wise marks distribution

University of Mumbai Semester 8 (BE Fourth Year) Big Data Analytics Course Structure 2021-2022 With Marking Scheme

Advertisement Remove all ads
Advertisement Remove all ads
Advertisement Remove all ads

Syllabus

C Introduction to Big Data
  • Introduction to Big Data, Big Data characteristics, types of Big Data, Traditional vs. Big Data business approach, Case Study of Big Data Solutions.
CC Introduction to Hadoop
  • What is Hadoop?
  • Core Hadoop Components
  • Hadoop Ecosystem
  • Physical Architecture
  • Hadoop limitations
CCC NoSQL
  • What is NoSQL? NoSQL business drivers; NoSQL case studies;
  • NoSQL data architecture patterns: Key-value stores, Graph stores, Column family (Bigtable) stores, Document stores, Variations of NoSQL architectural patterns;
  • Using NoSQL to manage big data:- What is a big data NoSQL solution? Understanding the types of big data problems; Analyzing big data with a shared-nothing architecture; Choosing distribution models: master-slave versus peer-to-peer; Four ways that NoSQL systems handle big data problems
CD MapReduce and the New Software Stack
401 Distributed File Systems
  • Physical Organization of Compute Nodes, LargeScale File-System Organization.
402 MapReduce
  • The Map Tasks, Grouping by Key, The Reduce Tasks, Combiners, Details of MapReduce Execution, Coping With Node Failures.
403 Algorithms Using MapReduce
  • Matrix-Vector Multiplication by MapReduce, Relational-Algebra Operations, Computing Selections by MapReduce, Computing Projections by MapReduce, Union, Intersection, and Difference by MapReduce, Computing Natural Join by MapReduce, Grouping and Aggregation by MapReduce, Matrix Multiplication, Matrix Multiplication with One MapReduce Step.
D Finding Similar Items
  • Applications of Near-Neighbor Search, Jaccard Similarity of Sets, Similarity of Documents, Collaborative Filtering as a Similar-Sets Problem.
  • Distance Measures:- Definition of a Distance Measure, Euclidean Distances, Jaccard Distance, Cosine Distance, Edit Distance, Hamming Distance.
DC Mining Data Streams
601 The Stream Data Model
  • A Data-Stream-Management System, Examples of Stream Sources, Stream Querie, Issues in Stream Processing.
602 Sampling Data in a Stream
  • Obtaining a Representative Sample, The General Sampling Problem, Varying the Sample Size.
603 Filtering Streams
  • The Bloom Filter, Analysis.
604 Counting Distinct Elements in a Stream
  • The Count-Distinct Problem, The Flajolet-Martin Algorithm, Combining Estimates, Space Requirements.
605 Counting Ones in a Window
  • The Cost of Exact Counts, The Datar-Gionis-Indyk-Motwani Algorithm, Query Answering in the DGIM Algorithm, Decaying Windows.
DCC Link Analysis
  • PageRank Definition, Structure of the web, dead ends, Using Page rank in a search engine, Efficient computation of Page Rank:- PageRank Iteration Using MapReduce, Use of Combiners to Consolidate the Result Vector.
  • Topic sensitive Page Rank, link Spam, Hubs and Authorities.
DCCC Frequent Itemsets
801 Handling Larger Datasets in Main Memory
  • Algorithm of Park, Chen, and Yu, The Multistage Algorithm, The Multihash Algorithm.
802 The Son Algorithm and MapReduce
803 Counting Frequent Items in a Stream
  • Sampling Methods for Streams, Frequent Itemsets in Decaying Windows.
CM Clustering
  • CURE Algorithm, Stream-Computing, A Stream-Clustering Algorithm, Initializing & Merging Buckets, Answering Queries.
M Recommendation Systems
  • A Model for Recommendation Systems, Content-Based Recommendations, Collaborative Filtering.
MC Mining Social-Network Graphs
  • Social Networks as Graphs, Clustering of Social-Network Graphs, Direct Discovery of Communities, SimRank, Counting triangles using MapReduce.
Advertisement Remove all ads
Advertisement Remove all ads
Share
Notifications

View all notifications


      Forgot password?
View in app×