MUTHAYAMMAL ENGINEERING COLLEGE
(An Autonomous Institution)
(Approved by AICTE, New Delhi, Accredited by NAAC & Affiliated to Anna University)
Rasipuram - 637 408, Namakkal Dist., Tamil Nadu.
Department of Master of Computer Applications
Question Bank - Academic Year (2020-21)
Course Code & Course Name
:
19CAB13 & Big Data Analytics
Year/Sem/Sec
:
I / II
Unit-I: INTRODUCTION TO BIG DATA
Part-A (2 Marks)
What is Big Data?
Differentiate Analysis from Reporting.
What is predictive analysis?
Define Big Data and mention any two sources of generating it.
Explain sampling distribution with an example.
What are the three V’s of Big Data?
What is sampling?
Explain prediction error.
Write any two applications of Big Data Analytics.
What are the types of analytics?
Unit-II : MINING DATA STREAMS
Part-A (2 Marks)
What do you mean by Stream computing?
Explain Filtering streams.
What is decaying window?
What is data stream model?
Part-B (16 Marks)
1.
Explain in detail the modern data analytic tools.
(16)
2.
Explain in detail the intelligent data analysis.
(16)
3.
Explain about i. Challenges of conventional systems (8)
ii. Analytical processes and tools (8)
iii.
4.
Explain in detail Analysis Vs Reporting.
(16)
5.
Discuss Sampling distribution concepts
(16)
What is meant by one-time queries?
What is Bloom Filter?
What is Sentiment Analysis?
. List out the applications of RTAP.
What is Stock Market prediction?
How will you count ones in a window?
Unit-III : HADOOP ENVIRONMENT
Part-A (2 Marks)
What is Hadoop?
What are the components of HDFS?
What is MapReduce and what are its phases?
Explain YARN.
What are the components of Hadoop?
What is Hadoop and what are its components?
What are the phases in MapReduce?
Explain Scaling Out.
How does the namenode track the existence of data node?
What are the functions of Datanode?
Unit-IV : DATA ANALYSIS SYTEMS AND VISUALIZATION
Part-A (2 Marks)
Explain Page Rank.
What are basic charts?
Part-B (16 Marks)
1.
Discuss the stream data model and architecture
(16)
2.
Discuss the RealTime Analytics Platform Applications.
(16)
3.
Discuss the Filtering Streams.
(16)
4.
Discuss Real Time Sentiment Analysis.
(16)
5.
Write a detailed note on estimating moments.
(16)
Part-B (16 Marks)
1.
Explain in detail the HDFS.
(16)
2.
Discuss the MapReduce process.
(16)
3.
Explain the procedure in setting up a hadoop cluster.
(16)
4.
Write about the wordcount mapreduce hadoop application.
(16)
5.
Explain the MapReduce architecture and it’s working.
(16)
Explain Link spam.
What is visualization?
What is Recommendation system?
What do you mean by Collaborative filtering?
Explain Content based recommendation.
What is meant by dimensionality reduction?
What are the three major goals in visualization?
Explain interactive visualization.
Unit-V : DATA ANALYTICS USING PYTHON
Part-A (2 Marks)
What is data analytics?
How will you create a Series?
What is Jupiter notebook?
What do you mean by data frame?
What is List?
How will you display the content of a data frame?
Write the procedure to add a column to a data frame.
How will you create a sub data frame?
Explain descriptive statistics.
How to add a compute column in a data frame?
Course Faculty
HoD
Part-B (16 Marks)
1.
Explain the visual data analysis techniques.
(16)
2.
Write a detailed note on recommendation systems.
(16)
3.
Explain the efficient computation of PageRank.
(16)
4.
Discuss the Model for Recommendation Systems.
(16)
5.
Discuss visual interactive technique’s systems and applications.
(16)
Part-B (16 Marks)
1.
Write a detail note on Pandas module.
(16)
2.
Explain Series, Data Frame and Panel.
(16)
3.
Discuss about different ways of creating data frames.
(16)
4.
Explain the procedure to work with columns and rows in a data frame.
(16)
5.
Explain the different ways of handling missing data.
(16)