Big Data Online Practice Test - 12
This test covers Big Data end to end, with important questions ranging from basic to advanced level.
Q. A lead was explaining the concept of erasure coding in Hadoop 3.0 to his team.
Suppose there are two data blocks, DB1 and DB2, of a file.
Both of them will be replicated as follows:
DB1 will have DB1.1 and DB1.2.
DB2 will have DB2.1 and DB2.2.
DBp is a single parity block.
Consider the following statements.
1) The third copy, DBp, is the result of XOR, that is, DBp = (DB1.1 xor DB2.1).
2) The third copy, DBp, is the result of OR, that is, DBp = (DB1.1 or DB2.1).
3) In case of failure of DB1.1, the copy will first be recovered from the parity block as DB1.1 = DBp xor DB2.1.
4) In case of failure of DB1.1, the copy will first be recovered from DB1.2, and then from the parity block.
Mark the correct option.
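For readers unfamiliar with parity blocks, the XOR relationship named in the statements can be tried out on plain byte strings. This is only a minimal sketch: the helper `xor_blocks` and the sample block contents are hypothetical, and it does not reflect how HDFS lays out or encodes blocks internally.

```python
def xor_blocks(a: bytes, b: bytes) -> bytes:
    """Byte-wise XOR of two equal-length blocks (hypothetical helper)."""
    if len(a) != len(b):
        raise ValueError("blocks must have equal length")
    return bytes(x ^ y for x, y in zip(a, b))


# Made-up block contents, chosen only for illustration.
db1 = b"hadoop-1"
db2 = b"hadoop-2"

# Parity block computed from the two data blocks.
dbp = xor_blocks(db1, db2)

# XOR-ing the parity with one data block yields the other data block.
recovered_db1 = xor_blocks(dbp, db2)
print(recovered_db1 == db1)
```

The same call with `db1` in place of `db2` yields `db2`, because XOR is its own inverse.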
Q. Consider the following situation. The Hive query is:
select * from ports sort by numberofships;
On execution, suppose the two reducers receive rows containing the following numberofships values:
Reducer 1 - 12, 24, 8
Reducer 2 - 6, 16, 14
What is the overall output produced?
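In Hive, SORT BY orders rows within each reducer, whereas ORDER BY produces a single total ordering. The sketch below imitates that difference on the values from the question using plain Python lists. The function names `sort_by` and `order_by` are hypothetical stand-ins, not Hive APIs, and the order in which reducer outputs are concatenated is not modelled.

```python
# Rows received by each reducer, taken from the question.
reducer_inputs = {
    "reducer_1": [12, 24, 8],
    "reducer_2": [6, 16, 14],
}


def sort_by(partitions):
    """Imitate SORT BY: each reducer sorts only its own rows."""
    return {name: sorted(rows) for name, rows in partitions.items()}


def order_by(partitions):
    """Imitate ORDER BY: all rows are sorted together for a total order."""
    return sorted(row for rows in partitions.values() for row in rows)


per_reducer = sort_by(reducer_inputs)
total = order_by(reducer_inputs)

for name, rows in per_reducer.items():
    print(name, rows)
print("order by:", total)
```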
Q. A developer was told to display all the countries from the Travel collection where VisaOnArrival was null.
Mark the correct option.
Q. Below are a few statements about many clients writing into an HDFS file simultaneously.
1) When the Namenode gives a client permission to write to a file, only that client can start working on the file.
2) The Namenode can grant access to two clients to write to the same file at a time.
3) When a file is opened for writing in HDFS by a client, the Datanode gives it the required permission.
4) The Namenode allows access to the first client. The second client gets access from the Datanode.
5) Hadoop follows a single-writer, many-readers model.
Classify each of these as correct (C) or incorrect (InC).
Q. The following statements are written about YARN.
1) YARN is a replacement for MapReduce in Hadoop 2.0.
2) YARN handles the scheduling and monitoring of jobs in Hadoop 2.0.
3) MapReduce runs on top of the YARN architecture.
4) YARN has a resource manager for each cluster.
Mark the correct option.
Q. Consider the following code snippet:
var A = Set(1, 2, 3);
var B = Set(4, 5, 6);
What is the output of A.&(B)?
Q. Below are a few features. Mark whether each is supported by HBase, Impala, or both.
1) The implementation language is C++.
2) There is no support of SQL.
3) Offers APIs such as Thrift and a RESTful HTTP API.
4) Supports sharding for storing data on different nodes.
Mark the correct option.
Q. In a quiz, participants were asked to mark the following statements about the Pig framework as True or False.
1) PigStorage() is a case-sensitive function.
2) The default mode of Pig is local mode.
3) The ORDER BY command is not used for sorting.
4) Pig Latin is used to specify the data flow.
Choose the appropriate option.
Q. Which Framework Am I?
I carry massive amounts of data from many sources to a centralized store.
I am very reliable, distributed and configurable!
I was mainly developed to collect streaming data from web servers to HDFS.
I allow data collection in batch as well as streaming mode.
Mark the correct option.
Q. Read the following statements regarding Sqoop imports.
1) Sqoop jobs have 1 default reduce task.
2) The number of mappers can be modified by passing the --n-mappers argument to the job.
3) The maximum number of mappers set by Sqoop is 10.
4) A larger number of mappers means more efficient performance.
Mark the correct option.