Generic selectors
Exact matches only
Search in title
Search in content
Search in posts
Search in pages
Filter by Categories
nmims post
Objective Type Set
Online MCQ Assignment
Question Solution
Solved Question
Uncategorized

Interview MCQ Set 1

1. As companies move past the experimental phase with Hadoop, many cite the need for additional capabilities, including:
a) Improved data storage and information retrieval
b) Improved extract, transform and load features for data integration
c) Improved data warehousing functionality
d) Improved security, workload management and SQL support

View Answer

Answer: d [Reason:] Adding security to Hadoop is challenging because all the interactions do not follow the classic client- server pattern.

2. Point out the correct statement :
a) Hadoop do need specialized hardware to process the data
b) Hadoop 2.0 allows live stream processing of real time data
c) In Hadoop programming framework output files are divided in to lines or records
d) None of the mentioned

View Answer

Answer: b [Reason:] Hadoop batch processes data distributed over a number of computers ranging in 100s and 1000s.

3. According to analysts, for what can traditional IT systems provide a foundation when they’re integrated with big data technologies like Hadoop ?
a) Big data management and data mining
b) Data warehousing and business intelligence
c) Management of Hadoop clusters
d) Collecting and storing unstructured data

View Answer

Answer: a [Reason:] Data warehousing integrated with Hadoop would give better understanding of data.

4. Hadoop is a framework that works with a variety of related tools. Common cohorts include:
a) MapReduce, Hive and HBase
b) MapReduce, MySQL and Google Apps
c) MapReduce, Hummer and Iguana
d) MapReduce, Heron and Trumpet

View Answer

Answer: a [Reason:] To use Hive with HBase you’ll typically want to launch two clusters, one to run HBase and the other to run Hive.

5. Point out the wrong statement :
a) Hardtop’s processing capabilities are huge and its real advantage lies in the ability to process terabytes & petabytes of data
b) Hadoop uses a programming model called “MapReduce”, all the programs should confirms to this model in order to work on Hadoop platform
c) The programming model, MapReduce, used by Hadoop is difficult to write and test
d) All of the mentioned

View Answer

Answer: c [Reason:] The programming model, MapReduce, used by Hadoop is simple to write and test.

6. What was Hadoop named after?
a) Creator Doug Cutting’s favorite circus act
b) Cutting’s high school rock band
c) The toy elephant of Cutting’s son
d) A sound Cutting’s laptop made during Hadoop’s development

View Answer

Answer: c [Reason:] Doug Cutting, Hadoop’s creator, named the framework after his child’s stuffed toy elephant.

7. All of the following accurately describe Hadoop, EXCEPT:
a) Open source
b) Real-time
c) Java-based
d) Distributed computing approach

View Answer

Answer: b [Reason:] Apache Hadoop is an open-source software framework for distributed storage and distributed processing of Big Data on clusters of commodity hardware.

8. __________ can best be described as a programming model used to develop Hadoop-based applications that can process massive amounts of data.
a) MapReduce
b) Mahout
c) Oozie
d) All of the mentioned

View Answer

Answer: a [Reason:] MapReduce is a programming model and an associated implementation for processing and generating large data sets with a parallel, distributed algorithm.

9. __________ has the world’s largest Hadoop cluster.
a) Apple
b) Datamatics
c) Facebook
d) None of the mentioned

View Answer

Answer: c [Reason:] Facebook has many Hadoop clusters, the largest among them is the one that is used for Data warehousing.

10. Facebook Tackles Big Data With _______ based on Hadoop.
a) ‘Project Prism’
b) ‘Prism’
c) ‘Project Big’
d) ‘Project Data’

View Answer

Answer: a [Reason:] Prism automatically replicates and moves data wherever it’s needed across a vast network of computing facilities.

Interview MCQ Set 2

1. _________ is a Cassandra feature that optimizes the cluster consistency process
a) Hinted handon
b) Hinted handoff
c) Tombstone
d) Hinted tomb

View Answer

Answer: b [Reason:] You can enable or disable hinted handoff in the cassandra.yaml file.

2. Point out the correct statement :
a) Cassandra does not immediately remove data marked for deletion from disk
b) A deleted column can reappear if you do not run node repair routinely
c) The deletion of marked data occurs during compaction
d) All of the mentioned

View Answer

Answer: d [Reason:] Marking data with a tombstone signals Cassandra to retry sending a delete request to a replica that was down at the time of delete.

3. Cassandra searches the __________ to determine the approximate location on disk of the index entry.
a) partition record
b) partition summary
c) partition search
d) all of the mentioned

View Answer

Answer: b [Reason:] If the Bloom filter does not rule out the SSTable, Cassandra checks the partition key cache.

4. You configure sample frequency by changing the ________ property in the table definition.
a) index_time
b) index_interval
c) index_secs
d) none of the mentioned

View Answer

Answer: b [Reason:] By default, the partition summary is a sample of the partition index.

5. Point out the wrong statement :
a) A hint indicates that a write needs to be replayed to one or more unavailable nodes
b) When the cluster cannot meet the consistency level specified by the client, Cassandra does store a hint
c) By default, hints are saved for three hours after a replica fails because if the replica is down longer than that, it is likely permanently dead
d) All of the mentioned

View Answer

Answer: b [Reason:] When the cluster cannot meet the consistency level specified by the client, Cassandra does not store a hint.

6. The compression offset map grows to ____ GB per terabyte compressed.
a) 1-3
b) 10-16
c) 20-22
d) 0-1

View Answer

Answer: a [Reason:] The more you compress data, the greater number of compressed blocks you have and the larger the compression offset table.

7. The type of __________ strategy Cassandra performs on your data is configurable and can significantly affect read performance.
a) compression
b) collection
c) compaction
d) decompression

View Answer

Answer: c [Reason:] Using the SizeTieredCompactionStrategy or DateTieredCompactionStrategy tends to cause data fragmentation when rows are frequently updated.

8. There are _________ types of read requests that a coordinator can send to a replica
a) two
b) three
c) four
d) all of the mentioned

View Answer

Answer: b [Reason:] The coordinator node contacts one replica node with a direct read request.

9. _________ can be configured per table for non-QUORUM consistency levels
a) Read repair
b) Read damage
c) Write repair
d) None of the mentioned

View Answer

Answer: a [Reason:] If the replicas are inconsistent, the coordinator issues writes to the out-of-date replicas to update the row to the most recent values. This process is known as read repair.

10. If the table has been configured with the __________ property, the coordinator node for the read request will retry the request with another replica node .
a) rapid_retry
b) speculative_retry
c) speculative_rapid
d) none of the mentioned

View Answer

Answer: b [Reason:] Rapid read protection allows Cassandra to still deliver read requests when the originally selected replica nodes are either down or taking too long to respond.

Interview MCQ Set 3

1. Which of the following tool is used for measuring I/O of your systems to estimate these transaction costs ?
a) EBS
b) IOSTAT
c) ESW
d) All of the mentioned

View Answer

Answer: b [Reason:] EBS is a service priced on the amount of storage space used.

2. Point out the wrong statement:
a) The cost of creating an EBS volume is lesser than creating a similarly sized S3 bucket
b) An EBS volume can be used as an instance boot partition
c) EBS boot partitions can be stopped and started, and they offer fast AMI boot times
d) None of the mentioned

View Answer

Answer: a [Reason:] The cost of creating an EBS volume is also greater than creating a similarly sized S3 bucket.

3. Which of the following is also referred to edge computing ?
a) CloudWave
b) CloudFront
c) CloudSpot
d) All of the mentioned

View Answer

Answer: b [Reason:] In edge computing, content is pushed out geographically so the data is more readily available to network clients and has a lower latency when requested.

4. CloudFront supports ______ data by performing static data transfers and streaming content from one CloudFront location to another.
a) table caching
b) geo caching
c) index caching
d) windows Media Server

View Answer

Answer: b [Reason:] A user requesting data from a CloudFront site is referred to the nearest geographical location.

5. Point out the correct statement:
a) A volume is mounted on a particular instance and is available to all instances
b) The advantages of an EBS boot partition are that you can have a volume up to 1TB
c) You cannot mount multiple volumes on a single instance
d) All of the mentioned

View Answer

Answer: b [Reason:] EBS is similar in concept to a Storage Area Network or SAN.

6. Data stored in __________ domains doesn’t require maintenance of a schema.
a) SimpleDB
b) SQL Server
c) Oracle
d) RDS

View Answer

Answer: a [Reason:] To create a high performance “simple” database, the data store created is flat; that is, it is non-relational and joins are not supported.

7. Which of the following is relational database service provided by Amazon ?
a) SimpleDB
b) SQL Server
c) Oracle
d) RDS

View Answer

Answer: d [Reason:] Amazon offers two different types of database services.

8. Which of the following can be considered as distributed caching system ?
a) CND
b) CDN
c) CWD
d) All of the mentioned

View Answer

Answer: b [Reason:] Amazon CloudFront is referred to as a content delivery network (CDN), and sometimes called edge computing.

9. Amazon Relational Database Service is a variant of the _______ 5.1 database system.
a) Oracle
b) MySQL
c) SQL Server
d) All of the mentioned

View Answer

Answer: b [Reason:] The purpose of RDS is to allow database applications that already exist to be ported to RDS and placed in an environment that is relatively automated and easy to use.

10. Which of the following database should be used for a solution that has a very high availability ?
a) SimpleDB
b) RDS
c) Amazon EC2
d) None of the mentioned

View Answer

Answer: a [Reason:] Use SimpleDB for the lowest administrative overhead.

Interview MCQ Set 4

1. Which of the following is a method for bidding on unused EC2 capacity based on the current spot price ?
a) On-Demand Instance
b) Reserved Instances
c) Spot Instance
d) All of the mentioned

View Answer

Answer: c [Reason:] This feature offers a significantly lower price, but it varies over time or may not be available when there is no excess capacity.

2. Point out the wrong statement:
a) The standard instances are not suitable for standard server applications
b) High memory instances are useful for large data throughput applications such as SQL Server databases and data caching and retrieval
c) FPS is exposed as an API that sorts transactions into packages called Quick Starts that makes it easy to implement
d) None of the mentioned

View Answer

Answer: a [Reason:] The standard instances are deemed to be suitable for standard server applications.

3. Which of the following instance has hourly rate with no long-term commitment ?
a) On-Demand Instance
b) Reserved Instances
c) Spot Instance
d) All of the mentioned

View Answer

Answer: a [Reason:] Pricing varies by zone, instance, and pricing model.

4. Which of the following is a batch processing application ?
a) IBM sMash
b) IBM WebSphere Application Server
c) Condor
d) Windows Media Server

View Answer

Answer: c [Reason:] Condor is a powerful, distributed batch-processing system that lets you use otherwise idle CPU cycles in a cluster of workstations.

5. Point out the correct statement:
a) Security can be set through passwords, Kerberos tickets, or certificates
b) Secure access to your EC2 AMIs is controlled by passwords, Kerberos, and 509 Certificates
c) Most of the system image templates that Amazon AWS offers are based on Red Hat Linux
d) All of the mentioned

View Answer

Answer: d [Reason:] Hundreds of free and paid AMIs can be found on AWS.

6. How many EC2 service zones or regions exist ?
a) 1
b) 2
c) 3
d) 4

View Answer

Answer: d [Reason:] There are four different EC2 service zones or regions.

7. Amazon ______ cloud-based storage system allows you to store data objects ranging in size from 1 byte up to 5GB.
a) S1
b) S2
c) S3
d) S4

View Answer

Answer: c [Reason:] In S3, storage containers are referred to as buckets.

8. Which of the following can be done with S3 buckets through the SOAP and REST APIs ?
a) Upload new objects to a bucket and download them
b) Create, edit, or delete existing buckets
c) Specify where a bucket should be stored
d) All of the mentioned

View Answer

Answer: d [Reason:] The REST API is preferred to the SOAP API, because it is easier to work with large binary objects with REST.

9. Which of the following operation retrieves the newest version of the object ?
a) PUT
b) GET
c) POST
d) COPY

View Answer

Answer: b [Reason:] Versioning also can be used for preserving data and for archiving purposes.

10. Which of the following statement is wrong about Amazon S3 ?
a) Amazon S3 is highly reliable
b) Amazon S3 provides large quantities of reliable storage that is highly protected
c) Amazon S3 is highly available
d) None of the mentioned

View Answer

Answer: c [Reason:] S3 excels in applications where storage is archival in nature.

Interview MCQ Set 5

1. Which of the following should be used considering factors shown in the figure ?
cloud-computing-aws-interview-questions-answers-q1
a) SimpleDB
b) RDS
c) Amazon EC2
d) All of the mentioned

View Answer

Answer: b [Reason:] Use RDS when you have an existing MySQL database that could be ported and you want to minimize the amount of infrastructure and administrative management required.

2. Point out the wrong statement:
a) Amazon Machine Instances are sized at various levels and rented on a computing/hour basis
b) The metrics obtained by CloudWatch may be used to enable a feature called Auto Scaling
c) A Number of tools are used to support EC2 services
d) None of the mentioned

View Answer

Answer: d [Reason:] Through hardware virtualization on Xen hypervisors, Amazon.com has made it possible to create private virtual servers that you can run worldwide.

3. Which of the following is an edge-storage or content-delivery system that caches data in different physical locations ?
a) Amazon Relational Database Service
b) Amazon SimpleDB
c) Amazon Cloudfront
d) Amazon Associates Web Services

View Answer

Answer: c [Reason:] Cloudfront is similar to systems such as Akamai.com, but is proprietary to Amazon.com and is set up to work with Amazon Simple Storage System (Amazon S3).

4. Which of the following allows you to create instances of the MySQL database to support your Web sites ?
a) Amazon Elastic Compute Cloud
b) Amazon Simple Queue Service
c) Amazon Relational Database Service
d) Amazon Simple Storage System

View Answer

Answer: c [Reason:] RDS provides features such as automated software patching, database backups, and automated database scaling via an API call.

5. Point out the correct statement:
a) Amazon Elastic Cloud is a system for creating virtual disks(volume)
b) SimpleDB interoperates with both Amazon EC2 and Amazon S3
c) EC3 is an Analytics as a Service provider
d) None of the mentioned

View Answer

Answer: b [Reason:] Amazon SimpleDB stores data in “buckets” and without requiring the creation of a database schema.

6. Which of the following is a structured data store that supports indexing and data queries to both EC2 and S3 ?
a) CloudWatch
b) Amazon SimpleDB
c) Amazon Cloudfront
d) All of the mentioned

View Answer

Answer: b [Reason:] SimpleDB isn’t a full database implementation.

7. Which of the following is the machinery for interacting with Amazon’s vast product data and eCommerce catalog function ?
a) Amazon Elastic Compute Cloud
b) Amazon Associates Web Services
c) Alexa Web Information Service
d) All of the mentioned

View Answer

Answer: b [Reason:] This service, which was called Amazon E-Commerce Service (ECS), is the means for vendors to add their products to the Amazon.com site and take orders and payments.

8. Which of the following is a billing and account management service ?
a) Amazon Elastic MapReduce
b) Amazon Mechanical Turk
c) Amazon DevPay
d) Multi-Factor Authentication

View Answer

Answer: c [Reason:] DevPay provides a developer API that eliminates the need for application developers to build order pipelines.

9. Which of the following is a means for accessing human researchers or consultants to help solve problems on a contractual or temporary basis ?
a) Amazon Elastic MapReduce
b) Amazon Mechanical Turk
c) Amazon DevPay
d) Multi-Factor Authentication

View Answer

Answer: b [Reason:] Problems solved by this human workforce have included object identification, video or audio recording, data duplication, and data research.

10. Which of the following is built on top of a Hadoop framework using the Elastic Compute Cloud ?
a) Amazon Elastic MapReduce
b) Amazon Mechanical Turk
c) Amazon DevPay
d) Multi-Factor Authentication

View Answer

Answer: a [Reason:] Amazon Elastic MapReduce is an interactive data analysis tool for performing indexing.