1. Which of the following returns a subset of the columns of a data frame?
a) select
b) retrieve
c) get
d) all of the mentioned
Answer: a [Reason:] select returns a subset of the columns of a data frame. More broadly, one important contribution of the dplyr package is that it provides a “grammar” for data manipulation and for operating on data frames.
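As a minimal sketch of the column-selection verb from question 1, the snippet below keeps two of three columns. The data frame `df` and its column names are hypothetical and exist only for illustration; the snippet assumes dplyr is already installed.

```r
library(dplyr)

# Hypothetical example data: three columns, three rows
df <- data.frame(
  id   = 1:3,
  city = c("A", "B", "C"),
  temp = c(20.5, 18.0, 25.1)
)

# Keep only the id and temp columns; all rows are retained
subset_cols <- select(df, id, temp)
print(names(subset_cols))
```

Columns are named without quotes, and the result is a new data frame; `df` itself is not modified.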
2. Point out the correct statement:
a) The data frame is a key data structure in statistics and in R
b) R has an internal implementation of data frames that is likely the one you will use most often
c) There are packages on CRAN that implement data frames via things like relational databases, which allow you to operate on very large data frames
d) All of the mentioned
Answer: d [Reason:] The basic structure of a data frame is that there is one observation per row and each column represents a variable, a measure, feature, or characteristic of that observation.
3. _________ extracts a subset of rows from a data frame based on logical conditions.
a) rename
b) filter
c) set
d) subset
Answer: b [Reason:] filter extracts a subset of rows from a data frame based on logical conditions, whereas rename is used to rename variables in a data frame.
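The difference between the row-filtering and renaming verbs in question 3 can be sketched as follows. The data frame `df`, its columns, and the threshold of 20 are illustrative assumptions, not part of the quiz.

```r
library(dplyr)

# Hypothetical example data
df <- data.frame(id = 1:4, temp = c(20.5, 18.0, 25.1, 30.2))

# filter keeps only the rows where the logical condition is TRUE
warm <- filter(df, temp > 20)

# rename changes a column name (new_name = old_name) and keeps every row
renamed <- rename(df, temperature = temp)

print(warm)
print(names(renamed))
```

Here filter changes which rows are returned, while rename changes only the column labels.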
4. _________ generates summary statistics of different variables in the data frame, possibly within strata.
a) rename
b) summarize
c) set
d) subset
Answer: b [Reason:] summarize generates summary statistics of variables in a data frame, possibly within strata. The dplyr package also has a number of its own data types that it takes advantage of.
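A small sketch of summarizing within strata, as described in question 4. The grouping column `city` and the values are made up for illustration; group_by is used here to define the strata.

```r
library(dplyr)

# Hypothetical example data: two observations per city
df <- data.frame(
  city = c("A", "A", "B", "B"),
  temp = c(20, 22, 10, 14)
)

# Define the strata with group_by, then compute one mean per stratum
by_city <- summarize(group_by(df, city), mean_temp = mean(temp))
print(by_city)

# The result is one of dplyr's own data types (a tibble)
print(class(by_city))
```

The output has one row per city, with the mean of `temp` within that city.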
5. Point out the wrong statement:
a) The dplyr package was developed by Hadley Wickham of RStudio
b) The dplyr package is an optimized and distilled version of his plyr package
c) The dplyr package provides “new” functionality to R
d) None of the mentioned
Answer: c [Reason:] The dplyr package does not provide any “new” functionality to R.
6. ________ adds new variables/columns or transforms existing variables
a) mutate
b) add
c) append
d) arrange
Answer: a [Reason:] mutate adds new variables or transforms existing ones, whereas arrange is used to reorder rows of a data frame.
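To contrast the two verbs named in question 6, here is a minimal sketch that first adds a column and then reorders rows. The column names `temp_c` and `temp_f` and the data are hypothetical.

```r
library(dplyr)

# Hypothetical example data: temperatures in Celsius
df <- data.frame(id = 1:3, temp_c = c(25, 10, 18))

# mutate adds a new column computed from an existing one
with_f <- mutate(df, temp_f = temp_c * 9 / 5 + 32)

# arrange reorders the rows (ascending by temp_c) without changing columns
sorted <- arrange(with_f, temp_c)

print(with_f)
print(sorted)
```

mutate changes the set of columns, while arrange changes only the row order.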
7. The _______ operator is used to connect multiple verb actions together into a pipeline
a) pipe
b) piper
c) start
d) all of the mentioned
Answer: a [Reason:] The pipe operator is written as %>%.
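The pipeline idea from question 7 can be sketched by chaining several verbs with %>%. The data and the threshold of 12 are illustrative assumptions; each step receives the result of the previous one as its first argument.

```r
library(dplyr)

# Hypothetical example data
df <- data.frame(
  city = c("A", "A", "B", "B", "C"),
  temp = c(20, 22, 10, 14, 30)
)

# Read top to bottom: filter rows, group, summarize, then sort
result <- df %>%
  filter(temp > 12) %>%
  group_by(city) %>%
  summarize(mean_temp = mean(temp)) %>%
  arrange(desc(mean_temp))

print(result)
```

Without the pipe, the same computation would need nested calls such as arrange(summarize(group_by(filter(...)))), which read inside out.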
8. The dplyr package can be installed from GitHub using the _______ package
a) dev
b) devtools
c) devtool
d) all of the mentioned
Answer: b [Reason:] The GitHub repository will usually contain the latest updates to the package and the development version.
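A hedged sketch of the GitHub installation route from question 8. The helper name `install_dplyr_dev` is hypothetical, and the repository path "tidyverse/dplyr" is an assumption about where the development version currently lives; the quiz itself does not name a repository. The function is only defined here, because calling it requires network access.

```r
# Hypothetical helper; calling it needs network access.
# The default repo path is an assumption, not taken from the quiz.
install_dplyr_dev <- function(repo = "tidyverse/dplyr") {
  # Install devtools first if it is not available
  if (!requireNamespace("devtools", quietly = TRUE)) {
    install.packages("devtools")
  }
  # install_github fetches and builds the package from the GitHub repository
  devtools::install_github(repo)
}
```

Calling install_dplyr_dev() would then install the development version rather than the CRAN release.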
9. The dplyr package can be installed from CRAN using:
a) installall.packages("dplyr")
b) install.packages("dplyr")
c) installed.packages("dplyr")
d) none of the mentioned
Answer: b [Reason:] install.packages("dplyr") installs the package from CRAN. After installing the package, it is important that you load it into your R session with the library() function.
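The install-then-load sequence from question 9 might look like the sketch below. The requireNamespace guard is an addition for convenience, so the package is only downloaded when it is missing; it is not part of the quiz answer.

```r
# Install from CRAN only if dplyr is not already available
if (!requireNamespace("dplyr", quietly = TRUE)) {
  install.packages("dplyr")
}

# Installing is not enough: the package must be attached to the session
library(dplyr)

# Check that dplyr is now on the search path
loaded <- "package:dplyr" %in% search()
print(loaded)
```

Installation is needed once per R library, whereas library(dplyr) must be run in every new session.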
10. Which of the following objects is masked from ‘package:stats’ when dplyr is loaded?
a) filter
b) union
c) setdifference
d) none of the mentioned
Answer: a [Reason:] Loading dplyr masks filter and lag from ‘package:stats’. The following objects are masked from ‘package:base’: intersect, setdiff, setequal, union.
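Masking, as in question 10, means that the bare name filter refers to dplyr's version once dplyr is loaded. A minimal sketch of reaching either version explicitly with the :: operator follows; the vector `x` and the three-point moving average are illustrative choices.

```r
library(dplyr)

x <- c(1, 2, 3, 4, 5)

# stats::filter applies a linear filter to a series,
# here a three-point moving average (the ends are NA)
ma <- stats::filter(x, rep(1 / 3, 3))

# dplyr::filter keeps the rows of a data frame that match a condition
rows <- dplyr::filter(data.frame(x = x), x > 3)

print(ma)
print(rows)
```

Writing the package prefix avoids ambiguity when two attached packages export the same name.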