Choose a topic to test your knowledge and improve your Machine Learning (ML) skills
The effectiveness of an SVM depends upon:
The process of forming general concept definitions from examples of concepts to belearned.
Computers are best at learning
Data used to build a data mining model.
Supervised learning and unsupervised clustering both require at least one
Supervised learning differs from unsupervised clustering in that supervised learning requires
A regression model in which more than one independent variable is used to predict the dependent variable is called
A term used to describe the case when the independent variables in a multiple regression modelare correlated is
A multiple regression model has the form: y = 2 + 3x1 + 4x2. As x1 increases by 1 unit (holding x2constant), y will
A multiple regression model has
A measure of goodness of fit for the estimated regression equation is the
The adjusted multiple coefficient of determination accounts for
The multiple coefficient of determination is computed by
For a multiple regression model, SST = 200 and SSE = 50. The multiple coefficient ofdetermination is
A nearest neighbor approach is best used
Another name for an output attribute.
Classification problems are distinguished from estimation problems in that
Which statement is true about prediction problems?
Which of the following is a common use of unsupervised clustering?
The average positive difference between computed and desired outcome values.
Selecting data so as to assure that each class is properly represented in both the training andtest set.
The standard error is defined as the square root of this computation.
Data used to optimize the parameter settings of a supervised learner model.
Bootstrapping allows us to
The average squared difference between classifier predicted output and actual output.
Simple regression assumes a __________ relationship between the input attribute and outputattribute.
Regression trees are often used to model _______ data.
The leaf nodes of a model tree are
Logistic regression is a ________ regression technique that is used to model data having a_____outcome.
This technique associates a conditional probability value with each data instance.
This supervised learning technique can process both numeric and categorical input attributes.
With Bayes classifier, missing data items are
This clustering algorithm merges and splits nodes to help modify nonoptimal partitions.
This clustering algorithm initially assumes that each data instance represents a single cluster.
This unsupervised clustering algorithm terminates when mean values computed for the currentiteration of the algorithm are identical to the computed mean values for the previous iteration.
In reinforcement learning if feedback is negative one it is defined as____.
According to____ , it’s a key success factor for the survival and evolution of all species.
What is ‘Training set’?
Common deep learning applications include____
Reinforcement learning is particularly efficient when______________.
if there is only a discrete number of possible outcomes (called categories),the process becomes a______.
Which of the following are supervised learning applications
During the last few years, many ______ algorithms have been applied to deepneural networks to learn the best policy for playing Atari video games and to teach an agent how to associate the right action with an input representing the state.
_____is much more difficult because it's necessary to determine a supervised strategy to train a model for each feature and, finally, to predict their value
How it's possible to use a different placeholder through the parameter_______.
If you need a more powerful scaling feature, with a superior control on outliers and the possibility to select a quantile range, there's also the class________
scikit-learn also provides a class for per-sample normalization, Normalizer. It can apply________to each element of a dataset
There are also many univariate methods that can be used in order to select the best features according to specific criteria based on________.
____performs a PCA with non-linearly separable data sets.
The parameter______ allows specifying the percentage of elements to put into the test/training set
In many classification problems, the target ______ is made up of categorical labels which cannot immediately be processed by any algorithm.
If Linear regression model perfectly first i.e., train error is zero, then _____________________
In syntax of linear model lm(formula,data,..), data refers to ______
Which of the following methods do we use to find the best fit line for data in Linear Regression?
Which of the following is true about Residuals ?
Naive Bayes classifiers are a collection ------------------of algorithms
Naive Bayes classifiers is _______________ Learning
Features being classified is __________ of each other in Naïve Bayes Classifier
Bernoulli Naïve Bayes Classifier is ___________distribution
Multinomial Naïve Bayes Classifier is ___________distribution
Gaussian distribution when plotted, gives a bell shaped curve which is symmetric about the _______ of the feature values.
SVM is a ------------------ algorithm
SVM is a ------------------ learning
Even if there are no actual supervisors ________ learning is also based on feedback provided by the environment
When it is necessary to allow the model to develop a generalization ability and avoid a common problem called______.
Techniques involve the usage of both labeled and unlabeled data is called___.
A supervised scenario is characterized by the concept of a _____.