Black Friday Special 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: clap70

AIP-210 CertNexus Certified Artificial Intelligence Practitioner (CAIP) Questions and Answers

Questions 4

A market research team has ratings from patients who have a chronic disease, on several functional, physical, emotional, and professional needs that stay unmet with the current therapy. The dataset also captures ratings on how the disease affects their day-to-day activities.

A pharmaceutical company is introducing a new therapy to cure the disease and would like to design their marketing campaign such that different groups of patients are targeted with different ads. These groups should ideally consist of patients with similar unmet needs.

Which of the following algorithms should the market research team use to obtain these groups of patients?

Options:

A.

k-means clustering

B.

k-nearest neighbors

C.

Logistic regression

D.

Naive-Bayes

Buy Now
Questions 5

Personal data should not be disclosed, made available, or otherwise used for purposes other than specified with which of the following exceptions? (Select two.)

Options:

A.

If it is for a good cause.

B.

If it was collected accidentally.

C.

If it was requested by the authority of law.

D.

If it was with consent of the person it is collected from.

E.

If the data is only collected once.

Buy Now
Questions 6

Which of the following is the correct definition of the quality criteria that describes completeness?

Options:

A.

The degree to which all required measures are known.

B.

The degree to which a set of measures are equivalent across systems.

C.

The degree to which a set of measures are specified using the same units of measure in all systems.

D.

The degree to which the measures conform to defined business rules or constraints.

Buy Now
Questions 7

You have a dataset with many features that you are using to classify a dependent variable. Because the sample size is small, you are worried about overfitting. Which algorithm is ideal to prevent overfitting?

Options:

A.

Decision tree

B.

Logistic regression

C.

Random forest

D.

XGBoost

Buy Now
Questions 8

A classifier has been implemented to predict whether or not someone has a specific type of disease. Considering that only 1% of the population in the dataset has this disease, which measures will work the BEST to evaluate this model?

Options:

A.

Mean squared error

B.

Precision and accuracy

C.

Precision and recall

D.

Recall and explained variance

Buy Now
Questions 9

For each of the last 10 years, your team has been collecting data from a group of subjects, including their age and numerous biomarkers collected from blood samples. You are tasked with creating a prediction model of age using the biomarkers as input. You start by performing a linear regression using all of the data over the 10-year period, with age as the dependent variable and the biomarkers as predictors.

Which assumption of linear regression is being violated?

Options:

A.

Equality of variance (Homoscedastidty)

B.

Independence

C.

Linearity

D.

Normality

Buy Now
Questions 10

Normalization is the transformation of features:

Options:

A.

By subtracting from the mean and dividing by the standard deviation.

B.

Into the normal distribution.

C.

So that they are on a similar scale.

D.

To different scales from each other.

Buy Now
Questions 11

Which two of the following criteria are essential for machine learning models to achieve before deployment? (Select two.)

Options:

A.

Complexity

B.

Data size

C.

Explainability

D.

Portability

E.

Scalability

Buy Now
Questions 12

Why do data skews happen in the ML pipeline?

Options:

A.

Test and evaluation data are designed incorrectly.

B.

There Is a mismatch between live input data and offline data.

C.

There is a mismatch between live output data and offline data.

D.

There is insufficient training data for evaluation.

Buy Now
Questions 13

Which of the following methods can be used to rebalance a dataset using the rebalance design pattern?

Options:

A.

Bagging

B.

Boosting

C.

Stacking

D.

Weighted class

Buy Now
Questions 14

Which of the following occurs when a data segment is collected in such a way that some members of the intended statistical population are less likely to be included than others?

Options:

A.

Algorithmic bias

B.

Sampling bias

C.

Stereotype bias

D.

Systematic value distortion

Buy Now
Questions 15

An AI system recommends New Year's resolutions. It has an ML pipeline without monitoring components. What retraining strategy would be BEST for this pipeline?

Options:

A.

Periodically before New Year's Day and after New Year's Day

B.

Periodically every year

C.

When concept drift is detected

D.

When data drift is detected

Buy Now
Questions 16

When should the model be retrained in the ML pipeline?

Options:

A.

A new monitoring component is added.

B.

Concept drift is detected in the pipeline.

C.

More data become available for the training phase.

D.

Some outliers are detected in live data.

Buy Now
Questions 17

Which of the following best describes distributed artificial intelligence?

Options:

A.

It does not require hyperparemeter tuning because the distributed nature accounts for the bias.

B.

It intelligently pre-distributes the weight of starting a neural network.

C.

It relies on a distributed system that performs robust computations across a network of unreliable nodes.

D.

It uses a centralized system to speak to decentralized nodes.

Buy Now
Questions 18

Which of the following can benefit from deploying a deep learning model as an embedded model on edge devices?

Options:

A.

A more complex model

B.

Guaranteed availability of enough space

C.

Increase in data bandwidth consumption

D.

Reduction in latency

Buy Now
Questions 19

Which of the following equations best represent an LI norm?

Options:

A.

|x| + |y|

B.

|x|+|y|^2

C.

|x|-|y|

D.

|x|^2+|y|^2

Buy Now
Questions 20

You are developing a prediction model. Your team indicates they need an algorithm that is fast and requires low memory and low processing power. Assuming the following algorithms have similar accuracy on your data, which is most likely to be an ideal choice for the job?

Options:

A.

Deep learning neural network

B.

Random forest

C.

Ridge regression

D.

Support-vector machine

Buy Now
Questions 21

Which three security measures could be applied in different ML workflow stages to defend them against malicious activities? (Select three.)

Options:

A.

Disable logging for model access.

B.

Launch ML Instances In a virtual private cloud (VPC).

C.

Monitor model degradation.

D.

Use data encryption.

E.

Use max privilege to control access to ML artifacts.

F.

Use Secrets Manager to protect credentials.

Buy Now
Questions 22

An AI practitioner incorporates risk considerations into a deployment plan and decides to log and store historical predictions for potential, future access requests.

Which ethical principle is this an example of?

Options:

A.

Fairness

B.

Privacy

C.

Safety

D.

Transparency

Buy Now
Questions 23

In general, models that perform their tasks:

Options:

A.

Less accurately are less robust against adversarial attacks.

B.

Less accurately are neither more nor less robust against adversarial attacks.

C.

More accurately are less robust against adversarial attacks.

D.

More accurately are neither more nor less robust against adversarial attacks.

Buy Now
Questions 24

Word Embedding describes a task in natural language processing (NLP) where:

Options:

A.

Words are converted into numerical vectors.

B.

Words are featurized by taking a histogram of letter counts.

C.

Words are featurized by taking a matrix of bigram counts.

D.

Words are grouped together into clusters and then represented by word cluster membership.

Buy Now
Questions 25

R-squared is a statistical measure that:

Options:

A.

Combines precision and recall of a classifier into a single metric by taking their harmonic mean.

B.

Expresses the extent to which two variables are linearly related.

C.

Is the proportion of the variance for a dependent variable thaf’ s explained by independent variables.

D.

Represents the extent to which two random variables vary together.

Buy Now
Questions 26

Which of the following sentences is true about model evaluation and model validation in ML pipelines?

Options:

A.

Model evaluation and validation are the same.

B.

Model evaluation is defined as an external component.

C.

Model validation is defined as a set of tasks to confirm the model performs as expected.

D.

Model validation occurs before model evaluation.

Buy Now
Questions 27

Which of the following principles supports building an ML system with a Privacy by Design methodology?

Options:

A.

Avoiding mechanisms to explain and justify automated decisions.

B.

Collecting and processing the largest amount of data possible.

C.

Understanding, documenting, and displaying data lineage.

D.

Utilizing quasi-identifiers and non-unique identifiers, alone or in combination.

Buy Now
Exam Code: AIP-210
Exam Name: CertNexus Certified Artificial Intelligence Practitioner (CAIP)
Last Update: Nov 24, 2024
Questions: 90
AIP-210 pdf

AIP-210 PDF

$25.5  $84.99
AIP-210 Engine

AIP-210 Testing Engine

$30  $99.99
AIP-210 PDF + Engine

AIP-210 PDF + Testing Engine

$40.5  $134.99