Big Halloween Sale 70% Discount Offer - Ends in 0d 00h 00m 00s - Coupon code: clap70

AIP-210 CertNexus Certified Artificial Intelligence Practitioner (CAIP) Questions and Answers

Questions 4

When working with textual data and trying to classify text into different languages, which approach to representing features makes the most sense?

Options:

A.

Bag of words model with TF-IDF

B.

Bag of bigrams (2 letter pairs)

C.

Word2Vec algorithm

D.

Clustering similar words and representing words by group membership

Buy Now
Questions 5

Which of the following is the correct definition of the quality criteria that describes completeness?

Options:

A.

The degree to which all required measures are known.

B.

The degree to which a set of measures are equivalent across systems.

C.

The degree to which a set of measures are specified using the same units of measure in all systems.

D.

The degree to which the measures conform to defined business rules or constraints.

Buy Now
Questions 6

Your dependent variable data is a proportion. The observed range of your data is 0.01 to 0.99. The instrument used to generate the dependent variable data is known to generate low quality data for values close to 0 and close to 1. A colleague suggests performing a logit-transformation on the data prior to performing a linear regression. Which of the following is a concern with this approach?

Definition of logit-transformation

If p is the proportion: logit(p)=log(p/(l-p))

Options:

A.

After logit-transformation, the data may violate the assumption of independence.

B.

Noisy data could become more influential in your model.

C.

The model will be more likely to violate the assumption of normality.

D.

Values near 0.5 before logit-transformation will be near 0 after.

Buy Now
Questions 7

Which of the following is the primary purpose of hyperparameter optimization?

Options:

A.

Controls the learning process of a given algorithm

B.

Makes models easier to explain to business stakeholders

C.

Improves model interpretability

D.

Increases recall over precision

Buy Now
Questions 8

Which of the following is the definition of accuracy?

Options:

A.

(True Positives + False Positives) / Total Predictions

B.

(True Positives + True Negatives) / Total Predictions

C.

True Positives / (True Positives + False Negatives)

D.

True Positives / (True Positives + False Positives)

Buy Now
Questions 9

Which two of the following criteria are essential for machine learning models to achieve before deployment? (Select two.)

Options:

A.

Complexity

B.

Data size

C.

Explainability

D.

Portability

E.

Scalability

Buy Now
Questions 10

You create a prediction model with 96% accuracy. While the model's true positive rate (TPR) is performing well at 99%, the true negative rate (TNR) is only 50%. Your supervisor tells you that the TNR needs to be higher, even if it decreases the TPR. Upon further inspection, you notice that the vast majority of your data is truly positive.

What method could help address your issue?

Options:

A.

Normalization

B.

Oversampling

C.

Principal components analysis

D.

Quality filtering

Buy Now
Questions 11

R-squared is a statistical measure that:

Options:

A.

Combines precision and recall of a classifier into a single metric by taking their harmonic mean.

B.

Expresses the extent to which two variables are linearly related.

C.

Is the proportion of the variance for a dependent variable thaf’ s explained by independent variables.

D.

Represents the extent to which two random variables vary together.

Buy Now
Questions 12

You are implementing a support-vector machine on your data, and a colleague suggests you use a polynomial kernel. In what situation might this help improve the prediction of your model?

Options:

A.

When it is necessary to save computational time.

B.

When the categories of the dependent variable are not linearly separable.

C.

When the distribution of the dependent variable is Gaussian.

D.

When there is high correlation among the features.

Buy Now
Questions 13

Which of the following is NOT an activation function?

Options:

A.

Additive

B.

Hyperbolic tangent

C.

ReLU

D.

Sigmoid

Buy Now
Questions 14

You have a dataset with thousands of features, all of which are categorical. Using these features as predictors, you are tasked with creating a prediction model to accurately predict the value of a continuous dependent variable. Which of the following would be appropriate algorithms to use? (Select two.)

Options:

A.

K-means

B.

K-nearest neighbors

C.

Lasso regression

D.

Logistic regression

E.

Ridge regression

Buy Now
Questions 15

In which of the following scenarios is lasso regression preferable over ridge regression?

Options:

A.

The number of features is much larger than the sample size.

B.

There are many features with no association with the dependent variable.

C.

There is high collinearity among some of the features associated with the dependent variable.

D.

The sample size is much larger than the number of features.

Buy Now
Questions 16

The graph is an elbow plot showing the inertia or within-cluster sum of squares on the y-axis and number of clusters (also called K) on the x-axis, denoting the change in inertia as the clusters change using k-means algorithm.

What would be an optimal value of K to ensure a good number of clusters?

Options:

A.

2

B.

3

C.

5

D.

9

Buy Now
Questions 17

Given a feature set with rows that contain missing continuous values, and assuming the data is normally distributed, what is the best way to fill in these missing features?

Options:

A.

Delete entire rows that contain any missing features.

B.

Fill in missing features with random values for that feature in the training set.

C.

Fill in missing features with the average of observed values for that feature in the entire dataset.

D.

Delete entire columns that contain any missing features.

Buy Now
Questions 18

A product manager is designing an Artificial Intelligence (AI) solution and wants to do so responsibly, evaluating both positive and negative outcomes.

The team creates a shared taxonomy of potential negative impacts and conducts an assessment along vectors such as severity, impact, frequency, and likelihood.

Which modeling technique does this team use?

Options:

A.

Business

B.

Harms

C.

Process

D.

Threat

Buy Now
Questions 19

Which of the following tests should be performed at the production level before deploying a newly retrained model?

Options:

A.

A/Btest

B.

Performance test

C.

Security test

D.

Unit test

Buy Now
Questions 20

Which two of the following statements about the beta value in an A/B test are accurate? (Select two.)

Options:

A.

The Beta value is the rate of type II errors for the test.

B.

The Beta value is the rate of type I errors for the test.

C.

The statistical power of a test is the inverse of the Beta value, or 1 - Beta.

D.

The Beta in an Alpha/Beta test represents one of the two variants of the A/B test.

Buy Now
Questions 21

What is the open framework designed to help detect, respond to, and remediate threats in ML systems?

Options:

A.

Adversarial ML Threat Matrix

B.

MITRE ATTandCK® Matrix

C.

OWASP Threat and Safeguard Matrix

D.

Threat Susceptibility Matrix

Buy Now
Questions 22

Which of the following approaches is best if a limited portion of your training data is labeled?

Options:

A.

Dimensionality reduction

B.

Probabilistic clustering

C.

Reinforcement learning

D.

Semi-supervised learning

Buy Now
Questions 23

Which two encodes can be used to transform categories data into numerical features? (Select two.)

Options:

A.

Count Encoder

B.

Log Encoder

C.

Mean Encoder

D.

Median Encoder

E.

One-Hot Encoder

Buy Now
Questions 24

Workflow design patterns for the machine learning pipelines:

Options:

A.

Aim to explain how the machine learning model works.

B.

Represent a pipeline with directed acyclic graph (DAG).

C.

Seek to simplify the management of machine learning features.

D.

Separate inputs from features.

Buy Now
Questions 25

Which of the following regressions will help when there is the existence of near-linear relationships among the independent variables (collinearity)?

Options:

A.

Clustering

B.

Linear regression

C.

Polynomial regression

D.

Ridge regression

Buy Now
Questions 26

A change in the relationship between the target variable and input features is

Options:

A.

concept drift.

B.

covariate shift.

C.

data drift.

D.

model decay.

Buy Now
Questions 27

Which of the following sentences is TRUE about the definition of cloud models for machine learning pipelines?

Options:

A.

Data as a Service (DaaS) can host the databases providing backups, clustering, and high availability.

B.

Infrastructure as a Service (IaaS) can provide CPU, memory, disk, network and GPU.

C.

Platform as a Service (PaaS) can provide some services within an application such as payment applications to create efficient results.

D.

Software as a Service (SaaS) can provide AI practitioner data science services such as Jupyter notebooks.

Buy Now
Exam Code: AIP-210
Exam Name: CertNexus Certified Artificial Intelligence Practitioner (CAIP)
Last Update: Oct 26, 2025
Questions: 92
AIP-210 pdf

AIP-210 PDF

$25.5  $84.99
AIP-210 Engine

AIP-210 Testing Engine

$30  $99.99
AIP-210 PDF + Engine

AIP-210 PDF + Testing Engine

$40.5  $134.99