DA0-001 CompTIA Data+ Certification Exam Questions and Answers

Questions 4

What category of data stewardship work is focused on ensuring that the organization respects the wishes of data subjects?

Options:

A.

Data quality.

B.

Data privacy.

C.

Data security.

D.

Regulatory compliance.

Questions 5

A data analyst received the information in the table below from a recently completed marketing campaign:

Which of the following is the total order conversion rate?

Options:

A.

13.2%

B.

14.8%

C.

22.3%

D.

85.2%

Questions 6

After a merger, an analyst needs to enhance a very complicated quarterly report so that it is more user friendly for new team members. Which of the following elements would help reduce questions?

Options:

A.

Version details

B.

Appendix

C.

Reference data sources

D.

FAQs

Questions 7

Which of the following can be used to translate data into another form so it can only be read by a user who has a key or a password?

Options:

A.

Data encryption.

B.

Data transmission.

C.

Data protection.

D.

Data masking.

Questions 8

Which of the following variable name formats would be problematic if used in the majority of data software programs?

Options:

A.

First_Name_

B.

FirstName

C.

First_Name

D.

First Name
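
As a rough illustration of why embedded spaces cause trouble, the sketch below uses Python's built-in `str.isidentifier()` to check whether each candidate is usable as a variable name in Python. Other data tools apply their own naming rules, so this is only one example of the general pattern.

```python
# Candidate names taken from the options above.
candidates = ["First_Name_", "FirstName", "First_Name", "First Name"]

# str.isidentifier() is True when the string is a legal Python identifier.
validity = {name: name.isidentifier() for name in candidates}

for name, ok in validity.items():
    status = "valid" if ok else "not valid"
    print(f"{name!r}: {status} as a Python identifier")
```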

Questions 9

Joseph is interpreting a left-skewed distribution of test scores. Joe scored at the mean, Alfonso scored at the median, and Gaby scored at the end of the tail.

Who had the highest score?

Options:

A.

Joseph

B.

Joe

C.

Alfonso

D.

Gaby
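
In a left-skewed distribution the tail extends toward the low values, which pulls the mean below the median. The sketch below shows that ordering with made-up scores; the numbers are hypothetical and do not come from the exam.

```python
import statistics

# Hypothetical left-skewed scores: most are high, a few low scores form the tail.
scores = [20, 60, 80, 85, 90, 92, 95]

mean_score = statistics.mean(scores)      # pulled down by the low tail
median_score = statistics.median(scores)  # middle value, resistant to the tail
tail_score = min(scores)                  # the end of the left tail

print(f"tail={tail_score}, mean={mean_score:.2f}, median={median_score}")
```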

Questions 10

An analyst is reporting on the average income for a county and is reviewing the following data:

Which of the following is the reason the analyst would need to cleanse the data in this data set?

Options:

A.

Data completeness

B.

Data outliers

C.

Duplicate data

D.

Missing values

Questions 11

Which of the following statements would be used to append two tables that have the same number of columns?

Options:

A.

UNION ALL

B.

MERGE

C.

GROUP BY

D.

JOIN
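
To make the idea of appending concrete, here is a minimal sketch using Python's standard `sqlite3` module. The table names `sales_q1` and `sales_q2` and their contents are assumptions made for illustration. It stacks the rows of two tables with matching columns and contrasts `UNION ALL`, which keeps every row, with `UNION`, which drops duplicates.

```python
import sqlite3

# In-memory database with two hypothetical tables that share the same columns.
conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE sales_q1 (customer TEXT, amount REAL);
    CREATE TABLE sales_q2 (customer TEXT, amount REAL);
    INSERT INTO sales_q1 VALUES ('Ana', 10.0), ('Ben', 20.0);
    INSERT INTO sales_q2 VALUES ('Ben', 20.0), ('Cy', 30.0);
    """
)

# UNION ALL stacks the rows of both tables and keeps duplicates.
appended = conn.execute(
    "SELECT customer, amount FROM sales_q1 "
    "UNION ALL "
    "SELECT customer, amount FROM sales_q2"
).fetchall()

# UNION also stacks the rows, but removes exact duplicates.
deduplicated = conn.execute(
    "SELECT customer, amount FROM sales_q1 "
    "UNION "
    "SELECT customer, amount FROM sales_q2"
).fetchall()

print(len(appended), len(deduplicated))
```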

Questions 12

A data analyst has a set of data that shows the number of gallons of oil produced each day. The company would like to know the standard deviation for the data set. The variance for the data is 36 gallons. Which of the following is the standard deviation for gallons produced?

Options:

A.

1.16

B.

6

C.

36

D.

72
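
Standard deviation is the square root of the variance, so the calculation reduces to a single square root. A minimal sketch using the variance quoted in the question:

```python
import math

variance = 36.0  # variance quoted in the question
std_dev = math.sqrt(variance)  # standard deviation is the square root of the variance

print(std_dev)
```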

Questions 13

A quality assurance manager is examining tolerances in Internet of Things sensors. Which of the following is the best measure for the manager to calculate?

Options:

A.

Standard deviation

B.

Quartile range

C.

Median

D.

Mean

Questions 14

A cereal manufacturer wants to determine whether the sugar content of its cereal has increased over the years. Which of the following is the appropriate descriptive statistic to use?

Options:

A.

Frequency

B.

Percent change

C.

Variance

D.

Mean

Questions 15

Given the table below:

Which of the following boxes indicates that a Type II error has occurred?

Options:

A.

1

B.

2

C.

3

D.

4

Questions 16

While reviewing survey data, a research analyst notices data is missing from all the responses to a single question. Which of the following methods would BEST address this issue?

Options:

A.

Replace missing data.

B.

Remove duplicate data.

C.

Replace redundant data.

D.

Remove invalid data.

Questions 17

A collections manager has a team calling customers who are past due on their accounts in an attempt to collect payments. The manager receives the call list in the form of a printed report that is generated by the accounting department at the beginning of each week. Consequently, the collections team calls some customers who have made payments in the time since the report was last printed. Which of the following reporting enhancements could the accounting department implement to best reduce the number of calls on current accounts?

Options:

A.

Modify the date range on the report

B.

Include a time stamp on the report.

C.

Increase the frequency of report generation.

D.

Add a report run date to the report.

Questions 18

Amanda needs to create a dashboard that will draw information from many other data sources and present it to business leaders.

Which one of the following tools is least likely to meet her needs?

Options:

A.

QuickSight.

B.

Tableau.

C.

Power BI.

D.

SPSS Modeler.

Questions 19

An analyst is compiling a series of reports for the new executive board to review. Which of the following elements provides a snapshot of what is contained in the reports for the executives who do not have time to focus on the details?

Options:

A.

Tables

B.

Reference data sources

C.

Observations and insights

D.

Instruction page

Questions 20

You are working with a professional statistician to perform an analysis and would like to use a statistics package.

Which one of the following would be the most appropriate?

Options:

A.

Rapid Miner.

B.

QLIK.

C.

Power BI.

D.

Minitab.

Questions 21

You should always choose the analytics tool that is most appropriate for any given situation, even if that means acquiring a new tool.

Options:

A.

True.

B.

False.

Questions 22

An analyst wants to test the association between the number of doors in a car and the number of gears in the car. Which of the following is the best test to use?

Options:

A.

F-test

B.

Acceptance test

C.

Chi-squared test

D.

Z-test

Questions 23

A customer's telephone number is in the format 123-456-7890. Which of the following data types is used for the phone number?

Options:

A.

Boolean

B.

Date

C.

Text

D.

Number

Questions 24

An analyst needs to know what data an organization possesses. Which of the following is the best document for the analyst to consult?

Options:

A.

Data destruction policy

B.

Data use document

C.

Data dictionary

D.

Data retention policy

Questions 25

A data analyst has received a data set that contains actual and projected sales for the fourth quarter of 2019. Which of the following statistical methods should the analyst use to find the measure of dispersion?

Options:

A.

Mean

B.

Variance

C.

Correlation

D.

Confidence interval

Questions 26

Which of the following types of analyses is best to use when tracking sales revenue against quarterly targets?

Options:

A.

Trend

B.

Performance

C.

Link

D.

Scope

Questions 27

Which of the following describes the method of sampling in which elements of data are selected randomly from each of the small subgroups within a population?

Options:

A.

Simple random

B.

Cluster

C.

Systematic

D.

Stratified

Questions 28

Which of the following concepts should be applied if a data set with 40 fields needs to be pared down to 20 fields and contains similar data across multiple fields?

Options:

A.

Duplication

B.

Consolidation

C.

Compliance

D.

Standardization

Questions 29

An analyst wants to create a historical data set for the past five years with each year in its own data set. Which of the following methods is the best way to create this historical data set?

Options:

A.

Data transpose

B.

Data concatenation

C.

Data append

D.

Data normalization

Questions 30

Different people manually type a series of handwritten surveys into an online database. Which of the following issues will MOST likely arise with this data? (Choose two.)

Options:

A.

Data accuracy

B.

Data constraints

C.

Data attribute limitations

D.

Data bias

E.

Data consistency

F.

Data manipulation

Questions 31

Daniel is using the Structured Query Language (SQL) to work with data stored in a relational database.

He would like to add several new rows to a database table.

What command should he use?

Options:

A.

SELECT.

B.

ALTER.

C.

INSERT.

D.

UPDATE.

Questions 32

Given the following data:

Which of the following BEST describes the data set?

Options:

A.

There is data bias.

B.

The data is incomplete.

C.

The data is inconsistent.

D.

The data has outliers.

Questions 33

Given the customer table below:

Which of the following chart types is the most appropriate to represent the average spending of active customers vs. inactive customers?

Options:

A.

Pie chart

B.

Heat graph

C.

Scatter plot

D.

Line chart

Questions 34

A data analyst is working for a shipping company and calculating the volume of boxes according to the following formula:

volume = height × width × depth.

Which of the following variable types describes volume?

Options:

A.

Derived

B.

Normalized

C.

Concatenated

D.

Aggregated
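
The formula above computes a new field from fields that already exist on each record. A minimal sketch of that calculation, with hypothetical box dimensions chosen only for illustration:

```python
# Hypothetical box records; the dimensions are illustrative, not from the exam.
boxes = [
    {"height": 2.0, "width": 3.0, "depth": 4.0},
    {"height": 1.5, "width": 2.0, "depth": 2.0},
]

# volume is not stored in the source data; it is computed from existing fields.
for box in boxes:
    box["volume"] = box["height"] * box["width"] * box["depth"]

print([box["volume"] for box in boxes])
```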

Questions 35

A data analyst needs to create a master file that includes customer information from the tables below:

Given the three tables above, the analyst wants to filter down the information prior to joining it together. In which of the following orders should this data manipulation be approached for the most efficient result?

Options:

A.

Merge, append, deduplicate

B.

Merge, deduplicate, append

C.

Deduplicate, append, merge

D.

Append, deduplicate, merge

Questions 36

What R package makes it easy to work with dates?

Options:

A.

Lubridate.

B.

Datemath.

C.

Stringr.

D.

ggplot.

Questions 37

Which of the following best describe qualitative data? (Select two).

Options:

A.

Discrete

B.

Ordinal

C.

Batch

D.

Continuous

E.

Nominal

F.

Real-time

Questions 38

A Chief Executive Officer (CEO) is requesting more up-to-date sales data for improved visibility prior to month-end. An analyst must determine the frequency of a sales report that was previously distributed on an as-needed basis. Which of the following would be the most appropriate frequency for this report?

Options:

A.

Monthly

B.

Quarterly

C.

Weekly

D.

Every other month

Questions 39

Which of the following are reasons to create and maintain a data dictionary? (Choose two.)

Options:

A.

To improve data acquisition

B.

To remember specifics about data fields

C.

To specify user groups for databases

D.

To provide continuity through personnel turnover

E.

To confine breaches of PHI data

F.

To reduce processing power requirements

Questions 40

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

Options:

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Questions 41

Jhon is working on an ELT process that sources data from six different source systems.

Looking at the source data, he finds that data about the same people exists in two of the six systems.

What does he have to make sure he checks for in his ELT process?

Choose the best answer.

Options:

A.

Duplicate Data.

B.

Redundant Data.

C.

Invalid Data.

D.

Missing Data.

Questions 42

A user imports a data file into the accounts payable system each day. On a regular basis, the field input is not what the system is expecting, so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts, though. Which of the following changes should be made to this process to reduce the number of errors?

Options:

A.

Delete all incorrect inputs and upload the corrected file.

B.

Have the user manually review the file for data completeness before loading it

C.

Create a validator that checks each data field against its expected data type, and run the file through it prior to import.

D.

Spot-check the file prior to import to catch and correct field errors.
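
For readers unfamiliar with automated field validation, the sketch below shows one way to check every field of every row against an expected type before an import runs. The `SCHEMA` mapping, the field names, and the sample rows are all assumptions made for illustration; a real accounts payable system would define its own.

```python
from datetime import datetime

# Hypothetical schema: field name -> parser that raises ValueError on bad input.
SCHEMA = {
    "invoice_id": int,
    "amount": float,
    "due_date": lambda text: datetime.strptime(text, "%Y-%m-%d"),
}


def validate_rows(rows):
    """Return (row_number, field, value) for every field that fails its type check."""
    errors = []
    for row_number, row in enumerate(rows, start=1):
        for field, parse in SCHEMA.items():
            value = row.get(field, "")
            try:
                parse(value)
            except (ValueError, TypeError):
                errors.append((row_number, field, value))
    return errors


# Illustrative input: the second row has a letter O in the ID, a non-numeric
# amount, and a calendar date that does not exist.
rows = [
    {"invoice_id": "1001", "amount": "250.00", "due_date": "2024-01-31"},
    {"invoice_id": "10O2", "amount": "abc", "due_date": "2024-02-30"},
]

errors = validate_rows(rows)
for row_number, field, value in errors:
    print(f"row {row_number}: field {field!r} has invalid value {value!r}")
```

Reporting every failure in a single pass, rather than stopping at the first one, addresses the case in the scenario where the import breaks again on subsequent attempts.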

Questions 43

A data analyst for a media company needs to determine the most popular movie genre. Given the table below:

Which of the following must be done to the Genre column before this task can be completed?

Options:

A.

Append

B.

Merge

C.

Concatenate

D.

Delimit

Questions 44

Which of the following is the best description of discrete data types?

Options:

A.

Non-numeric data used to describe attributes of a population sample

B.

The frequency of the number of times each value occurs by using whole numbers

C.

Numeric values that can be measured on a continuous scale

D.

Non-numeric data used to describe attributes of a population sample ranked in a specific order

Questions 45

Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?

Options:

A.

SAS

B.

Microsoft Power BI

C.

IBM SPSS

D.

Python

Questions 46

Which of the following database schemas features normalized dimension tables?

Options:

A.

Flat

B.

Snowflake

C.

Hierarchical

D.

Star

Questions 47

A data analyst is attempting to understand how ice cream consumption is affected by different attributes, such as cost, temperature, and income level. Which of the following regression analyses should the data analyst perform to understand this relationship?

Options:

A.

Logistic

B.

Ordinary least squares

C.

Cox

D.

Polynomial

Questions 48

Given the following table:

Date of visit    Age    Gender
6/1/22           30     Male
6/15/22          65F    Fem.
6/19/2022        24     M

Which of the following describes the data quality issues with the age data?

Options:

A.

Completeness

B.

Consistency

C.

Accuracy

D.

Manipulation

Questions 49

Consider two different datasets, one with gas prices and the other with food prices. Which of the following measures is most affected by outliers?

Options:

A.

Absolute value

B.

Mode

C.

Median

D.

Mean
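
How sensitive a measure is to outliers can be checked directly by adding one extreme value and watching how far each statistic moves. The prices below are hypothetical and serve only to illustrate the comparison.

```python
import statistics

# Hypothetical gas prices, then the same list with one extreme value added.
prices = [3.10, 3.20, 3.25, 3.30, 3.40]
with_outlier = prices + [30.00]

# How far each statistic moves once the outlier is included.
mean_shift = statistics.mean(with_outlier) - statistics.mean(prices)
median_shift = statistics.median(with_outlier) - statistics.median(prices)

print(f"mean moved by {mean_shift:.3f}, median moved by {median_shift:.3f}")
```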

Questions 50

You have two database tables that you would like to join together using a foreign key relationship.

What term best describes this action?

Options:

A.

Blending.

B.

Appending.

C.

Mixing.

D.

Merging.

Questions 51

Which of the following is an example of a data-mining ETL tool?

Options:

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Questions 52

What analytics suite is offered by Microsoft and directly integrates with SQL Server Databases?

Options:

A.

Qlik.

B.

Power BI.

C.

Domo.

D.

Dataroma.

Questions 53

A data analyst is helping a retail store categorize its customers into five different groups based on the following information:

• How recently the customers made purchases

• How frequently the customers made purchases

• How much the customers spent

Given the following information:

Which of the following would be most important for the analysis?

Options:

A.

Customer_ID, Channel, Order_Date

B.

Customer_ID, Territory, Amount

C.

Customer_ID, Order_Date, Amount

D.

Customer_ID, Quantity, Amount

Questions 54

Which of the following is the best technique for transferring data from one database to another with some data manipulation?

Options:

A.

Application programming interfaces

B.

Delta load

C.

Extract, transform, load

D.

Export/import

Questions 55

A business intelligence team wants to create a new dashboard in order to solve a problem statement. Which of the following is the correct order of steps the team should take?

Options:

A.

Determine business needs, find data sources, validate the data, create a mock-up, and analyze the information.

B.

Find data sources, determine business needs, validate the data, create a mock-up, and analyze the information.

C.

Create a mock-up, validate the data, analyze the information, determine business needs, and find data sources.

D.

Validate the data, find data sources, analyze the information, and determine business needs.

Questions 56

A data analyst who works for a government agency is required to obtain the average income of citizens. The list of citizens is given in the following table:

A value for one citizen's income is missing. Which of the following approaches should the data analyst take to solve this issue?

Options:

A.

Replace the missing value with the average of the rest of the unemployed citizens.

B.

Insert the value 0 into the field with the missing value.

C.

Impute the mean of the other citizens' incomes into the field with the missing value.

D.

Exclude employed citizens from the analysis.

Questions 57

The senior management team at a company receives a detailed sales report at the end of each quarter. The report is several pages long and includes data from dozens of offices across the country. The team wants a better way to get a quick snapshot of what is included in the report. Which of the following modifications would best meet this requirement?

Options:

A.

Modifying documentation elements to include reference data sources

B.

Modifying the font size and style so important data points are more visible

C.

Modifying the report to include a summary section with observations and insights

D.

Modifying the report layout so it is easier to follow and understand

Questions 58

A data analyst needs to perform a full outer join of a customer's orders using the tables below:

Which of the following is the mean of the order quantity?

Options:

A.

73.5

B.

76.5

C.

78.8

D.

81.5

Questions 59

Which of the following data governance concepts fits into the security requirements category?

Options:

A.

Data transmission

B.

Data deletion

C.

Data use agreements

D.

Personally identifiable information

Questions 60

Which of the following are the first steps a company should take after discovering a data breach? (Select two).

Options:

A.

Delete data.

B.

Notify affected users.

C.

Assess the breach.

D.

Back up the system.

E.

Issue a press release.

F.

Delay reporting.

Questions 61

A sales team wants visibility of current sales numbers, pipeline, and team performance. The team would also like to see calculations of individuals’ earned commissions and projected commissions based on sales, but they want that information to be kept confidential. Which of the following would be the BEST way to provide this visibility?

Options:

A.

Create a dashboard displaying a data refresh date so users know the current sales numbers and configure permissions to control access.

B.

Create a dashboard for sales numbers, pipeline, and team and individual performance for the management team.

C.

Create a dashboard with filters for the overall team, individuals, and management. Users can filter to see the data they want.

D.

Create a dashboard with views for team, individuals, and management. Configure permissions to control access.

Questions 62

Which of the following best describes an exploratory analysis?

Options:

A.

Involves the use of descriptive statistics to understand observations

B.

Involves analysis of exploring data sets for performance tracking

C.

Involves the testing of specific hypotheses

D.

Involves the use of arithmetic algebra to determine the distribution

Questions 63

An organizational document governs role-based and group-based requirements. Which of the following data requirements should be used?

Options:

A.

Security requirements

B.

Storage requirements

C.

Access requirements

D.

Use requirements

Questions 64

Which of the following is the best description of the term "data governance"?

Options:

A.

Data governance governs the development of a data visualization dashboard in an organization.

B.

Data governance is the policy that protects against data breaches by cybercriminals.

C.

Data governance is the process of analyzing, manipulating, and reporting data in an organization.

D.

Data governance is the availability, usability, integrity, and security of data in an enterprise.

Questions 65

Given the following table:

Which of the following describes the data quality issues with the age data?

Options:

A.

Completeness

B.

Consistency

C.

Accuracy

D.

Manipulation

Questions 66

An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?

Options:

A.

Drop-down menu

B.

Date range

C.

Text field

D.

Frequency

Questions 67

The total values in this month's revenue report are twice as much as last month's. Which of the following most likely occurred during the ETL process?

Options:

A.

The data cleansing processes failed to execute.

B.

The database connectivity failed.

C.

The report included the previous month's data.

D.

The data normalization processes failed.

Questions 68

Which of the following descriptive statistical methods are measures of central tendency? (Choose two.)

Options:

A.

Mean

B.

Minimum

C.

Mode

D.

Variance

E.

Correlation

F.

Maximum

Questions 69

Which of the following differentiates a flat text file from other data types?

Options:

A.

Data is separated by a delimiter.

B.

Data is stored in defined rows.

C.

Data is defined with key-value pairs.

D.

Data is housed in a markup language.

Questions 70

A large data download was divided into two smaller files. Which of the following describes the best way to fix this issue?

Options:

A.

Blending the two data sets

B.

Appending the two data sets

C.

Merging the two data sets

D.

Aggregating the two data sets

Questions 71

A column is being used to store strings of variable lengths. Performance is a concern, so the column needs to use as little space as possible. Which of the following data types best meets these requirements?

Options:

A.

char

B.

nchar

C.

varchar

D.

nvarchar

Questions 72

During data profiling, an analyst decides to recode the status column in the following data set:

Which of the following data concerns explains why the analyst wants to take this action?

Options:

A.

Redundancy

B.

Duplication

C.

Invalidity

D.

Inconsistency

Questions 73

An analyst for a small business with multiple locations is using each location’s quarterly sales reports from last year to create a single revenue report for the year. Which of the following data mining techniques should the analyst use to complete this task?

Options:

A.

Data merge

B.

Data append

C.

Data blending

D.

Data imputation

Questions 74

Under which of the following circumstances should the null hypothesis be accepted when α = 0.05?

Options:

A.

When p is 0.00003

B.

When p is 0.001

C.

When p is 0.04

D.

When p is 0.06
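
The decision rule behind this question compares the p-value with the significance level. The sketch below applies the common convention of rejecting the null hypothesis when p is below the significance level; some texts use "less than or equal to" instead. Statisticians usually say "fail to reject" rather than "accept", and the code uses that wording.

```python
ALPHA = 0.05  # significance level from the question


def decision(p_value, alpha=ALPHA):
    """Reject H0 when the p-value is below alpha; otherwise fail to reject it."""
    return "reject H0" if p_value < alpha else "fail to reject H0"


# The four p-values listed in the options above.
decisions = {p: decision(p) for p in [0.00003, 0.001, 0.04, 0.06]}

for p, outcome in decisions.items():
    print(f"p = {p}: {outcome}")
```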

Questions 75

Which of the following is a relational database?

Options:

A.

SQL

B.

Excel

C.

JSON

D.

NoSQL

Questions 76

A table in a hospital database has a column for patient height in inches and a column for patient height in centimeters. This is an example of:

Options:

A.

dependent data.

B.

duplicate data.

C.

invalid data

D.

redundant data

Questions 77

Which of the following defines the policies and procedures for managing the master data?

Options:

A.

Data administration

B.

Data stewardship

C.

Data ownership

D.

Data governance

Questions 78

Which of the following is concatenate typically used to combine?

Options:

A.

Rows

B.

Columns

C.

Tables

D.

Databases

Questions 79

Which of the following terms best describes a situation in which a rating scale does not conform to previously agreed-upon requirements?

Options:

A.

Specification mismatch

B.

Incorrect sampling

C.

Data corruption

D.

Redundancy

Questions 80

An e-commerce company recently tested a new website layout. The website was tested by a test group of customers, and an old website was presented to a control group. The table below shows the percentage of users in each group who made purchases on the websites:

Which of the following conclusions is accurate at a 95% confidence interval?

Options:

A.

In Germany, the increase in conversion from the new layout was not significant.

B.

In France, the increase in conversion from the new layout was not significant.

C.

In general, users who visit the new website are more likely to make a purchase.

D.

The new layout has the lowest conversion rates in the United Kingdom.

Questions 81

An analyst is working with the income data of suburban families in the United States. The data set has a lot of outliers, and the analyst needs to provide a measure that represents the typical income. Which of the following would BEST fulfill the analyst’s goal?

Options:

A.

Median

B.

Mean

C.

Mode

D.

Standard deviation

Questions 82

Which of the following is the first step an analyst should perform upon receiving a business request for analysis?

Options:

A.

Determine the data needs and sources for analysis.

B.

Initiate the analysis for exploratory data analysis.

C.

Review the business questions to understand the scope.

D.

Finalize the methodology to solve the problem.

Questions 83

An analyst needs to conduct a quick analysis. Which of the following is the FIRST step the analyst should perform with the data?

Options:

A.

Conduct an exploratory analysis and use descriptive statistics.

B.

Conduct a trend analysis and use a scatter chart.

C.

Conduct a link analysis and illustrate the connection points.

D.

Conduct an initial analysis and use a Pareto chart.

Questions 84

A data analyst is creating a dashboard and trying to identify the type of information that should be included. Which of the following should the analyst consider first?

Options:

A.

Data refresh rate

B.

Consumer types

C.

Access permissions

D.

Data sources and attributes

Questions 85

Given the following customer and order tables:

Which of the following describes the number of rows and columns of data that would be present after performing an INNER JOIN of the tables?

Options:

A.

Five rows, eight columns

B.

Seven rows, eight columns

C.

Eight rows, seven columns

D.

Nine rows, five columns

Questions 86

Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5,000,000 rows?

Options:

A.

Microsoft Excel

B.

R

C.

Snowflake

D.

SQL

Questions 87

A data analyst has been asked to create a sales report that calculates the rolling 12-month average for sales. If the report will be published on November 1, 2020, which of the following date ranges should the report cover?

Options:

A.

October 1, 2019 to October 31, 2020

B.

October 31, 2020 to November 1, 2021

C.

November 1, 2019 to October 31, 2020

D.

October 31, 2019 to October 31, 2020
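
A rolling 12-month window can be computed from the publication date. The sketch below assumes the report covers the 12 complete calendar months immediately before publication and that the report is published on the first day of a month; the helper name `rolling_12_month_window` is hypothetical.

```python
from datetime import date, timedelta


def rolling_12_month_window(publish_date):
    """Return (start, end) of the 12 complete months before publish_date.

    Assumes publish_date falls on the first day of a month.
    """
    end = publish_date - timedelta(days=1)                    # last day of the prior month
    start = publish_date.replace(year=publish_date.year - 1)  # same day, one year earlier
    return start, end


start, end = rolling_12_month_window(date(2020, 11, 1))
print(start.isoformat(), "to", end.isoformat())
```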

Questions 88

Which of the following explains why standardization of data field names is important to master data management concepts?

Options:

A.

The quality of the data is consistent and improved.

B.

The data looks more appealing.

C.

The colors in data visualization are enhanced.

D.

The data is decompressed.

Questions 89

Given the following grocery store orders:

If a query is made to the table with the following logic:

Order_Total > 132 OR (Order_Total >= 25 AND Order_Total < 74)

Which of the following is the number of orders that will be returned by the query?

Options:

A.

Four

B.

Five

C.

Six

D.

Seven
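
The filter combines a strict comparison with a half-open range, so the boundary values matter. The sketch below applies the same logic to hypothetical order totals; the original table is not reproduced in this text, so the values are chosen only to exercise the boundaries at 25, 74, and 132.

```python
# Hypothetical order totals picked to exercise the boundaries; not the exam's table.
order_totals = [10, 25, 60, 74, 132, 133]


def matches(total):
    """Mirror of: Order_Total > 132 OR (Order_Total >= 25 AND Order_Total < 74)."""
    return total > 132 or (25 <= total < 74)


selected = [total for total in order_totals if matches(total)]
print(selected)
```

Here 25 is kept because the lower bound is inclusive, while 74 and 132 are dropped because those comparisons are strict.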

Questions 90

Which of the following is an example of structured data?

Options:

A.

A credit card number

B.

An email

C.

A photo

D.

Social media correspondence

Questions 91

An analyst wants to include a graph in a quarterly sales report that shows the comparison between two quantitative variables. Which of the following visual diagrams can the analyst use to most effectively represent this relationship?

Options:

A.

Bar graph

B.

Heat map

C.

Pie chart

D.

Histogram

Questions 92

A user receives a large custom report to track company sales across various date ranges. The user then completes a series of manual calculations for each date range. Which of the following should an analyst suggest so the user has a dynamic, seamless experience?

Options:

A.

Create multiple reports, one for each needed date range.

B.

Build calculations into the report so they are done automatically.

C.

Add macros to the report to speed up the filtering and calculations process.

D.

Create a dashboard with a date range picker and calculations built in.

Questions 93

A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:

Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?

Options:

A.

$640,900

B.

$690,000

C.

$705,200

D.

$702,500

Questions 94

Which of the following is a characteristic of a star schema?

Options:

A.

It has a tabular structure.

B.

It stores transactional data.

C.

It stores unstructured data.

D.

It has denormalized dimension tables.

Questions 95

A data analyst was asked to create a visual representation of sales for the first quarter of 2020. Which of the following visualizations should be used when a time element is present?

Options:

A.

A bubble chart

B.

A line chart

C.

A scatter plot

D.

An infographic

Questions 96

Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?

Options:

A.

Missing data

B.

Duplicate data

C.

Redundant data

D.

Invalid data

Questions 97

An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?

Options:

A.

A word cloud

B.

A histogram

C.

A pie chart

D.

A scatter plot

Questions 98

Which of the following is a characteristic of a relational database?

Options:

A.

It utilizes key-value pairs.

B.

It has undefined fields.

C.

It is structured in nature.

D.

It uses minimal memory.

Questions 99

Which of the following is a common data analytics tool that is also used as an interpreted, high-level, general-purpose programming language?

Options:

A.

SAS

B.

Microsoft Power BI

C.

IBM SPSS

D.

Python

Questions 100

A web developer wants to ensure that malicious users can't type SQL statements when they are asked for input, such as their username or user ID.

Which of the following query optimization techniques would effectively prevent SQL Injection attacks?

Options:

A.

Indexing.

B.

Subset of records.

C.

Temporary table in the query set.

D.

Parametrization.
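
To show what parametrization changes in practice, the sketch below runs the same lookup twice with Python's standard `sqlite3` module: once by concatenating user input into the SQL string, and once by passing it as a bound parameter through the `?` placeholder. The `users` table and the input string are assumptions made for illustration.

```python
import sqlite3

# In-memory database with a hypothetical users table.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (username TEXT, secret TEXT)")
conn.executemany(
    "INSERT INTO users VALUES (?, ?)",
    [("alice", "a1"), ("bob", "b2")],
)

# Input crafted so that naive concatenation turns the WHERE clause into a tautology.
malicious_input = "nobody' OR '1'='1"

# Unsafe: the input becomes part of the SQL text and changes the query's logic.
unsafe_sql = "SELECT username FROM users WHERE username = '" + malicious_input + "'"
unsafe_rows = conn.execute(unsafe_sql).fetchall()

# Parametrized: the input is bound as a value and compared literally.
safe_rows = conn.execute(
    "SELECT username FROM users WHERE username = ?",
    (malicious_input,),
).fetchall()

print(len(unsafe_rows), len(safe_rows))
```

With the bound parameter, the database treats the entire input as a single string value, so no user is matched.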

Questions 101

Which of the following BEST describes the issue in which character values are mixed with integer values in a data set column?

Options:

A.

Duplicate data

B.

Missing data

C.

Data outliers

D.

Invalid data type

Questions 102

An analyst needs to provide a chart to identify the composition between the categories of the survey response data set:

Which of the following charts would be BEST to use?

Options:

A.

Histogram

B.

Pie

C.

Line

D.

Scatter plot

E.

Waterfall

Questions 103

A sales manager wants quarterly sales reports broken down by unit and week. Which of the following data output lists includes the most necessary information?

Options:

A.

Order number, salesperson, date shipped, recipient address, and price

B.

Item name, salesperson, recipient address, shipping cost, and date shipped

C.

Item number, item name, salesperson, date sold, and price

D.

Item name, salesperson, price, shipping cost, and date shipped

Questions 104

Which one of the following is a common data warehouse schema?

Options:

A.

Snowflake.

B.

Square.

C.

Spiral.

D.

Sphere.

Questions 105

Angela is aggregating data from a CRM system with data from an employee system.

While performing an initial quality check, she realizes that her employee ID is not associated with her identifier in the CRM system.

What kind of issue is Angela facing?

Choose the best answer.

Options:

A.

ETL process.

B.

Record linkage.

C.

ELT process.

D.

System integration.

Questions 106

A sales manager requested a report that contains the first name, last name, and phone number of all of the company's customers and employees. The data engineer needs to return all the records from several tables, even duplicates. Which of the following is the best way to join the two tables?

Options:

A.

FULL OUTER JOIN

B.

FULL INNER JOIN

C.

LEFT OUTER JOIN

D.

CROSS JOIN

Questions 107

Which of the following occurs if a 90% confidence interval increases to 95%?

Options:

A.

The margin of error does not change.

B.

The interval remains the same.

C.

The interval becomes narrower.

D.

The margin of error doubles.

Questions 108

Given the table below:

Which of the following variable types BEST describes the “Year” column?

Options:

A.

Numeric

B.

Date

C.

Alphanumeric

D.

Text

Questions 109

An analyst compiled a high-level report that includes the following data points:

    Total dollars closed for the year

    Annual quota/goal

    Top 10 customers

    Average deal size

    Largest deals lost

Which of the following groups is the most likely audience for this report?

Options:

A.

External vendors

B.

General public

C.

Lower-level managers

D.

C-suite officers

Questions 110

An analyst is explaining the company’s financial systems and reporting tools to a new coworker. Which of the following data quality dimensions are the most important? (Select three).

Options:

A.

Data formatting

B.

Data accuracy

C.

Data maturity

D.

Data field

E.

Data completeness

F.

Data consistency

G.

Data diversity

Questions 111

A financial institution is reporting on sales performance to a company at the account level. Due to the sensitive nature of the government work the company deals with, some account information is not shown. Which of the following fields should be masked?

Options:

A.

Sales volume

B.

Start date

C.

Product name

D.

Customer name

Questions 112

A data analyst needs to create a dashboard to help identify trends in the data sets. Which of the following is an appropriate consideration for dashboard development?

Options:

A.

Data sources and attributes

B.

Frequently asked questions

C.

A report from the data source

D.

A comparison of data sets

Questions 113

A customer survey reveals 90% positive feedback. Which of the following statistical methods would be best to utilize to determine the reliability of a data set and predict how a larger sample of customers over the same time period might respond?

Options:

A.

Calculate a high variance on survey responses.

B.

Calculate the maximum range of the survey responses.

C.

Calculate a low standard deviation on survey responses.

D.

Remove any data more than 4 standard deviations from the mean.

Questions 114

A healthcare data analyst notices that the BloodPressure column in one data set contains several outliers that need to be replaced with meaningful values. Which of the following data manipulation techniques should the analyst use?

Options:

A.

Recode

B.

Impute

C.

Append

D.

Reduction
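
To make the scenario concrete, replacing implausible readings with a representative value can be sketched as below. The readings, the plausibility bounds, and the choice of the median as the replacement are all assumptions made for illustration; a real analysis would choose them with domain guidance.

```python
from statistics import median

# Hypothetical systolic readings; 0 and 999 stand in for outliers.
blood_pressure = [118, 122, 0, 130, 999, 125, 119]

# Assumed plausibility bounds, for illustration only.
LOW, HIGH = 50, 250

# Compute the replacement value from the readings that fall inside the bounds.
valid = [v for v in blood_pressure if LOW <= v <= HIGH]
fill_value = median(valid)

# Keep valid readings as they are and substitute the median for the rest.
imputed = [v if LOW <= v <= HIGH else fill_value for v in blood_pressure]

print(fill_value)
print(imputed)
```

The median is used here because extreme values do not pull it the way they pull the mean. Other replacement strategies are equally possible.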

Questions 115

An analyst has generated a report that includes the number of months in the first two quarters of 2019 when sales exceeded $50,000:

Which of the following functions did the analyst use to generate the data in the Sales_indicator column?

Options:

A.

Aggregate

B.

Logical

C.

Date

D.

Sort
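
The table for this question is not reproduced here, so the following is only a sketch of how an indicator column of this kind could be derived. The monthly figures are hypothetical, and a 1/0 flag is an assumed encoding for the Sales_indicator column.

```python
# Hypothetical monthly sales for the first two quarters of 2019.
monthly_sales = {
    "2019-01": 42_000,
    "2019-02": 51_500,
    "2019-03": 50_000,
    "2019-04": 63_250,
    "2019-05": 48_900,
    "2019-06": 57_000,
}

THRESHOLD = 50_000

# A true/false test per row, stored as 1 or 0 (assumed encoding).
sales_indicator = {
    month: int(amount > THRESHOLD) for month, amount in monthly_sales.items()
}

# Summing the flags counts the months in which sales exceeded the threshold.
months_over_threshold = sum(sales_indicator.values())

print(sales_indicator)
print(months_over_threshold)
```

March is flagged 0 in this sketch because sales of exactly $50,000 do not exceed the threshold.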

Questions 116

A military commander would like to see the health scorecards of the troops daily and filter them based on gender and rank. Considering this data is PHI, which of the following would be the best way for the commander to view the information?

Options:

A.

An emailed report

B.

A password-protected dashboard

C.

A daily printout of a report

D.

A cloud-hosted spreadsheet

Questions 117

An analyst is training a new coworker on the importance of data governance and is focusing on security requirements. Which of the following should the analyst include in the training?

(Select two).

Options:

A.

Data masking

B.

Data encryption

C.

Data parallelism

D.

Data inclusiveness

E.

Data exclusiveness

F.

Data openness

Questions 118

A database consists of one fact table that is composed of multiple dimensions. Each dimension is represented by a denormalized table. This structure is an example of a:

Options:

A.

Non-relational schema

B.

Galaxy schema

C.

Snowflake schema

D.

Star schema

Exam Code: DA0-001
Exam Name: CompTIA Data+ Certification Exam
Last Update: Jan 1, 2026
Questions: 396