DA0-001 CompTIA Data+ Certification Exam Questions and Answers

Questions 4

An analyst is working with a data set that lists individuals' first and last names in separate columns. Which of the following processes should the analyst use to combine the first and last names into a single spreadsheet cell?

Options:

A.

Transpose

B.

Blend

C.

Concatenate

D.

Merges
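
As a rough illustration of what combining two name columns into one value looks like, here is a minimal sketch. The sample names and the helper `full_name` are hypothetical and not part of the question.

```python
def full_name(first: str, last: str) -> str:
    """Concatenate two name fields with a single space between them."""
    return f"{first.strip()} {last.strip()}"


# Hypothetical sample rows: (first name, last name)
rows = [("Ada", "Lovelace"), ("Grace", "Hopper")]
combined = [full_name(first, last) for first, last in rows]
print(combined)
```

Stripping whitespace before joining is a design choice here, so that stray spaces in either column do not end up inside the combined cell.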

Questions 5

An analyst has generated a report that includes the number of months in the first two quarters of 2019 when sales exceeded $50,000:

Which of the following functions did the analyst use to generate the data in the Sales_indicator column?

Options:

A.

Aggregate

B.

Logical

C.

Date

D.

Sort

Questions 6

Which of the following is a control measure for preventing a data breach?

Options:

A.

Data transmission

B.

Data attribution

C.

Data retention

D.

Data encryption

Questions 7

The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?

Options:

A.

Q2 2020 and Q4 2019

B.

YTD 2020 and YTD 2019

C.

Q2 2020 and Q2 2019

D.

Q2 2020 and Q2 2021

Questions 8

Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?

Options:

A.

Data owner.

B.

Data steward.

C.

Data custodian.

D.

Data processor.

Questions 9

A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:

Which of the following types of functions would be the most appropriate to use?

Options:

A.

Statistical

B.

Aggregate

C.

Logical

D.

Mathematical

Questions 10

Which of the following is the correct extension for a tab-delimited spreadsheet file?

Options:

A.

tap

B.

tar

C.

tsv

D.

az
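
For context, tab-delimited text separates fields with a tab character rather than a comma. The sketch below reads such content with Python's standard `csv` module; the sample data is hypothetical.

```python
import csv
import io

# Hypothetical tab-delimited content, as it might appear in a file on disk.
raw = "name\tcity\nAda\tLondon\nGrace\tNew York\n"

# csv.reader handles tab-separated text when delimiter="\t" is passed.
rows = list(csv.reader(io.StringIO(raw), delimiter="\t"))
print(rows)
```

Because the delimiter is a tab, a value such as "New York" that contains a space stays in a single field.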

Questions 11

Which of the following explains why standardization of data field names is important to master data management concepts?

Options:

A.

The quality of the data is consistent and improved.

B.

The data looks more appealing.

C.

The colors in data visualization are enhanced.

D.

The data is decompressed.

Questions 12

Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?

Options:

A.

Discrete

B.

Numerical

C.

Alphanumeric

D.

Categorical

Questions 13

A data analyst has received a data set that contains actual and projected sales for the fourth quarter of 2019. Which of the following statistical methods should the analyst use to find the measure of dispersion?

Options:

A.

Mean

B.

Variance

C.

Correlation

D.

Confidence interval
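
To make the distinction between central tendency and dispersion concrete, the sketch below computes a mean and a variance with Python's `statistics` module. The sales figures are hypothetical and do not come from the question's data set.

```python
import statistics

# Hypothetical quarterly sales figures (in thousands).
sales = [48, 52, 50, 55, 45]

mean = statistics.mean(sales)            # central tendency, not dispersion
sample_var = statistics.variance(sales)  # sample variance (n - 1 denominator)
pop_var = statistics.pvariance(sales)    # population variance (n denominator)
print(mean, sample_var, pop_var)
```

The mean says where the values center; the variance says how widely they spread around that center.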

Questions 14

Which of the following is an example of a discrete data type?

Options:

A.

8in (20cm)

B.

5 kids

C.

2.5mi (4km)

D.

10.7lbs (4.9kg)

Questions 15

A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:

Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?

Options:

A.

$640,900

B.

$690,000

C.

$705,200

D.

$702,500

Questions 16

Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.

Which of the following systems is the most appropriate?

Options:

A.

OLTP.

B.

OLAP.

C.

Data warehouse.

D.

Data mart.

Questions 17

Which of the following is a non-relational database?

Options:

A.

Neo4j

B.

SQLite

C.

MySQL

D.

PostgreSQL

Questions 18

A user imports a data file into the accounts payable system each day. On a regular basis, the field input is not what the system is expecting, so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts, though. Which of the following changes should be made to this process to reduce the number of errors?

Options:

A.

Delete all incorrect inputs and upload the corrected file.

B.

Have the user manually review the file for data completeness before loading it.

C.

Create a data field to data type validator to run the file through prior to import.

D.

Spot-check the file prior to import to catch and correct field errors.

Questions 19

A customer's telephone number is in the format 123-456-7890. Which of the following data types is used for the phone number?

Options:

A.

Boolean

B.

Date

C.

Text

D.

Number

Questions 20

Q3 2020 has just ended, and now a data analyst needs to create an ad-hoc sales report that demonstrates how well the Q3 2020 promotion went versus last year's Q3 promotion.

Which of the following date parameters should the analyst use?

Options:

A.

2019 vs. YTD 2020

B.

Q3 2019 vs. Q3 2020

C.

YTD 2019 vs. YTD 2020

D.

Q4 2019 vs. Q3 2020

Questions 21

An analyst is preparing a report that contains weather data. The temperatures are shown in Fahrenheit, but they must be reported in Celsius. Which of the following should the analyst do to fix this issue?

Options:

A.

Normalize the data.

B.

Standardize the data.

C.

Rescale the data.

D.

Aggregate the data.
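
The unit conversion described in the question can be sketched as follows, using the standard formula C = (F - 32) * 5 / 9. The sample readings and the function name are hypothetical.

```python
def fahrenheit_to_celsius(temp_f: float) -> float:
    """Convert a Fahrenheit reading to Celsius: C = (F - 32) * 5 / 9."""
    return (temp_f - 32) * 5 / 9


readings_f = [32.0, 212.0, 98.6]  # hypothetical sample readings
readings_c = [round(fahrenheit_to_celsius(f), 1) for f in readings_f]
print(readings_c)
```

Every value is moved onto a different scale by the same linear rule, so the ordering and relative spacing of the readings are preserved.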

Questions 22

Which of the following is most likely to be used as a data-mining ETL tool?

Options:

A.

SSIS

B.

Stata

C.

SPSS

D.

Cognos

Questions 23

A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?

Options:

A.

Data content

B.

Frequency

C.

Filtering

D.

Views

Questions 24

Which of the following types of analyses should be used to evaluate the connections and anomalies in a data set when either known patterns are being violated or new patterns are emerging?

Options:

A.

Correlation

B.

Descriptive

C.

Graph

D.

Regression

Questions 25

A client wants a new report that will be automatically emailed to all global sales teams on a weekly basis. Each sales team must be able to view the sales for its region and the combined sales for all regions. Which of the following would be the most efficient method for meeting the requirements?

Options:

A.

Creating a single report with a region filter

B.

Creating report distribution lists for the sales teams in each region

C.

Creating a unique copy of the report for each sales team region

D.

Creating a unique copy of the report for each recipient

Questions 26

Which of the following is a non-parametric test?

Options:

A.

One-sample t-test

B.

Two-way ANOVA

C.

Correlation coefficient

D.

Spearman's rank correlation

Questions 27

Which of the following database schemas features normalized dimension tables?

Options:

A.

Flat

B.

Snowflake

C.

Hierarchical

D.

Star

Questions 28

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A self-service dashboard that allows the manager to look at the company’s annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Questions 29

An analyst conducted a preliminary analysis for a data set and identified several patterns and anomalies. Which of the following analysis techniques did the analyst use?

Options:

A.

Performance analysis

B.

Exploratory analysis

C.

Link analysis

D.

Trend analysis

Questions 30

Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?

Options:

A.

Rephrase the business requirement.

B.

Determine the data necessary for the analysis.

C.

Build a mock dashboard/presentation layout.

D.

Perform exploratory data analysis.

Questions 31

A company’s marketing department wants to do a promotional campaign next month. A data analyst on the team has been asked to perform customer segmentation, looking at how recently a customer bought the product, at what frequency, and at what value. Which of the following types of analysis would this practice be considered?

Options:

A.

Prescriptive

B.

Trend

C.

Gap

D.

Cluster

Questions 32

A data analyst for a media company needs to determine the most popular movie genre. Given the table below:

Which of the following must be done to the Genre column before this task can be completed?

Options:

A.

Append

B.

Merge

C.

Concatenate

D.

Delimit

Questions 33

Given the following data tables:

Which of the following MDM processes needs to take place FIRST?

Options:

A.

Creation of a data dictionary

B.

Compliance with regulations

C.

Standardization of data field names

D.

Consolidation of multiple data fields

Questions 34

Under which of the following circumstances should the null hypothesis be accepted when α = 0.05?

Options:

A.

When p is 0.00003

B.

When p is 0.001

C.

When p is 0.04

D.

When p is 0.06
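
The usual decision rule compares the p-value with the significance level. The sketch below applies it to the p-values listed in the options. Note that what the question calls "accepting" the null hypothesis is usually phrased as "failing to reject" it; the function name is hypothetical.

```python
ALPHA = 0.05  # significance level from the question


def decision(p_value: float, alpha: float = ALPHA) -> str:
    """Apply the usual rule: reject H0 when p < alpha, otherwise fail to reject."""
    return "reject H0" if p_value < alpha else "fail to reject H0"


for p in (0.00003, 0.001, 0.04, 0.06):  # the p-values listed in the options
    print(p, decision(p))
```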

Questions 35

An analyst needs to know what data an organization possesses. Which of the following is the best document for the analyst to consult?

Options:

A.

Data destruction policy

B.

Data use document

C.

Data dictionary

D.

Data retention policy

Questions 36

A business unit made the following modification to the values in a table:

Which of the following data quality dimensions was applied in this scenario?

Options:

A.

Integrity

B.

Consistency

C.

Completeness

D.

Accuracy

Questions 37

A data analyst has been asked to create a daily manufacturing report for the floor manager. Which of the following metrics should be included in the report?

Options:

A.

Tons of steel produced per hour

B.

Annual sales budget

C.

End-of-day stock price

D.

Daily corporate employee count

Questions 38

An analyst wants to extract data from a variety of sources and store the data in a cloud-based environment prior to cleaning. Which of the following integration techniques should the analyst use?

Options:

A.

ETL

B.

API

C.

SQL

D.

ELT

Questions 39

An analyst is working on a project for a director. During this process, the analyst pulled the data, created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?

Options:

A.

Complete an audit on the data pulled for the report.

B.

Complete a check for quality in the report.

C.

Complete a review of the data and a check for consistency.

D.

Complete a trend analysis to be included in the report.

Questions 40

An analyst is required to run a text analysis of data that is found in articles from a digital news outlet. Which of the following would be the BEST technique for the analyst to apply to acquire the data?

Options:

A.

Web scraping

B.

Sampling

C.

Data wrangling

D.

ETL

Questions 41

Which of the following is an example of PII?

Options:

A.

Age

B.

Name

C.

Ethnicity

D.

Gender

Questions 42

Daniel is using the Structured Query Language to work with data stored in a relational database.

He would like to add several new rows to a database table.

What command should he use?

Options:

A.

SELECT.

B.

ALTER.

C.

INSERT.

D.

UPDATE.
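
As a minimal sketch of adding several rows to a table, the example below uses Python's built-in `sqlite3` module with an in-memory database. The `customers` table and its contents are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
# Hypothetical table used only for this illustration.
conn.execute("CREATE TABLE customers (id INTEGER PRIMARY KEY, name TEXT)")

# executemany runs one parameterized INSERT per tuple, adding several rows.
new_rows = [(1, "Ada"), (2, "Grace"), (3, "Alan")]
conn.executemany("INSERT INTO customers (id, name) VALUES (?, ?)", new_rows)
conn.commit()

count = conn.execute("SELECT COUNT(*) FROM customers").fetchone()[0]
print(count)
```

By contrast, UPDATE changes values in rows that already exist and ALTER changes the table's structure, so neither adds new rows.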

Questions 43

A data analyst is using a two-tailed, independent t-test to determine whether the type of stretching, dynamic or static, has any influence on a dancer's flexibility. Which of the following is the alternative hypothesis?

Options:

A.

A dancer's flexibility is improved through static stretching.

B.

The change in a dancer's flexibility is not equal to zero.

C.

There is a difference in a dancer's flexibility between static and dynamic stretching.

D.

The means of the static and dynamic stretching groups do not differ from each other.

Questions 44

Which of the following query statements would be used when filtering data in a relational database management system? (Select two).

Options:

A.

ORDER BY

B.

HAVING

C.

WHERE

D.

SELECT

E.

INSERT

F.

GROUP BY
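
To show where each clause acts in a query, here is a small sketch run against an in-memory SQLite database through Python's `sqlite3` module. The `sales` table and its values are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE sales (region TEXT, amount INTEGER)")  # hypothetical table
conn.executemany(
    "INSERT INTO sales VALUES (?, ?)",
    [("East", 100), ("East", 300), ("West", 50), ("West", 20), ("North", 500)],
)

# WHERE filters individual rows before grouping;
# HAVING filters the grouped results after aggregation.
query = """
    SELECT region, SUM(amount) AS total
    FROM sales
    WHERE amount >= 50
    GROUP BY region
    HAVING SUM(amount) > 100
    ORDER BY region
"""
result = conn.execute(query).fetchall()
print(result)
```

In this sketch GROUP BY and ORDER BY only arrange the data; the rows and groups that drop out are removed by WHERE and HAVING.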

Questions 45

A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:

Customer Table -

In-store Transactions –

Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?

Options:

A.

INNER: 6 rows; LEFT: 9 rows

B.

INNER: 9 rows; LEFT: 6 rows

C.

INNER: 9 rows; LEFT: 15 rows

D.

INNER: 15 rows; LEFT: 9 rows

Questions 46

Which of the following statistical methods requires two or more categorical variables?

Options:

A.

Simple linear regression

B.

Chi-squared test

C.

Z-test

D.

Two-sample t-test

Questions 47

Which of the following analysis techniques is an unsupervised data mining process?

Options:

A.

Clustering

B.

Descriptive

C.

Regression

D.

Predictive

Questions 48

What R package makes it easy to work with dates?

Options:

A.

Lubridate.

B.

Datemath.

C.

Stringr.

D.

ggplot.

Questions 49

An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?

Options:

A.

Join

B.

Append

C.

Transform

D.

Blend

Questions 50

Given the following data:

CustomerID   ItemBought   Date
Tre_234      Sofa         2022-09-08
216_Tre      Shoes        08/02/2021
215/Tre      Blanket      2021/06/20
045/Tre      Mug          12-26-2021
Tre-345      Lamp         31/08/2022
TREJD19      Bucket       2022'08/01

Which of the following best describes the main issue in the data set?

Options:

A.

Inconsistent data

B.

Data mismatch

C.

Invalid data

D.

Redundant data

Questions 51

Which of the following describes the use of a representative amount of data from a main repository?

Options:

A.

Observation

B.

Delta load

C.

Web scraping

D.

Sampling

Questions 52

Kelly wants to get feedback on the final draft of a strategic report that has taken her six months to develop.

What can she do to prevent confusion as she seeks feedback before publishing the report?

Choose the best answer.

Options:

A.

Distribute the report to the appropriate stakeholders via email.

B.

Use a watermark to identify the report as a draft.

C.

Show the report to her immediate supervisor.

D.

Publish the report on an internally facing website.

Questions 53

Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?

Options:

A.

Missing data

B.

Duplicate data

C.

Redundant data

D.

Invalid data
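
The effect of DISTINCT can be seen in a small sketch using an in-memory SQLite database through Python's `sqlite3` module. The `contacts` table and the email addresses are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE contacts (email TEXT)")  # hypothetical table
conn.executemany(
    "INSERT INTO contacts VALUES (?)",
    [("a@example.com",), ("b@example.com",), ("a@example.com",)],
)

# Without DISTINCT every stored row comes back, including repeats.
all_rows = conn.execute("SELECT email FROM contacts ORDER BY email").fetchall()
# With DISTINCT each repeated value is returned only once.
unique_rows = conn.execute(
    "SELECT DISTINCT email FROM contacts ORDER BY email"
).fetchall()
print(len(all_rows), len(unique_rows))
```

DISTINCT only collapses rows whose selected values match exactly; it does not fill in missing values or correct invalid ones.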

Questions 54

A stakeholder wants to see daily sales targets organized in a dashboard by country, state, city, and ZIP Code. Which of the following delivery considerations must a data analyst take into account when creating the dashboard?

Options:

A.

Variable formatting

B.

Drill-down capability

C.

Saved searches

D.

Access permissions

Questions 55

Which of the following contains alphanumeric values?

Options:

A.

10.1Ε²

B.

13.6

C.

1347

D.

A3J7

Questions 56

A JSON file is an example of:

Options:

A.

structured data.

B.

web data.

C.

machine data.

D.

processed data.

Questions 57

A business intelligence engineer needs to reduce the size of a data model for reporting purposes. The data set contains more than one million rows, and the table has a date-time column named Date. Which of the following should the analyst do to complete this task?

Options:

A.

Change the data type of the Date column to text.

B.

Trim the date.

C.

Round the hour of the Date column to the start of the hour.

D.

Split the Date column into two columns—time and date.

Questions 58

Which of the following activities occurs during the ETL process?

Options:

A.

Reviewing and addressing missing values

B.

Creating a dashboard

C.

Inserting a pivot table and pivot chart

D.

Multiplying unique data

Questions 59

Given the following data:

Which of the following BEST describes the data set?

Options:

A.

There is data bias.

B.

The data is incomplete.

C.

The data is inconsistent.

D.

The data is outliers.

Questions 60

An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:

Which of the following types of charts should be considered to best display the data?

Options:

A.

Include a bar chart using the site and the percentage of new customers data.

B.

Include a line chart using the site and the percentage of new customers data.

C.

Include a pie chart using the site and percentage of new customers data.

D.

Include a scatter chart using the site and the percent of new customers data.

Questions 61

Consider the following dataset which contains information about houses that are for sale:

Which of the following string manipulation commands will combine the address and region name columns to create a full address?

full_address
-------------------------
85 Turner St, Northern Metropolitan
25 Bloomburg St, Northern Metropolitan
5 Charles St, Northern Metropolitan
40 Federation La, Northern Metropolitan
55a Park St, Northern Metropolitan

Options:

A.

SELECT CONCAT(address, ' , ' , regionname) AS full_address FROM melb LIMIT 5;

B.

SELECT CONCAT(address, '-' , regionname) AS full_address FROM melb LIMIT 5;

C.

SELECT CONCAT(regionname, ' , ' , address) AS full_address FROM melb LIMIT 5

D.

SELECT CONCAT(regionname, '-' , address) AS full_address FROM melb LIMIT 5;
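
To see how argument order and separator shape the result, the sketch below runs a comparable query against an in-memory SQLite database through Python's `sqlite3` module. Two assumptions apply: SQLite's `||` operator stands in for `CONCAT()`, whose availability depends on the database engine and version, and the `', '` separator is chosen to match the sample output shown in the question. Only two of the sample rows are loaded.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE melb (address TEXT, regionname TEXT)")
conn.executemany(
    "INSERT INTO melb VALUES (?, ?)",
    [
        ("85 Turner St", "Northern Metropolitan"),
        ("25 Bloomburg St", "Northern Metropolitan"),
    ],
)

# SQLite's || operator concatenates strings; it stands in here for CONCAT().
rows = conn.execute(
    "SELECT address || ', ' || regionname AS full_address "
    "FROM melb ORDER BY rowid LIMIT 5"
).fetchall()
print(rows)
```

Swapping the two column names in the expression would put the region first, and changing the separator literal would change the punctuation between the two parts.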

Questions 62

A web developer wants to ensure that malicious users can't type SQL statements when they are asked for input, like their username/userid.

Which of the following query optimization techniques would effectively prevent SQL Injection attacks?

Options:

A.

Indexing.

B.

Subset of records.

C.

Temporary table in the query set.

D.

Parametrization.
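
A minimal sketch of a parameterized query follows, using Python's built-in `sqlite3` module. The `users` table, its rows, and the helper `find_user` are hypothetical.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (userid TEXT, secret TEXT)")  # hypothetical table
conn.executemany("INSERT INTO users VALUES (?, ?)", [("alice", "a1"), ("bob", "b2")])


def find_user(userid: str) -> list:
    """Look up a user with a bound parameter instead of string formatting."""
    # The ? placeholder makes the driver treat userid strictly as data.
    return conn.execute(
        "SELECT userid FROM users WHERE userid = ?", (userid,)
    ).fetchall()


print(find_user("alice"))
print(find_user("alice' OR '1'='1"))  # classic injection attempt matches nothing
```

Because the input is bound as a value rather than spliced into the SQL text, the quote characters in the second call cannot change the structure of the statement.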

Questions 63

An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?

Options:

A.

Determine the data needs and review the observations.

B.

Determine the data needs and sources for analysis.

C.

Determine the data needs and schedule interviews.

D.

Determine the data needs and begin the analysis.

Questions 64

Which of the following data types would a telephone number formatted as XXX-XXX-XXXX be considered?

Options:

A.

Numeric

B.

Date

C.

Float

D.

Text

Questions 65

When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.

What term describes this action?

Options:

A.

Filtering.

B.

Normalization.

C.

Transposition.

D.

Aggregation.
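
Rescaling a variable onto the 0 to 1 range is commonly done with a min-max formula, (x - min) / (max - min). The sketch below shows one way to write it; the sample values and the handling of a constant column are assumptions.

```python
def min_max_scale(values: list[float]) -> list[float]:
    """Rescale values linearly so the minimum maps to 0 and the maximum to 1."""
    low, high = min(values), max(values)
    if high == low:
        # Assumption: a constant column is mapped to 0.0 to avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - low) / (high - low) for v in values]


heights = [150, 160, 170, 200]  # hypothetical variable
scaled = min_max_scale(heights)
print(scaled)
```

Applying the same rescaling to both variables puts them on a common scale, so neither dominates a comparison simply because of its units.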

Questions 66

A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:

Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?

Options:

A.

Standardize the field names.

B.

Recode the data values.

C.

Overwrite the field names in one of the tables.

D.

Edit the field names in the data dictionary.

Questions 67

Which of the following differentiates a flat text file from other data types?

Options:

A.

Data is separated by a delimiter.

B.

Data is stored in defined rows.

C.

Data is defined with key-value pairs.

D.

Data is housed in a markup language.

Questions 68

An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?

Options:

A.

Drop-down menu

B.

Date range

C.

Text field

D.

Frequency

Questions 69

A data analyst is asked on the morning of April 9, 2020, to create a sales report that identifies sales year to date. The daily sales data is current through the end of the day. Which of the following date ranges should be on the report?

Options:

A.

January 1, 2020 to April 1, 2020

B.

January 1, 2020 to April 7, 2020

C.

January 1, 2020 to April 8, 2020

D.

January 1, 2020 to April 9, 2020

Questions 70

Which of the following technologies would be best suited for creating a multiple linear regression model?

Options:

A.

Microsoft Power BI

B.

R

C.

SQL

D.

Tableau

Questions 71

Given the data below:

In which of the following file formats is the data presented?

Options:

A.

XLS

B.

CSV

C.

RTF

D.

XML

Questions 72

A data analyst needs to create a master file that includes customer information from the tables below:

Given the three tables above, the analyst wants to filter down the information prior to joining it together. In which of the following orders should this data manipulation be approached for the most efficient result?

Options:

A.

Merge, append, deduplicate

B.

Merge, deduplicate, append

C.

Deduplicate, append, merge

D.

Append, deduplicate, merge

Questions 73

Alex wants to use data from his corporate sales, CRM, and shipping systems to try to predict future sales.

Which of the following systems is the most appropriate?

Choose the best answer.

Options:

A.

Data mart.

B.

OLAP.

C.

Data Warehouse.

D.

OLTP.

Questions 74

A data analyst needs to perform a full outer join of a customer's orders using the tables below:

Which of the following is the mean of the order quantity?

Options:

A.

73.5

B.

76.5

C.

78.8

D.

81.5

Questions 75

Each month an analyst needs to execute a data pull for the two prior months. Which of the following is the most efficient function for the analyst to use?

Options:

A.

Logical

B.

Date

C.

Aggregate

D.

System
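
A rolling window such as "the two prior months" can be derived from the run date rather than typed in by hand each month. The sketch below is one possible approach using Python's `datetime` module; the function name and the interpretation of the window as the two full calendar months before the current one are assumptions.

```python
from datetime import date, timedelta


def prior_two_months(today: date) -> tuple[date, date]:
    """Return the first day of the month two months back and the last day of last month."""
    first_of_this_month = today.replace(day=1)
    end = first_of_this_month - timedelta(days=1)  # last day of the previous month
    first_of_prev = end.replace(day=1)
    # Step back one more day to land in the month before, then snap to its first day.
    start = (first_of_prev - timedelta(days=1)).replace(day=1)
    return start, end


window = prior_two_months(date(2020, 3, 15))
print(window)
```

Working from the first of the month and stepping back a day sidesteps the varying month lengths, including leap-year February.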

Questions 76

Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5,000,000 rows?

Options:

A.

Microsoft Excel

B.

R

C.

Snowflake

D.

SQL

Questions 77

An analyst reviews the following data:

7, 3, 5, 2, 3, 7, 7, 10

Which of the following is the value of the mode?

Options:

A.

3

B.

5

C.

7

D.

10
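
The mode is the value that occurs most often. As a quick check on the values listed in the question, the sketch below counts frequencies with Python's standard library.

```python
import statistics
from collections import Counter

data = [7, 3, 5, 2, 3, 7, 7, 10]  # values listed in the question

counts = Counter(data)         # frequency of each value
mode = statistics.mode(data)   # most frequent value
print(counts.most_common(), mode)
```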

Questions 78

The number of phone calls that the call center receives in a day is an example of:

Options:

A.

continuous data.

B.

categorical data.

C.

ordinal data.

D.

discrete data.

Questions 79

A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?

Options:

A.

Trend analysis

B.

Performance analysis

C.

Link analysis

D.

Exploratory analysis

Questions 80

A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?

Options:

A.

A real-time monitor that allows the manager to view performance the day the campaign was launched

B.

A self-service dashboard that allows the manager to look at the company's annual budget performance

C.

A spreadsheet of the raw data from all marketing campaigns and channels

D.

A summary with statistics, conclusions, and recommendations from the data analyst

Questions 81

A data analyst is compiling a report that a Chief Executive Officer needs for an impromptu meeting. The report should include information on the previous day's performance. Which of the following reports should the analyst provide?

Options:

A.

Tactical

B.

Ad hoc

C.

Dynamic

D.

Recurring

Questions 82

An analyst needs to join two data sets that compare vehicle weights. One data set is in pounds, and the other has various units of measure. Which of the following should the analyst do first to the data prior to any type of join?

Options:

A.

Blend

B.

Reduce

C.

Concatenate

D.

Normalize

Questions 83

A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?

Options:

A.

Sampling

B.

Partitioning

C.

Windowing

D.

Sorting

Questions 84

A data analyst has a set with more than 40,000 rows in the sample schema below:

The analyst would like to create one column that contains the customers’ birth dates. Which of the following data quality dimensions would BEST explain the reason for compilation?

Options:

A.

Data accuracy

B.

Data completeness

C.

Data duplication

D.

Data integrity

Questions 85

Which of the following is a process that is used during data integration to collect, blend, and load data?

Options:

A.

MDM

B.

ETL

C.

OLTP

D.

BI

Questions 86

Which of the following reports can be used when insight into operational performance is needed each Wednesday?

Options:

A.

Static report

B.

Tactical report

C.

Recurring report

D.

Ad hoc report

Questions 87

Which of the following data governance concepts fits into the security requirements category?

Options:

A.

Data transmission

B.

Data deletion

C.

Data use agreements

D.

Personally identifiable information

Questions 88

A data analyst needs to create a data visualization that aids in understanding the cumulative impact of sequentially introduced values that are positive or negative. Which of the following data visualization methods should the analyst use?

Options:

A.

A bubble chart

B.

A waterfall chart

C.

A scatter plot

D.

A line chart

Questions 89

An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.

Which of the following is the correct join the analyst should complete, and how many total rows will be in one table?

Options:

A.

INNER JOIN, two rows

B.

LEFT JOIN, four rows

C.

RIGHT JOIN, five rows

D.

OUTER JOIN, seven rows

Questions 90

A database administrator is required to mask certain table columns containing PII in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).

Options:

A.

Government-issued ID

B.

Address

C.

Order ID

D.

Order date

E.

Customer ID

F.

Referral number

Questions 91

An analyst notices changes in sales ratios when analyzing a quarterly report. Which of the following is the analyst conducting?

Options:

A.

A gap analysis

B.

A link analysis

C.

A trend analysis

D.

A statistical analysis

Questions 92

A data set has the following values:

Which of the following is the best reason for cleansing the data?

Options:

A.

Invalid data

B.

Redundant data

C.

Data outliers

D.

Missing data

Questions 93

An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?

Options:

A.

A word cloud

B.

A histogram

C.

A pie chart

D.

A scatter plot

Questions 94

A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?

Options:

A.

A self-service report

B.

A research report

C.

An ad hoc report

D.

An operational report

Questions 95

A data set for sales per month includes the following data:

Which of the following cleaning and profiling methods should be applied to the data set?

Options:

A.

Data outliers

B.

Invalid data

C.

Duplicate data

D.

Data type validation

Questions 96

An analyst develops an IT document and needs to describe the technical terms used in the document. Which of the following is where the analyst should include descriptions of the technical terms?

Options:

A.

Glossary

B.

System diagram

C.

User requirements

D.

Index

Questions 97

A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to test this hypothesis?

Options:

A.

t-test

B.

Chi-squared test

C.

Rank sum test

D.

Ratio test
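
One way to compare observed head and tail counts with the counts a fair coin would predict is Pearson's chi-squared goodness-of-fit statistic. The sketch below computes it by hand; the flip counts are hypothetical, and 3.841 is the standard critical value for one degree of freedom at the 0.05 level.

```python
def chi_squared_statistic(observed: list[int], expected: list[float]) -> float:
    """Pearson's chi-squared statistic: sum of (O - E)^2 / E over categories."""
    return sum((o - e) ** 2 / e for o, e in zip(observed, expected))


heads, tails = 60, 40          # hypothetical results of 100 flips
n = heads + tails
expected = [n / 2, n / 2]      # a fair coin predicts a 50/50 split
stat = chi_squared_statistic([heads, tails], expected)

CRITICAL_95_DF1 = 3.841        # chi-squared critical value, df = 1, alpha = 0.05
print(stat, stat > CRITICAL_95_DF1)
```

With these made-up counts the statistic exceeds the critical value, so the hypothesis of a fair coin would be rejected at the 0.05 level; different counts could lead to the opposite conclusion.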

Questions 98

A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:

Income category 1: less than $1.

Income category 2: more than $1 and less than $20,000.

Income category 3: more than $20,001 and less than $40,000.

Income category 4: more than $40,001.

Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?

Options:

A.

Data merge

B.

Derived variables

C.

Data blending

D.

Data append
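
Creating a new field computed from an existing one can be sketched as below. The function name and sample incomes are hypothetical, and the handling of the exact boundaries is an assumption, because the ranges in the question leave values such as exactly $1 or $20,000 unassigned.

```python
def income_category(income: float) -> int:
    """Derive a category from income.

    Boundary handling is an assumption: the question's ranges leave the
    exact edges unspecified, so each edge is folded into the lower band.
    """
    if income < 1:
        return 1
    if income <= 20_000:
        return 2
    if income <= 40_000:
        return 3
    return 4


incomes = [0, 15_000, 35_000, 90_000]  # hypothetical values
categories = [income_category(x) for x in incomes]
print(categories)
```

The original "Income" values stay untouched; the categories live in a separate field calculated from them.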

Questions 99

A data analyst reviews the following data set:

Which of the following is the range value?

Options:

A.

9

B.

10

C.

12

D.

13

Questions 100

You are working with a dataset and want to change the names of categories that you used for different types of books.

What term best describes this action?

Options:

A.

Recoding.

B.

Summarizing.

C.

Aggregating.

D.

Filtering.

Questions 101

Which of the following would be used to store unstructured data from different sources?

Options:

A.

A data lake

B.

A database management system

C.

A database

D.

A data warehouse

Questions 102

A database consists of one fact table that is composed of multiple dimensions. Depending on the dimension, each one can be represented by a denormalized table or multiple normalized tables. This structure is an example of a:

Options:

A.

transactional schema.

B.

star schema.

C.

non-relational schema.

D.

snowflake schema.

Questions 103

A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?

Options:

A.

Role-based

B.

Rule-based

C.

Discretionary

D.

Group-based

Questions 104

Which of the following data manipulation techniques is an example of a logical function?

Options:

A.

WHERE

B.

AGGREGATE

C.

BOOLEAN

D.

IF

Questions 105

An analyst is building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?

Options:

A.

The date of the dashboard build

B.

The data refresh date

C.

A report summary

D.

Frequently asked questions

Questions 106

Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)

Options:

A.

Data identification

B.

Data processing

C.

Data reporting

D.

Data encryption

E.

Data masking

F.

Data removal

Questions 107

Emma is working in a data warehouse and finds that a finance fact table links to an organization dimension, which in turn links to a currency dimension that is not linked to the fact table.

What type of design pattern is the data warehouse using?

Options:

A.

Star.

B.

Sun.

C.

Snowflake.

D.

Comet.

Questions 108

Which of the following data analysis tools increases the efficiency of data visualizations?

Options:

A.

SQL

B.

Microsoft Excel

C.

SAS

D.

RapidMiner

Exam Code: DA0-001
Exam Name: CompTIA Data+ Certification Exam
Last Update: Apr 1, 2025
Questions: 363