An analyst is working with a data set that lists individuals' first and last names in separate columns. Which of the following processes should the analyst use to combine the first and last names into a single spreadsheet cell?
An analyst has generated a report that includes the number of months in the first two quarters of 2019 when sales exceeded $50,000:
Which of the following functions did the analyst use to generate the data in the Sales_indicator column?
The current date is July 14, 2020. A data analyst has been asked to create a report that shows the company's year-over-year Q2 2020 sales. Which of the following reports should the analyst compare?
Which of the following roles is responsible for ensuring an organization's data quality, security, privacy, and regulatory compliance?
A company's human resources department has asked a data analyst to categorize the income of all employees into five salary bands:
Which of the following types of functions would be the most appropriate to use?
Which of the following explains why standardization of data field names is important to master data management concepts?
Which of the following data types must be used when working with variables that require classification into two or more groups before analysis?
A data analyst has received a data set that contains actual and projected sales for the fourth quarter of 2019. Which of the following statistical methods should the analyst use to find the measure of dispersion?
A development company is constructing a new unit in its apartment complex. The complex has the following floor plans:
Using the average cost per square foot of the original floor plans, which of the following should be the price of the Rose unit?
Taylor wants to investigate how manufacturing, marketing, and sales expenditures impact overall profitability for her company.
Which of the following systems is the most appropriate?
A user imports a data file into the accounts payable system each day. On a regular basis, the field input is not what the system is expecting, so it results in an error for the row and a broken import process. To resolve the issue, the user opens the file, finds the error in the row, and manually corrects it before attempting the import again. The import sometimes breaks on subsequent attempts, though. Which of the following changes should be made to this process to reduce the number of errors?
A customer's telephone number is in the format 123-456-7890. Which of the following data types is used for the phone number?
Q3 2020 has just ended, and now a data analyst needs to create an ad-hoc sales report that demonstrates how well the Q3 2020 promotion went versus last year's Q3 promotion.
Which of the following date parameters should the analyst use?
An analyst is preparing a report that contains weather data. The temperatures are shown in Fahrenheit, but they must be reported in Celsius. Which of the following should the analyst do to fix this issue?
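For reference, the standard conversion is C = (F - 32) * 5/9. A minimal Python sketch follows; the function name `fahrenheit_to_celsius` and the sample readings are illustrative assumptions, not part of the question.

```python
def fahrenheit_to_celsius(temp_f: float) -> float:
    """Convert a Fahrenheit temperature to Celsius."""
    return (temp_f - 32) * 5 / 9


# Hypothetical sample readings, not taken from the report in the question.
readings_f = [32.0, 212.0, 98.6]

# Derive the Celsius values and round them for reporting.
readings_c = [round(fahrenheit_to_celsius(t), 1) for t in readings_f]
```

Deriving the Celsius values from the stored Fahrenheit values keeps the source data unchanged, so the conversion can be rerun or audited later.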
A company wants to know how its customers interact with an e-commerce website based on clicks over items. Which of the following is the primary requirement for this report?
Which of the following types of analyses should be used to evaluate the connections and anomalies in a data set when either known patterns are being violated or new patterns are emerging?
A client wants a new report that will be automatically emailed to all global sales teams on a weekly basis. Each sales team must be able to view the sales for its region and the combined sales for all regions. Which of the following would be the most efficient method for meeting the requirements?
A data analyst needs to present the results of an online marketing campaign to the marketing manager. The manager wants to see the most important KPIs and measure the return on marketing investment. Which of the following should the data analyst use to BEST communicate this information to the manager?
An analyst conducted a preliminary analysis for a data set and identified several patterns and anomalies. Which of the following analysis techniques did the analyst use?
Which of the following should be accomplished NEXT after understanding a business requirement for a data analysis report?
A company’s marketing department wants to do a promotional campaign next month. A data analyst on the team has been asked to perform customer segmentation, looking at how recently a customer bought the product, at what frequency, and at what value. Which of the following types of analysis would this practice be considered?
A data analyst for a media company needs to determine the most popular movie genre. Given the table below:
Which of the following must be done to the Genre column before this task can be completed?
Given the following data tables:
Which of the following MDM processes needs to take place FIRST?
Under which of the following circumstances should the null hypothesis be accepted when α = 0.05?
An analyst needs to know what data an organization possesses. Which of the following is the best document for the analyst to consult?
A business unit made the following modification to the values in a table:
Which of the following data quality dimensions was applied in this scenario?
A data analyst has been asked to create a daily manufacturing report for the floor manager. Which of the following metrics should be included in the report?
An analyst wants to extract data from a variety of sources and store the data in a cloud-based environment prior to cleaning. Which of the following integration techniques should the analyst use?
An analyst is working on a project for a director. During this process, the analyst pulled the data, created summarized tables and graphs with descriptions, created a report summary, and inserted all items into a report. After writing the report, which of the following would be the most appropriate next step?
An analyst is required to run a text analysis of data that is found in articles from a digital news outlet. Which of the following would be the BEST technique for the analyst to apply to acquire the data?
Daniel is using the Structured Query Language to work with data stored in a relational database.
He would like to add several new rows to a database table.
What command should he use?
A data analyst is using a two-tailed, independent t-test to determine whether the type of stretching, dynamic or static, has any influence on a dancer's flexibility. Which of the following is the alternative hypothesis?
Which of the following query statements would be used when filtering data in a relational database management system? (Select two).
A data analyst has been asked to merge the tables below, first performing an INNER JOIN and then a LEFT JOIN:
Customer Table -
In-store Transactions –
Which of the following describes the number of rows of data that can be expected after performing both joins in the order stated, considering the customer table as the main table?
Which of the following statistical methods requires two or more categorical variables?
Which of the following analysis techniques is an unsupervised data mining process?
An analyst is updating a customer contacts database with information obtained from a survey of new customers. Which of the following data manipulation techniques should the analyst use?
Given the following data:
CustomerID
ItemBought
Date
Tre_234
Sofa
2022-09-08
216_Tre
Shoes
08/02/2021
215/Tre
Blanket
2021/06/20
045/Tre
Mug
12-26-2021
Tre-345
Lamp
31/08/2022
TREJD19
Bucket
2022'08/01
Which of the following best describes the main issue in the data set?
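One way to see the problem in the Date column is to test each value against a handful of common date layouts. The sketch below is illustrative only: the list of candidate formats is an assumption, and `matching_formats` is a hypothetical helper name.

```python
from datetime import datetime

# Candidate layouts are an assumption chosen for illustration.
CANDIDATE_FORMATS = ["%Y-%m-%d", "%Y/%m/%d", "%m/%d/%Y", "%d/%m/%Y", "%m-%d-%Y"]


def matching_formats(text: str) -> list[str]:
    """Return every candidate format that successfully parses the string."""
    matches = []
    for fmt in CANDIDATE_FORMATS:
        try:
            datetime.strptime(text, fmt)
        except ValueError:
            continue  # this layout does not fit the value
        matches.append(fmt)
    return matches


# Date values copied from the data set above.
dates = ["2022-09-08", "08/02/2021", "2021/06/20",
         "12-26-2021", "31/08/2022", "2022'08/01"]

# Map each raw value to the layouts it could belong to.
report = {d: matching_formats(d) for d in dates}
```

Under these assumed formats, nearly every row follows a different layout, `08/02/2021` fits both a month-first and a day-first reading, and `2022'08/01` fits none of them.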
Which of the following describes the use of a representative amount of data from a main repository?
Kelly wants to get feedback on the final draft of a strategic report that has taken her six months to develop.
What can she do to prevent confusion as she seeks feedback before publishing the report?
Choose the best answer.
Which of the following data cleansing issues will be fixed when a DISTINCT function is applied?
A stakeholder wants to see daily sales targets organized in a dashboard by country, state, city, and ZIP Code. Which of the following delivery considerations must a data analyst take into account when creating the dashboard?
A business intelligence engineer needs to reduce the size of a data model for reporting purposes. The data set contains more than one million rows, and the table has a date-time column named Date. Which of the following should the analyst do to complete this task?
An analyst is designing a dashboard to determine which site has the highest percentage of new customers. The analyst must choose an appropriate chart to include in the dashboard. The following data is available:
Which of the following types of charts should be considered to best display the data?
Consider the following dataset which contains information about houses that are for sale:
Which of the following string manipulation commands will combine the address and region name columns to create a full address?
full_address
-------------------------
85 Turner St, Northern Metropolitan
25 Bloomburg St, Northern Metropolitan
5 Charles St, Northern Metropolitan
40 Federation La, Northern Metropolitan
55a Park St, Northern Metropolitan
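The output above is what string concatenation with a ", " separator produces. The source table is not shown, so the following Python sketch assumes the column names `address` and `region_name` and reuses two of the values from the expected output. It illustrates the operation itself, not any particular SQL dialect's syntax.

```python
# Hypothetical rows; the keys "address" and "region_name" are assumed column names.
houses = [
    {"address": "85 Turner St", "region_name": "Northern Metropolitan"},
    {"address": "25 Bloomburg St", "region_name": "Northern Metropolitan"},
]


def full_address(row: dict) -> str:
    """Join the two columns with a comma and a space."""
    return f'{row["address"]}, {row["region_name"]}'


full_addresses = [full_address(row) for row in houses]
```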
A web developer wants to ensure that malicious users can't type SQL statements when they are asked for input, like their username/userid.
Which of the following query optimization techniques would effectively prevent SQL Injection attacks?
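A common defense against SQL injection is to pass user input as a bound parameter instead of splicing it into the SQL text. The sketch below uses Python's built-in `sqlite3` module. The `users` table, its columns, and the helper `find_user` are hypothetical and exist only for this illustration.

```python
import sqlite3

# In-memory database with a hypothetical users table, for illustration only.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE users (userid TEXT PRIMARY KEY, name TEXT)")
conn.execute("INSERT INTO users VALUES ('alice', 'Alice Example')")


def find_user(connection: sqlite3.Connection, userid: str) -> list:
    """Look up a user; the ? placeholder binds userid as data, never as SQL."""
    cursor = connection.execute(
        "SELECT userid, name FROM users WHERE userid = ?", (userid,)
    )
    return cursor.fetchall()


# A typical injection attempt is compared as a literal string, so it matches nothing.
malicious_input = "alice' OR '1'='1"
legit_rows = find_user(conn, "alice")
attack_rows = find_user(conn, malicious_input)
```

Because the driver sends the value separately from the statement, the quote characters in the malicious input never change the structure of the query.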
An analyst has conducted a review of business questions. Which of the following should the analyst do next to conduct an analysis?
Which of the following data types would a telephone number formatted as XXX-XXX-XXXX be considered?
When analyzing the values of two variables, you decide to convert both variables so they are on a scale of 0 to 1.
What term describes this action?
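Rescaling values onto a 0 to 1 range is commonly done with min-max scaling, x' = (x - min) / (max - min). A minimal Python sketch, where the function name `min_max_scale` and the sample values are assumptions made for illustration:

```python
def min_max_scale(values: list[float]) -> list[float]:
    """Rescale values linearly so the smallest maps to 0 and the largest to 1."""
    lo, hi = min(values), max(values)
    if hi == lo:
        # All values are identical; return zeros to avoid dividing by zero.
        return [0.0 for _ in values]
    return [(v - lo) / (hi - lo) for v in values]


# Hypothetical variables measured on very different scales.
ages = min_max_scale([20, 35, 50])
incomes = min_max_scale([30_000, 60_000, 90_000])
```

After scaling, both variables span the same range, so neither dominates a comparison simply because of its units.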
A marketing analytics team received customer transaction data from two different sources. The data is complete and accurate; however, the field names appear to be inconsistent. Given the following tables:
Which of the following is considered best practice if the team wants to consolidate the files and conduct further analysis?
An analyst is creating a resource to improve users' experience when they select specific records based on particular dates. Which of the following should the analyst use to create a resource that best meets user needs?
A data analyst is asked on the morning of April 9, 2020, to create a sales report that identifies sales year to date. The daily sales data is current through the end of the day. Which of the following date ranges should be on the report?
Which of the following technologies would be best suited for creating a multiple linear regression model?
Given the data below:
In which of the following file formats is the data presented?
A data analyst needs to create a master file that includes customer information from the tables below:
Given the three tables above, the analyst wants to filter down the information prior to joining it together. In which of the following orders should this data manipulation be approached for the most efficient result?
Alex wants to use data from his corporate sales, CRM, and shipping systems to try to predict future sales.
Which of the following systems is the most appropriate?
Choose the best answer.
A data analyst needs to perform a full outer join of a customer's orders using the tables below:
Which of the following is the mean of the order quantity?
Each month an analyst needs to execute a data pull for the two prior months. Which of the following is the most efficient function for the analyst to use?
Which of the following tools would be best to use to calculate the interquartile range, median, mean, and standard deviation of a column in a table that has 5,000,000 rows?
An analyst reviews the following data:
7
3
5
2
3
7
7
10
Which of the following is the value of the mode?
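The mode is the value that occurs most often. A short Python check using the standard library and the values listed above:

```python
from collections import Counter
from statistics import mode

# Values copied from the data set in the question.
values = [7, 3, 5, 2, 3, 7, 7, 10]

# Frequency of each distinct value.
counts = Counter(values)

# The most frequent value.
most_common_value = mode(values)
```

Here 7 appears three times, 3 appears twice, and every other value appears once.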
The number of phone calls that the call center receives in a day is an example of:
A research analyst wants to determine whether the data being analyzed is connected to other datapoints. Which of the following is the BEST type of analysis to conduct?
A data analyst is compiling a report that a Chief Executive Officer needs for an impromptu meeting. The report should include information on the previous day's performance. Which of the following reports should the analyst provide?
An analyst needs to join two data sets that compare vehicle weights. One data set is in pounds, and the other has various units of measure. Which of the following should the analyst do first to the data prior to any type of join?
A database administrator needs to increase performance on a large dimension table. Which of the following is the best way to accomplish this task?
A data analyst has a set with more than 40,000 rows in the sample schema below:
The analyst would like to create one column that contains the customers’ birth dates. Which of the following data quality dimensions would BEST explain the reason for compilation?
Which of the following is a process that is used during data integration to collect, blend, and load data?
Which of the following reports can be used when insight into operational performance is needed each Wednesday?
Which of the following data governance concepts fits into the security requirements category?
A data analyst needs to create a data visualization that aids in understanding the cumulative impact of sequentially introduced values that are positive or negative. Which of the following data visualization methods should the analyst use?
An analyst needs to join two tables of data together for analysis. All the names and cities in the first table should be joined with the corresponding ages in the second table, if applicable.
Which of the following is the correct join the analyst should complete, and how many total rows will be in one table?
A database administrator is required to mask certain table columns containing PII in order to comply with the company privacy policy. Which of the following are the most likely types of information the administrator should mask? (Select two).
An analyst notices changes in sales ratios when analyzing a quarterly report. Which of the following is the analyst conducting?
A data set has the following values:
Which of the following is the best reason for cleansing the data?
An analyst needs to create an analytics dashboard for an employee intranet site to improve the search functionality, display relevant information, and maintain an updated FAQ page. Which of the following visualizations would best represent what employees are searching for?
A data analyst must fulfill a request for information that is needed weekly and should be automatically emailed to a specific set of users. Which of the following types of reports should the analyst recommend?
A data set for sales per month includes the following data:
Which of the following cleaning and profiling methods should be applied to the data set?
An analyst develops an IT document and needs to describe the technical terms used in the document. Which of the following is where the analyst should include descriptions of the technical terms?
A gambler thinks that a coin is fair and is equally likely to turn up heads or tails when the coin is flipped. Which of the following tests should the gambler use to test this hypothesis?
A data analyst wants to create "Income Categories" that would be calculated based on the existing variable "Income". The "Income Categories" would be as follows:
Income category 1: less than $1.
Income category 2: more than $1 and less than $20,000.
Income category 3: more than $20,001 and less than $40,000.
Income category 4: more than $40,001.
Which of the following data manipulation techniques should the data analyst use to create "Income Categories"?
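Creating a new categorical variable from an existing numeric one can be sketched in Python as below. The stated boundaries leave small gaps (for example, exactly $1, or values between $20,000 and $20,001), so this sketch assumes each gap belongs to the lower category. Both that choice and the function name `income_category` are illustrative assumptions.

```python
def income_category(income: float) -> int:
    """Map an income to a category number from 1 to 4.

    Assumption: boundary and gap values fall into the lower category.
    """
    if income < 1:
        return 1
    if income <= 20_000:
        return 2
    if income <= 40_000:
        return 3
    return 4


# Hypothetical incomes used only to exercise each band.
sample_incomes = [0.5, 15_000, 30_000, 50_000]
categories = [income_category(x) for x in sample_incomes]
```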
A data analyst reviews the following data set:
Which of the following is the range value?
You are working with a dataset and want to change the names of categories that you used for different types of books.
What term best describes this action?
Which of the following would be used to store unstructured data from different sources?
A database consists of one fact table that is composed of multiple dimensions. Depending on the dimension, each one can be represented by a denormalized table or multiple normalized tables. This structure is an example of a:
A database administrator needs to ensure only approved users can access specific database tables to perform financial functions. Which of the following is the best access control method for the administrator to use?
Which of the following data manipulation techniques is an example of a logical function?
An analyst is building a monthly report for production and wants to ensure the audience is aware of its once-a-month cadence. Which of the following is the MOST important to convey that information?
Which of the following actions should be taken when transmitting data to mitigate the chance of a data leak occurring? (Choose two.)
Emma is working in a data warehouse and finds that a finance fact table links to an organization dimension, which in turn links to a currency dimension that is not linked to the fact table.
What type of design pattern is the data warehouse using?
Which of the following data analysis tools increases the efficiency of data visualizations?