[Q19-Q41] Exam Questions and Answers for Databricks-Certified-Professional-Data-Engineer Study Guide Questions and Answers!

0 Comments

Rate this post

Exam Questions and Answers for Databricks-Certified-Professional-Data-Engineer Study Guide Questions and Answers!

Databricks Certified Professional Data Engineer Exam Certification Sample Questions and Practice Exam

NO.19 Which of the following features of data lakehouse can help you meet the needs of both workloads?

 
 
 
 
 

NO.20 When you drop a managed table using SQL syntax DROP TABLE table_name how does it impact metadata, history, and data stored in the table?

 
 
 
 
 

NO.21 A new data engineer has started at a company. The data engineer has recently been added to the company’s
Databricks workspace as [email protected]. The data engineer needs to be able to query the table
sales in the database retail. The new data engineer already has been granted USAGE on the database retail.
Which of the following commands can be used to grant the appropriate permissions to the new data engineer?

 
 
 
 
 

NO.22 Which of the following Structured Streaming queries successfully performs a hop from a Silver to Gold table?

 
 
 
 
 

NO.23 A data engineer has developed a code block to perform a streaming read on a data source. The code block is
below:
1. (spark
2. .read
3. .schema(schema)
4. .format(“cloudFiles”)
5. .option(“cloudFiles.format”, “json”)
6. .load(dataSource)
7. )
The code block is returning an error.
Which of the following changes should be made to the code block to configure the block to successfully
perform a streaming read?

 
 
 
 
 

NO.24 You are noticing job cluster is taking 6 to 8 mins to start which is delaying your job to finish on time, what steps you can take to reduce the amount of time cluster startup time

 
 
 
 
 

NO.25 Which of the following benefits does Delta Live Tables provide for ELT pipelines over standard data pipelines
that utilize Spark and Delta Lake on Databricks?

 
 
 
 
 

NO.26 You are looking to process the data based on two variables, one to check if the department is supply chain and second to check if process flag is set to True

 
 
 
 
 

NO.27 When using the complete mode to write stream data, how does it impact the target table?

 
 
 
 
 

NO.28 A data engineer wants to create a relational object by pulling data from two tables. The relational object must
be used by other data engineers in other sessions. In order to save on storage costs, the data engineer wants to
avoid copying and storing physical data.
Which of the following relational objects should the data engineer create?

 
 
 
 
 

NO.29 What is the type of table created when you issue SQL DDL command CREATE TABLE sales (id int, units int)

 
 
 
 
 

NO.30 You were asked to create a table that can store the below data, orderTime is a timestamp but the finance team when they query this data normally prefer the orderTime in date format, you would like to create a calculated column that can convert the orderTime column timestamp datatype to date and store it, fill in the blank to complete the DDL.

 
 
 
 
 

NO.31 Kevin is the owner of the schema sales, Steve wanted to create new table in sales schema called regional_sales so Kevin grants the create table permissions to Steve. Steve creates the new table called regional_sales in sales schema, who is the owner of the table regional_sales

 
 
 
 
 

NO.32 A data engineer is overwriting data in a table by deleting the table and recreating the table. Another data
engineer suggests that this is inefficient and the table should simply be overwritten instead.
Which of the following reasons to overwrite the table instead of deleting and recreating the table is incorrect?

 
 
 
 
 

NO.33 Which of the following statements can successfully read the notebook widget and pass the python variable to a SQL statement in a Python notebook cell?

 
 
 
 
 

NO.34 You had worked with the Data analysts team to set up a SQL Endpoint(SQL warehouse) point so they can easily query and analyze data in the gold layer, but once they started consuming the SQL Endpoint(SQL warehouse) you noticed that during the peak hours as the number of users increase you are seeing queries taking longer to finish, which of the following steps can be taken to resolve the issue?
*Please note Databricks recently renamed SQL endpoint to SQL warehouse.

 
 
 
 
 

NO.35 Which of the following commands results in the successful creation of a view on top of the delta stream(stream on delta table)?

 
 
 
 
 
 

NO.36 Which of the following is a correct statement on how the data is organized in the storage when when managing a DELTA table?

 
 
 
 
 

NO.37 Which of the following statements can be used to test the functionality of code to test number of rows in the table equal to 10 in python?
row_count = spark.sql(“select count(*) from table”).collect()[0][0]

 
 
 
 
 

NO.38 You are asked to write a python function that can read data from a delta table and return the Data-Frame, which of the following is correct?

 
 
 
 
 

NO.39 You are using k-means clustering to classify heart patients for a hospital. You have chosen Patient Sex,
Height, Weight, Age and Income as measures and have used 3 clusters. When you create a pair-wise plot of
the clusters, you notice that there is significant overlap between the clusters. What should you do?

 
 
 
 

NO.40 A data engineer has ingested data from an external source into a PySpark DataFrame raw_df. They need to
briefly make this data available in SQL for a data analyst to perform a quality assurance check on the data.
Which of the following commands should the data engineer run to make this data available in SQL for only
the remainder of the Spark session?

 
 
 
 
 

NO.41 How do you create a delta live tables pipeline and deploy using DLT UI?

 
 
 
 
 

Databricks-Certified-Professional-Data-Engineer certification dumps – Databricks Certification Databricks-Certified-Professional-Data-Engineer guides – 100% valid: https://www.vcedumps.com/Databricks-Certified-Professional-Data-Engineer-examcollection.html


Leave a Reply

Your email address will not be published. Required fields are marked *

Enter the text from the image below