0%
0 votes, 0 avg
16

This quiz randomly generates 30 questions as asked in AWS Certified Machine Learning - Specialty (MLS-C01)

Congratulations!

AWS Certified Machine Learning

AWS Certified Machine Learning - Specialty (MLS-C01)

This quiz randomly generates 30 questions (in 60 mins) as asked in AWS Certified Machine Learning - Specialty (MLS-C01). The real MLS-C01 test has 65 questions and a total time of 180 minutes. Of these, 15 questions are underlined, and only 50 questions are scored. This test randomly generates 30 questions from our question bank. For best results, practice multiple times until you achieve 100% accuracy.

1 / 30

A Machine Learning Specialist is packaging a custom ResNet model into a Docker container so the
company can leverage Amazon SageMaker for training. The Specialist is using Amazon EC2 P3 instances
to train the model and needs to properly configure the Docker container to leverage the NVIDIA GPUs.
What does the Specialist need to do?

2 / 30

When submitting Amazon SageMaker training jobs using one of the built-in algorithms, which common parameters MUST be specified? (Choose three.)

3 / 30

Machine Learning Specialist is building a model to predict future employment rates based on a wide range
of economic factors. While exploring the data, the Specialist notices that the magnitude of the input
features vary greatly. The Specialist does not want variables with a larger magnitude to dominate the
model.
What should the Specialist do to prepare the data for model training?

4 / 30

A Machine Learning Specialist must build out a process to query a dataset on Amazon S3 using Amazon
Athena. The dataset contains more than 800,000 records stored as plaintext CSV files. Each record
contains 200 columns and is approximately 1.5 MB in size. Most queries will span 5 to 10 columns only.
How should the Machine Learning Specialist transform the dataset to minimize query runtime?

5 / 30

A company wants to classify user behavior as either fraudulent or normal. Based on internal research, a
Machine Learning Specialist would like to build a binary classifier based on two features: age of account
and transaction month. The class distribution for these features is illustrated in the figure provided.

Based on this information, which model would have the HIGHEST accuracy?

6 / 30

A Machine Learning Specialist at a company sensitive to security is preparing a dataset for model training.
The dataset is stored in Amazon S3 and contains Personally Identifiable Information (PII).
The dataset:
Must be accessible from a VPC only.
Must not traverse the public internet.
How can these requirements be satisfied?

7 / 30

A large consumer goods manufacturer has the following products on sale:
1. 34 different toothpaste variants
2. 48 different toothbrush variants
3. 43 different mouthwash variants
The entire sales history of all these products is available in Amazon S3. Currently, the company is using
custom-built autoregressive integrated moving average (ARIMA) models to forecast demand for these
products. The company wants to predict the demand for a new product that will soon be launched.
Which solution should a Machine Learning Specialist apply?

8 / 30

A company is using Amazon Polly to translate plaintext documents to speech for automated company
announcements. However, company acronyms are being mispronounced in the current documents.
How should a Machine Learning Specialist address this issue for future documents?

9 / 30

A Machine Learning Specialist is creating a new natural language processing application that processes a
dataset comprised of 1 million sentences. The aim is to then run Word2Vec to generate embeddings of the
sentences and enable different types of predictions.
Here is an example from the dataset:
"The quck BROWN FOX jumps over the lazy dog."
Which of the following are the operations the Specialist needs to perform to correctly sanitize and prepare
the data in a repeatable manner? (Choose three.)

10 / 30

An agency collects census information within a country to determine healthcare and social program needs
by province and city. The census form collects responses for approximately 500 questions from each
citizen.
Which combination of algorithms would provide the appropriate insights? (Select TWO.)

11 / 30

A city wants to monitor its air quality to address the consequences of air pollution. A Machine Learning
Specialist needs to forecast the air quality in parts per million of contaminates for the next 2 days in the
city. As this is a prototype, only daily data from the last year is available.
Which model is MOST likely to provide the best results in Amazon SageMaker?

12 / 30

An office security agency conducted a successful pilot using 100 cameras installed at key locations within

the main office. Images from the cameras were uploaded to Amazon S3 and tagged using Amazon

Rekognition, and the results were stored in Amazon ES. The agency is now looking to expand the pilot into

a full production system using thousands of video cameras in its office locations globally. The goal is to

identify activities performed by non-employees in real time

Which solution should the agency consider?

13 / 30

A Data Scientist is developing a machine learning model to classify whether a financial transaction is
fraudulent. The labeled data available for training consists of 100,000 non-fraudulent observations and
1,000 fraudulent observations.
The Data Scientist applies the XGBoost algorithm to the data, resulting in the following confusion matrix
when the trained model is applied to a previously unseen validation dataset. The accuracy of the model is
99.1%, but the Data Scientist has been asked to reduce the number of false negatives.
Which combination of steps should the Data Scientist take to reduce the number of false positive
predictions by the model? (Choose two.)

14 / 30

Which of the following metrics should a Machine Learning Specialist generally use to compare/evaluate machine learning classification models against each other?

15 / 30

A Machine Learning Specialist has created a deep learning neural network model that performs well on the
training data but performs poorly on the test data.
Which of the following methods should the Specialist consider using to correct this? (Choose three.)

16 / 30

A Machine Learning Specialist is configuring Amazon SageMaker so multiple Data Scientists can access
notebooks, train models, and deploy endpoints. To ensure the best operational performance, the Specialist
needs to be able to track how often the Scientists are deploying models, GPU and CPU utilization on the
deployed SageMaker endpoints, and all errors that are generated when an endpoint is invoked.
Which services are integrated with Amazon SageMaker to track this information? (Choose two.)

17 / 30

A monitoring service generates 1 TB of scale metrics record data every minute. A Research team performs
queries on this data using Amazon Athena. The queries run slowly due to the large volume of data, and the
team requires better performance. How should the records be stored in Amazon S3 to improve query performance?

18 / 30

A Machine Learning Specialist deployed a model that provides product recommendations on a company's
website. Initially, the model was performing very well and resulted in customers buying more products on
average. However, within the past few months, the Specialist has noticed that the effect of product
recommendations has diminished and customers are starting to return to their original habits of spending
less. The Specialist is unsure of what happened, as the model has not changed from its initial deployment
over a year ago.
Which method should the Specialist try to improve model performance?

19 / 30

A Machine Learning Specialist is designing a system for improving sales for a company. The objective is to
use the large amount of information the company has on users' behavior and product preferences to
predict which products users would like based on the users' similarity to other users.
What should the Specialist do to meet this objective?

20 / 30

An interactive online dictionary wants to add a widget that displays words used in similar contexts. A
Machine Learning Specialist is asked to provide word features for the downstream nearest neighbor model
powering the widget.
What should the Specialist do to meet these requirements?

21 / 30

Machine Learning Specialist is training a model to identify the make and model of vehicles in images. The
Specialist wants to use transfer learning and an existing model trained on images of general objects. The
Specialist collated a large custom dataset of pictures containing different vehicle makes and models.
What should the Specialist do to initialize the model to re-train it with the custom data?

22 / 30

A Data Scientist wants to gain real-time insights into a data stream of GZIP files.
Which solution would allow the use of SQL to query the stream with the LEAST latency?

23 / 30

A company is running a machine learning prediction service that generates 100 TB of predictions every
day. A Machine Learning Specialist must generate a visualization of the daily precision-recall curve from
the predictions, and forward a read-only version to the Business team.
Which solution requires the LEAST coding effort?

24 / 30

A Machine Learning Specialist working for an online fashion company wants to build a data ingestion
solution for the company's Amazon S3-based data lake.
The Specialist wants to create a set of ingestion mechanisms that will enable future capabilities comprised
of:
Real-time analytics
Interactive analytics of historical data
Clickstream analytics
Product recommendations
Which services should the Specialist use?

25 / 30

An online reseller has a large, multi-column dataset with one column missing 30% of its data. A Machine
Learning Specialist believes that certain columns in the dataset could be used to reconstruct the missing
data.
Which reconstruction approach should the Specialist use to preserve the integrity of the dataset?

26 / 30

A company is observing low accuracy while training on the default built-in image classification algorithm in
Amazon SageMaker. The Data Science team wants to use an Inception neural network architecture
instead of a ResNet architecture.
Which of the following will accomplish this? (Choose two.)

27 / 30

A Mobile Network Operator is building an analytics platform to analyze and optimize a company's
operations using Amazon Athena and Amazon S3. The source systems send data in .CSV format in real time. The Data Engineering team wants to transform the data to the Apache Parquet format before storing it on Amazon S3. Which solution takes the LEAST effort to implement?

28 / 30

A Machine Learning Specialist is building a model that will perform time series forecasting using Amazon
SageMaker. The Specialist has finished training the model and is now planning to perform load testing on
the endpoint so they can configure Auto Scaling for the model variant.
Which approach will allow the Specialist to review the latency, memory utilization, and CPU utilization
during the load test?

29 / 30

The displayed graph is from a forecasting model for testing a time series.

Considering the graph only, which conclusion should a Machine Learning Specialist make about the
behavior of the model?

 

30 / 30

A Machine Learning Specialist receives customer data for an online shopping website. The data includes
demographics, past visits, and locality information. The Specialist must develop a machine learning
approach to identify the customer shopping patterns, preferences, and trends to enhance the website-for
better service and smart recommendations.
Which solution should the Specialist recommend?

Your score is

0%

Scroll to Top