As a data scientist, you are working on a global health dataset that has data from more than 50 countries. You want to encode three features, such as 'countries', 'race', and 'body organ' as categories. Which option would you use to encode the categorical feature?
You’re going to create an Oracle Cloud Infrastructure Anomaly Detection model for multivariate data. Where do you need to store the training data?
For your next data science project, you need access to public geospatial images. Which Oracle Cloud service provides free access to those images?
Six months ago, you created and deployed a model that predicts customer churn for a call centre. Initially, it was yielding quality predictions. However, over the last two months, users are questioning the credibility of the predictions. Which TWO methods would you employ to verify the accuracy of the model?
You are a data scientist using Oracle AutoML to produce a model and you are evaluating the score metric for the model. Which of the following TWO prevailing metrics would you use for evaluating a multiclass classification model?
You are a data scientist building a pipeline in the Oracle Cloud Infrastructure (OCI) Data Science service for your machine learning project. You want to optimize the pipeline completion time by running some steps in parallel. Which statement is true about running pipeline steps in parallel?
You are a data scientist designing an air traffic control model, and you choose to leverage Oracle AutoML. You understand that the Oracle AutoML pipeline consists of multiple stages and automatically operates in a certain sequence. What is the correct sequence for the Oracle AutoML pipeline?
Which TWO statements about Oracle Cloud Infrastructure (OCI) Open Data service are true?
You are a data scientist leveraging Oracle Cloud Infrastructure (OCI) Data Science to create a model and need some additional Python libraries for processing genome sequencing data. Which of the following THREE statements are correct with respect to installing additional Python libraries to process the data?
You are a computer vision engineer building an image recognition model. You decide to use Oracle Data Labeling to annotate your image data. Which of the following THREE are possible ways to annotate an image in Data Labeling?
You are a data scientist leveraging Oracle Cloud Infrastructure (OCI) to create a model and need some additional Python libraries for processing genome sequencing data. Which of the following THREE statements are correct with respect to installing additional Python libraries to process the data?
Which of the following programming languages are most widely used by data scientists?
When preparing your model artifact to save it to the Oracle Cloud Infrastructure (OCI) DataScience model catalog, you create a score.py file. What is the purpose of the score.py file?
Which of the following TWO non-open source JupyterLab extensions has Oracle Cloud Infrastructure (OCI) Data Science developed and added to the notebook session experience?
You are given the task of writing a program that sorts document images by language. Which Oracle service would you use?
A bike sharing platform has collected user commute data for the past 3 years. For increasing profitability and making useful inferences, a machine learning model needs to be built from the accumulated data. Which of the following options has the correct order of the required machine learning tasks for building a model?
You are a data scientist working for a manufacturing company. You have developed a forecasting model to predict the sales demand in the upcoming months. You created a model artifact that contained custom logic requiring third-party libraries. When you deployed the model, it failed to run because you did not include all the third-party dependencies in the model artifact. What file should be modified to include the missing libraries?
You loaded data into Oracle Cloud Infrastructure (OCI) Data Science. To transform the data, you want to use the Accelerated Data Science (ADS) SDK. When you applied the get_recommendations() tool to the ADSDataset object, it showed you user-detected issues with all the recommended changes to apply to the dataset. Which option should you use to apply all the recommended transformations at once?
You are a data scientist; you use the Oracle Cloud Infrastructure (OCI) Language service to train custom models. Which types of custom models can be trained?
Which OCI service provides a scalable environment for developers and data scientists to run Apache Spark applications at scale?
The Oracle AutoML pipeline automates hyperparameter tuning by training the model with different parameters in parallel. You have created an instance of Oracle AutoML as oracle_automl and now you want an output with all the different trials performed by Oracle AutoML. Which of the following commands gives you the results of all trials?
You are working as a data scientist for a healthcare company. They decided to analyze the data to find patterns in a large volume of electronic medical records. You are asked to build a PySpark solution to analyze these records in a JupyterLab notebook. What is the order of recommended steps to develop a PySpark application in OCI Data Science?
You are a data scientist working inside a notebook session and you attempt to pip install a package from a public repository that is not included in your conda environment. After running this command, you get a network timeout error. What might be missing from your network configuration?
Which of these protects customer data at rest and in transit in a way that allows customers to meet their security and compliance requirements for cryptographic algorithms and key management?
Which Oracle Data Safe feature minimizes the amount of personal data and allows internal test, development, and analytics teams to operate with reduced risk?
You want to create a user group for a team of external data science consultants. The consultants should only have the ability to see Data Science resource details but not have the ability to create, delete, or update Data Science resources. What verb should you write in the policy?
Which model has an open-source, open model format that allows you to run machine learning models on different platforms?
Which OCI service provides a managed Kubernetes service for deploying, scaling, and managing containerized applications?
You realize that your model deployment is about to reach its utilization limit. What would you do to avoid the issue before requests start to fail? Pick THREE.
You are a data scientist leveraging the Oracle Cloud Infrastructure (OCI) Language AI service for various types of text analyses. Which TWO capabilities can you utilize with this tool?
You are using Oracle Cloud Infrastructure (OCI) Anomaly Detection to train a model to detect anomalies in pump sensor data. How does the required False Alarm Probability setting affect an anomaly detection model?
You have configured the Management Agent on an Oracle Cloud Infrastructure (OCI) Linux instance for log ingestion purposes. Which is a required configuration for OCI Logging Analytics service to collect data from multiple logs of this instance?
You want to build a multistep machine learning workflow by using the Oracle Cloud Infrastructure (OCI) Data Science Pipeline feature. How would you configure the conda environment to run a pipeline step?
You are a researcher who requires access to large datasets. Which OCI service would you use?