Vivid AWS-Certified-Big-Data-Specialty Item Pool 2021

We provide real AWS-Certified-Big-Data-Specialty exam questions and answers braindumps in two formats. Download PDF & Practice Tests. Pass Amazon AWS-Certified-Big-Data-Specialty Exam quickly & easily. The AWS-Certified-Big-Data-Specialty PDF type is available for reading and printing. You can print more and practice many times. With the help of our Amazon AWS-Certified-Big-Data-Specialty dumps pdf and vce product and material, you can easily pass the AWS-Certified-Big-Data-Specialty exam.

Online AWS-Certified-Big-Data-Specialty free questions and answers of New Version:

NEW QUESTION 1
Customers have recently been complaining that your web application has randomly stopped responding. During a deep dive of your logs, the team has discovered a major bug in your Java web application. This bug is causing a memory leak that eventually causes the application to crash.
Your web application runs on Amazon EC2 and was built with AWS CloudFormation.
Which techniques should you see to help detect theses problems faster, as well as help eliminate the server’s unresponsiveness? Choose 2 answers

  • A. Update your AWS CloudFormation configuration and enable a CustomResource that uses cfn- signal to detect memory leaks
  • B. Update your CloudWatch metric granularity config for all Amazon EC2 memory metrics to support five-second granularit
  • C. Create a CloudWatch alarm that triggers an Amazon SNS notification to page your team when the application memory becomes too large
  • D. Update your AWS CloudFormation configuration to take advantage of Auto Scaling group
  • E. Configure an Auto Scaling group policy to trigger off your custom CloudWatch metrics
  • F. Create a custom CloudWatch metric that you push your JVM memory usage to create a CloudWatch alarm that triggers an Amazon SNS notification to page your team when the application memory usage becomes too large
  • G. Update your AWS CloudFormation configuration to take advantage of CloudWatch metrics Agen
  • H. Configure the CloudWatch Metrics Agent to monitor memory usage and trigger an Amazon SNS alarm

Answer: CD

NEW QUESTION 2
A company is using Amazon Machine Learning as part of a medical software application. The application will predict the most likely blood type for a patient based on a variety of other clinical tests that are available when blood type knowledge is unavailable.
What is the appropriate model choice and target attribute combination for the problem?

  • A. Multi-class classification model with a categorical target attribute
  • B. Regression model with a numeric target attribute
  • C. Binary Classification with a categorical target attribute
  • D. K-Nearest Neighbors model with a multi-class target attribute

Answer: C

NEW QUESTION 3
A city has been collecting data on its public bicycle share program for the past three years. The SPB
dataset currently on Amazon S3. The data contains the following data points:
• Bicycle organization points
• Bicycle destination points
• Mileage between the points
• Number of bicycle slots available at the station (which is variable based on the station location)
• Number of slots available and taken at each station at a given time
The program has received additional funds to increase the number of bicycle stations, available. All data is regularly archived to Amazon Glacier.
The new bicycle station must be located to provide the most riders access to bicycles. How should this task be performed?

  • A. Move the data from Amazon S3 into Amazon EBS-backed volumes and EC2 Hardoop with spot instances to run a Spark job that performs a stochastic gradient descent optimization.
  • B. Use the Amazon Redshift COPY command to move the data from Amazon S3 into RedShift and platform a SQL query that outputs the most popular bicycle stations.
  • C. Persist the data on Amazon S3 and use a transits EMR cluster with spot instances to run a Spark streaming job that will move the data into Amazon Kinesis.
  • D. Keep the data on Amazon S3 and use an Amazon EMR based Hadoop cluster with spot insistences to run a spark job that perform a stochastic gradient descent optimization over EMBFS.

Answer: B

NEW QUESTION 4
You have an ASP.NET web application running in Amazon Elastic BeanStalk. Your next version of the
application requires a third-party Windows installer package to be installed on the instance on first boot and before the application launches.
Which options are possible? Choose 2 answer

  • A. In the application’s Global.asax file, run msiexec.exe to install the package using Process.Start() in the Application_Start event handler
  • B. In the source bundle’s .ebextensions folder, create a file with a .config extensio
  • C. In the file, under the “packages” section and “msi” package manager, include the package’s URL
  • D. Launch a new Amazon EC2 instance from the AMI used by the environmen
  • E. Log into the instance, install the package and run syspre
  • F. Create a new AM
  • G. Configure the environment to use the new AMI
  • H. In the environment’s configuration, edit the instances configuration and add the package’s URL to the “Packages” section
  • I. In the source bundle’s .ebextensions folder, create a “Packages” folde
  • J. Place the package in the folder

Answer: BC

NEW QUESTION 5
An Administrator needs to design the event log storage architecture for events from mobile devices.
The event data will be processed by an Amazon EMR cluster daily for aggregated reporting and analytics before being archived.
How should the administrator recommend storing the log data?

  • A. Create an Amazon S3 bucket and write log data into folders by device Execute the EMR job on the device folders
  • B. Create an Amazon DynamoDB table partitioned on the device and sorted on data, write log data to the tabl
  • C. Execute the EMR job on the Amazon DynamoDB table
  • D. Create an Amazon S3 bucket and write data into folders by da
  • E. Execute the EMR job on the daily folder
  • F. Create an Amazon DynamoDB table partitioned on EventID, write log data to tabl
  • G. Execute the EMR job on the table

Answer: C

NEW QUESTION 6
Which data store should the organization choose?

  • A. Amazon Relational Database Service (RDS)
  • B. Amazon Redshift
  • C. Amazon DynamoDB
  • D. Amazon Elasticsearch

Answer: C

NEW QUESTION 7
A media advertising company handles a large number of real-time messages sourced from over 200
websites. The company’s data engineer needs to collect and process records in real time for analysis using Spark Streaming on Amazon Elastic MapReduce (EMR). The data engineer needs to fulfill a corporate mandate to keep ALL raw messages as they are received as a top priority.
Which Amazon Kinesis configuration meets these requirements?

  • A. Publish messages to Amazon Kinesis Firehose backed by Amazon Simple Storage Service (S3). Pull messages off Firehose with Spark Streaming in parallel to persistence to Amazon S3
  • B. Publish messages to Amazon Kinesis Stream
  • C. Pull messages off Stream with Spark Streaming in parallel to AWS messages from Streams to Firehose backed by Amazon Simple Storage Service (S3)
  • D. Publish messages to Amazon Kinesis Firehose backed by Amazon Simple Storage (S3). Use AWS Lambda messages from Firehose to Streams for processing with Spark Streaming
  • E. Publish messages to Amazon Kinesis Streams, pull messages off with Spark Streaming and write data new data to Amazon Simple Storage Service (S3) before and after processing

Answer: D

NEW QUESTION 8
An Amazon Redshift Database is encrypted using KMS. A data engineer needs to use the AWS CLI to
create a KMS encrypted snapshot of the database in another AWS region.
Which three steps should the data engineer take to accomplish this task? (Select Three.)

  • A. Create a new KMS key in the destination region
  • B. Copy the existing KMS key to the destination region
  • C. Use CreateSnapshotCopyGrant to allow Amazon Redshift to use the KMS key created in the destination region
  • D. Use CreateSnapshotCopyGrant to allow Amazon Redshift to use the KMS key from the source region
  • E. In the source, enable cross-region replication and specify the name of the copy grant created
  • F. In the destination region, enable cross-region replication and specify the name of the copy grant created

Answer: BDF

NEW QUESTION 9
A company with a support organization needs support engineers to be able to search historic cases to
provide fast responses on new issues raised. The company has forwarded all support messages into an Amazon Kinesis Stream. This meets a company objective of using only managed services to reduce.
The company needs an appropriate architecture that allows support engineers to search on historic cases can find similar issues and their associated responses.
Which AWS Lambda action is most appropriate?

  • A. Ingest and index the content into an Amazon Elasticsearch domain
  • B. Stem and tokenize the input and store the results into Amazon ElastiCache
  • C. Write data as JSON into Amazon DynamoDB with primary and secondary indexes
  • D. Aggregate feedback is Amazon S3 using a columnar format with partitioning

Answer: A

NEW QUESTION 10
A company needs to monitor the read and write IOPs metrics for their AWS MySQL RDS instance and
send real-time alerts to their operations team. Which AWS services can accomplish this? Choose 2 answers

  • A. Amazon Simple Email Service
  • B. Amazon CloudWatch
  • C. Amazon Simple Queue Service
  • D. Amazon Route 53
  • E. Amazon Simple Notification Service

Answer: BE

NEW QUESTION 11
You are deploying an application to collect votes for a very popular television show. Millions of users
will submit votes using mobile devices. The votes must be collected into a durable, scalable, and highly available data store for real-time public tabulation. Which service should you use?

  • A. Amazon DynamoDB
  • B. Amazon Redshift
  • C. Amazon Kinesis
  • D. Amazon Simple Queue Service

Answer: C

NEW QUESTION 12
An Amazon EMR cluster using EMRFS has access to Megabytes of data on Amazon S3, originating
from multiple unique data sources. The customer needs to query common fields across some of the data sets to be able to perform interactive joins and then display results quickly.
Which technology is most appropriate to enable this capability?

  • A. Presto
  • B. MicroStrategy
  • C. Pig
  • D. R Studio

Answer: A

NEW QUESTION 13
A us-based company is expanding their web presence into Europe. The company wants to extend their AWS infrastructure from Northern Virginia (us-east-1) into the Dublin (eu-west-1) region. Which of the following options would enable an equivalent experience for users on both continents?

  • A. Use a public-facing load balancer per region to load-balancer web traffic, and enable HTTP health checks
  • B. Use a public-facing load balancer per region to load balancer web traffic, and enable sticky sessions
  • C. Use Amazon Route S3, and apply a geolocation routing policy to distribution traffic across both regions
  • D. Use Amazon Route S3, and apply a weighted routing policy to distribute traffic across both regions

Answer: C

NEW QUESTION 14
A social media customer has data from different data sources including RDS running MySQL, RedShift, and Hive on EMR. To support better analysis, the customer needs to be able to analyze data from different data sources and to combine the results.
What is the most cost-effective solution to meet these requirements?

  • A. Load all data from a different database/warehouse to S3. Use Redshift COPY command to copy data to Redshift for analysis.
  • B. Install Presto on the EMR cluster where Hive sit
  • C. Configure MySQL and PostgreSQL connector to select from different data sources in a single query.
  • D. Spin up an Elasticsearch cluste
  • E. Load data from all three data sources and use Kibana to analyze.
  • F. Write a program running on a separate EC2 instance to run queries to three different system
  • G. Aggregate the results after getting the responses from all three systems.

Answer: D

NEW QUESTION 15
A company is centralizing a large number of unencrypted small files rom multiple Amazon S3 buckets. The company needs to verify that the files contain the same data after centralization.
Which method meets the requirements?

  • A. Company the S3 Etags from the source and destination objects
  • B. Call the S3 CompareObjects API for the source and destination objects
  • C. Place a HEAD request against the source and destination objects comparing SIG v4 header
  • D. Compare the size of the source and destination objects

Answer: B

NEW QUESTION 16
When you put objects in Amazon 53, what is the indication that an object was successfully stored?

  • A. A HTTP 200 result code and MD5 checksum, taken together, indicate that the operation was successful
  • B. A success code is inserted into the S3 object metadata
  • C. Amazon S3 is engineered for 99.999999999% durabilit
  • D. Therefore there is no need to confirm that data was inserted.
  • E. Each S3 account has a special bucket named_ s3_log
  • F. Success codes are written to this bucket with a timestamp and checksum

Answer: A

NEW QUESTION 17
Your Devops team is responsible for a multi-tier, Windows-based web application consisting of web servers, Amazon RDS database instances, and a load balancer behind Amazon Route53. You have been asked by your manager to build a cost-effective rolling deployment solution for this web application.
What method should you use?

  • A. Re-deploy your application on an AWS OpsWorks stac
  • B. Use the AWS OpsWorks clone stack feature to allow updates between duplicate stacks
  • C. Re-deploy your application on Elastic BeanStalk and take advantage of Elastic BeanStalk rolling updates
  • D. Re-deploy your application using an AWS CloudFormation template, launch a new AWS CloudFormation stack during each deployment, and then tear down the old stack
  • E. Re-deploy your application using an AWS CloudFormation templat
  • F. Use AWS CloudFormation rolling deployment policies, create a new policy for your AWS CloudFormation stack, and initiate an update stack operation to deploy new code

Answer: D

NEW QUESTION 18
An administrator needs to design a distribution strategy for a star schema in a Redshift cluster. The
administrator needs to determine the optimal distribution style for the tables in the Redshift schema. In which three circumstances would choosing Key-based distribution be most appropriate? (Select three)

  • A. When the administrator needs to optimize a large, slowly changing dimension table
  • B. When the administrator needs to reduce cross-node traffic
  • C. When the administrator needs to optimize the fact table for parity with the number of slices
  • D. When the administrator needs to balance data distribution and collocation of data
  • E. When the administrator needs to take advantage of data locality on a local node of joins and aggregates

Answer: ADE

NEW QUESTION 19
Which of the following requires a custom cloudwatch metric to monitor?

  • A. Memory utilization of an EC2 instance
  • B. CPU utilization of an EC2 instance
  • C. Disk usage activity of an EC2 instance
  • D. Data transfer of an EC2 instance

Answer: A

NEW QUESTION 20
A user is planning to setup infrastructure on AWS for the Christmas sales. The user is planning to use
Auto Scaling based on the schedule for proactive scaling. What advise would you give to the user?

  • A. It is good to schedule now because if the user forgets later on it will not scale up
  • B. The scaling should be setup only one week before Christmas
  • C. Wait till end of November before scheduling the activity
  • D. It is not advisable to use scheduled based scaling

Answer: C

NEW QUESTION 21
You have launched an Amazon Elastic Compute Cloud (EC2) instance into a public subnet with a primary private IP address assigned, an internet gateway is attached to the VPC, and the public route table is configured to send all internet-based internet. Why is the internet unreachable from this instance?

  • A. The Internet gateway security group must allow all outbound traffic
  • B. The instance does not have a public IP address
  • C. The instance “Source/Destination check” property must be enabled
  • D. The instance security group must allow all inbound traffic

Answer: B

NEW QUESTION 22
......

Thanks for reading the newest AWS-Certified-Big-Data-Specialty exam dumps! We recommend you to try the PREMIUM Dumpscollection AWS-Certified-Big-Data-Specialty dumps in VCE and PDF here: http://www.dumpscollection.net/dumps/AWS-Certified-Big-Data-Specialty/ (243 Q&As Dumps)