Athena vs Macie
In this post, we discuss two AWS services, Amazon Athena and Amazon Macie, and explain the difference between them.
Athena is an interactive query service that lets you analyze data stored in Amazon S3 buckets directly using standard SQL.
Athena is serverless, so there is no infrastructure to set up or manage, and you can start analyzing data immediately. You do not load data into Athena; the data stays in S3 and Athena queries it in place. To get started, log in to the Athena console, define your schema, and start querying data in supported formats such as CSV, JSON, ORC, and Apache Parquet.
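As a rough sketch of the define-a-schema-then-query flow described above, the Python below holds a table definition over CSV files and a helper that runs a statement and waits for it to finish. It assumes a boto3 Athena client is passed in. The table name `access_logs`, its columns, and the S3 paths are hypothetical placeholders, not names from this post.

```python
import time

# Hypothetical schema: the table, columns, and S3 path are placeholders.
CREATE_TABLE_SQL = """
CREATE EXTERNAL TABLE IF NOT EXISTS access_logs (
    request_time string,
    client_ip    string,
    status_code  int
)
ROW FORMAT DELIMITED FIELDS TERMINATED BY ','
LOCATION 's3://example-log-bucket/access-logs/'
"""


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Start an Athena query, poll until it finishes, and return (id, final state)."""
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            return query_id, state
        time.sleep(poll_seconds)
```

With boto3 installed and credentials configured, something like `run_query(boto3.client("athena"), CREATE_TABLE_SQL, "weblogs", "s3://example-results-bucket/athena/")` would register the table, after which ordinary `SELECT` statements can be passed to the same helper. The database name `weblogs` and the results bucket are again placeholders.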
Because Amazon Athena uses Amazon S3 as its underlying data store, your data is highly available and durable, stored redundantly across multiple facilities and on multiple devices in each facility.
Where to use Athena:
- Analyze or query log files stored in S3.
- Generate business reports from data stored in S3.
- Analyze AWS Cost and Usage Reports.
Amazon Athena is priced per query, and each query is charged based on the amount of data it scans. You can store data in a variety of formats on Amazon S3. If you compress your data, partition it, or convert it to a columnar storage format, you pay less because each query scans less data.
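To make the scan-based pricing concrete, here is a small sketch that estimates the cost of a query from the bytes it scans. The rate of 5.00 USD per TB and the 10 MB per-query minimum are assumptions for illustration; actual rates vary by region, so check the current Athena pricing page. The 5% scan ratio for the columnar copy is also a made-up figure.

```python
BYTES_PER_TB = 1024 ** 4
MIN_BYTES_PER_QUERY = 10 * 1024 ** 2  # assumed 10 MB billing minimum per query


def estimate_query_cost(bytes_scanned, price_per_tb=5.0):
    """Estimate the charge for one query from the bytes it scanned.

    price_per_tb is an assumed rate in USD, not an authoritative price.
    """
    billable = max(bytes_scanned, MIN_BYTES_PER_QUERY)
    return billable / BYTES_PER_TB * price_per_tb


raw_csv_scan = 1 * BYTES_PER_TB
# Hypothetical: a compressed, columnar copy where the query reads 5% of the bytes.
columnar_scan = int(raw_csv_scan * 0.05)

print(f"Uncompressed CSV scan: ${estimate_query_cost(raw_csv_scan):.2f}")
print(f"Columnar scan:         ${estimate_query_cost(columnar_scan):.2f}")
```

The point of the comparison is only that the bill follows the bytes scanned, so any change that lets a query read fewer bytes lowers its cost proportionally.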
Amazon Macie is a security service that uses machine learning (ML) and pattern matching to discover and help protect sensitive data stored in S3.
Macie identifies PII (Personally Identifiable Information) contained in the data stored in your S3 buckets. Examples of PII include:
- Phone number
- Credit card number
- Passport number
- Driver's license number
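To give a feel for what discovering items like these involves, here is a toy pattern-matching sketch in Python. It is not how Macie works internally; it only illustrates the general idea of scanning text for phone-number and card-number shapes, with a Luhn checksum to cut down false positives on card numbers. The patterns and the function names `luhn_valid` and `find_pii` are hypothetical and deliberately simplistic.

```python
import re

# Illustrative patterns only; real detectors are far more thorough.
PHONE_RE = re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b")
CARD_RE = re.compile(r"\b(?:\d[ -]?){13,19}\b")


def luhn_valid(digits):
    """Return True if the digit string passes the Luhn checksum."""
    total = 0
    for index, char in enumerate(reversed(digits)):
        value = int(char)
        if index % 2 == 1:  # double every second digit from the right
            value *= 2
            if value > 9:
                value -= 9
        total += value
    return total % 10 == 0


def find_pii(text):
    """Return a list of (kind, matched_text) pairs found in the text."""
    findings = [("phone", m.group()) for m in PHONE_RE.finditer(text)]
    for m in CARD_RE.finditer(text):
        digits = re.sub(r"\D", "", m.group())
        if 13 <= len(digits) <= 19 and luhn_valid(digits):
            findings.append(("credit_card", m.group().strip()))
    return findings


sample = "Call 555-123-4567 about card 4111 1111 1111 1111 on file."
for kind, value in find_pii(sample):
    print(kind, value)
```

A managed service is useful precisely because hand-written patterns like these miss many formats and produce false positives at scale.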
Amazon Macie is a regional service. It must be enabled region by region, and it lets you view findings across all your accounts within each region. This keeps all analyzed data within its own region, so it never crosses AWS regional boundaries.
Macie can be enabled with a single click in the AWS Management Console or with a single API call. Macie supports multiple accounts through AWS Organizations, so you can enable it across all of your accounts with a few clicks.
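Because Macie is enabled per region, the single API call has to be repeated for each region you care about. The sketch below assumes boto3's `macie2` client and its `enable_macie` operation; the helper name `enable_macie_in_regions` and the region list in the usage note are illustrative choices, not part of any AWS API.

```python
def enable_macie_in_regions(client_factory, regions):
    """Call EnableMacie once per region and report the outcome for each.

    client_factory is expected to behave like boto3.client, i.e. accept
    a service name and a region_name keyword.
    """
    results = {}
    for region in regions:
        client = client_factory("macie2", region_name=region)
        try:
            client.enable_macie(status="ENABLED")
            results[region] = "enabled"
        except Exception as error:  # for example, Macie is already enabled here
            results[region] = f"failed: {error}"
    return results
```

With boto3 installed and credentials configured, `enable_macie_in_regions(boto3.client, ["us-east-1", "eu-west-1"])` would attempt to enable Macie for the current account in those two regions. Organization-wide enablement is handled separately through a delegated administrator account and is not covered by this sketch.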
Where to use Macie:
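- Macie is well suited to supporting PCI DSS compliance and reducing the risk of identity theft, because it shows you where sensitive data such as card numbers is stored.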
With Macie, you pay based on two factors: the number of Amazon S3 buckets in your account per month and the amount of data processed for sensitive data discovery in a given month.
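The two pricing factors combine into a simple sum, sketched below. The function name `estimate_macie_monthly_cost` is hypothetical, and the rates in the example call are placeholders rather than real AWS prices, which vary by region and usage tier.

```python
def estimate_macie_monthly_cost(bucket_count, gb_processed, price_per_bucket, price_per_gb):
    """Add the per-bucket charge and the per-GB sensitive data discovery charge."""
    return bucket_count * price_per_bucket + gb_processed * price_per_gb


# Placeholder rates for illustration only, not actual AWS prices.
estimate = estimate_macie_monthly_cost(
    bucket_count=20,
    gb_processed=500,
    price_per_bucket=0.10,
    price_per_gb=1.00,
)
print(f"Estimated monthly cost: ${estimate:.2f}")
```

In practice the data-processing term usually dominates, so limiting discovery jobs to the buckets and prefixes most likely to hold sensitive data is the main lever for controlling the bill.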