{"id":3587,"date":"2019-11-26T08:46:06","date_gmt":"2019-11-26T08:46:06","guid":{"rendered":"https:\/\/prwatech.in\/blog\/?p=3587"},"modified":"2024-03-29T06:08:34","modified_gmt":"2024-03-29T06:08:34","slug":"amazon-athena","status":"publish","type":"post","link":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/","title":{"rendered":"Amazon Athena"},"content":{"rendered":"<h1><span style=\"color: #ff6600;\">What is AWS Athena?<\/span><\/h1>\n<p><a href=\"https:\/\/prwatech.in\/aws-training-institute-in-pune\/\">AWS<\/a> Athena is a serverless, fully managed interactive query service for analyzing\u00a0Amazon Simple Storage Service (S3)\u00a0data using standard SQL. It supports table creation, partitioning, Snappy compression, and columnar formats such as the <a href=\"https:\/\/www.youtube.com\/channel\/UCwAaWqnH2MqikDMpb1jBspw\">Parquet file<\/a> format.<\/p>\n<h3><span style=\"color: #0000ff;\">Introduction<\/span><\/h3>\n<p>Amazon launched Athena on November 20, 2016. This serverless query service lets you analyze data with standard SQL.<\/p>\n<p>With the <a href=\"https:\/\/prwatech.in\/aws-training-institute-in-pune\/\">AWS<\/a> Management Console, users can point Athena at data stored in Amazon S3 and run standard SQL queries to get results in seconds.<\/p>\n<p>Amazon Athena has no infrastructure to set up or manage, and customers pay only for the queries they run.<\/p>\n<p>Amazon Athena scales automatically and executes queries in parallel, providing fast results even with large datasets and complex queries.<\/p>\n<h3><span style=\"color: #ff6600;\">Why Amazon Athena?<\/span><\/h3>\n<p>In the current Big Data scenario, data grows day by day, and we want to keep all of it in our data lake. But we don\u2019t actually need expensive Redshift nodes running all the time. This led us to find Presto. 
Presto is a distributed SQL query engine designed for analytic queries. Presto decouples data from its processing: no data is stored in Presto, so it reads it from elsewhere, e.g. S3. Since S3 storage is really cheap, it makes a lot of sense to use it as the storage system for your data lake.<\/p>\n<p>Amazon Athena is built on Presto and supports standard SQL syntax, which makes it easier for our data analysts to use. Do note, however, that its SQL dialect differs in some respects from others, e.g. Redshift\u2019s SQL.<\/p>\n<p><strong>Partitioning of Data:<\/strong><\/p>\n<p>By partitioning your data, you can restrict the amount of data scanned by each query, thus improving performance and reducing cost.<\/p>\n<p>Athena leverages <strong><a href=\"https:\/\/prwatech.in\/blog\/hadoop\/hadoop-hive-partitioning\/\">Hive<\/a> <\/strong>for partitioning data.<\/p>\n<p>You can partition your data by any key.<\/p>\n<p><strong>Queries:<\/strong><\/p>\n<p>You can query geospatial data.<\/p>\n<p>You can query different kinds of logs as your datasets.<\/p>\n<p>Athena stores query results in S3.<\/p>\n<p>Athena retains query history for 45 days.<\/p>\n<p>Athena does not support stored procedures. Early releases also lacked user-defined functions and \u201cINSERT INTO\u201d statements, both of which AWS has since added.<\/p>\n<h3><span style=\"color: #0000ff;\">Features of Athena<\/span><\/h3>\n<p>Athena is one of the best services offered by AWS. It has several features that make it suitable for analyzing your data. Let\u2019s have a look at them:<\/p>\n<p><strong>Easy Implementation:<\/strong>\u00a0Athena requires no installation and can be accessed directly from the AWS Console.<\/p>\n<p><strong>Serverless:<\/strong>\u00a0The end user does not have to deal with configuration, scaling, or failures, as Athena is a serverless service. 
It takes care of everything on its own.<\/p>\n<p><strong>Pay per query:<\/strong> You are charged only for the queries you run, based on the amount of data scanned per query.<\/p>\n<p><strong>Fast:<\/strong>\u00a0Athena is a high-speed analytics tool that can run even complex queries in relatively little time by splitting them into simpler ones, running those in parallel, and merging the results to produce the desired output.<\/p>\n<p><strong>Secure:<\/strong>\u00a0Using AWS Identity and Access Management (IAM) policies, Athena gives you complete control over the data set.<\/p>\n<p><strong>High availability:<\/strong>\u00a0As an AWS service, Athena is highly available, and users can run queries around the clock.<\/p>\n<p><strong>Integration:<\/strong>\u00a0The best feature of Athena is its integration with AWS Glue, an ETL service.<\/p>\n<h3><span style=\"color: #ff6600;\">AWS Glue<\/span><\/h3>\n<p>AWS Glue is a fully managed ETL service that makes it easy for customers to prepare and load data for analytics. You can build and run an ETL job in the AWS Management Console with a few clicks. You point AWS Glue at your data stored on AWS, and it discovers the data and stores the associated metadata, such as schema and table definitions, in the AWS Glue Data Catalog. 
Once cataloged, your data is immediately searchable, queryable, and available for ETL.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-3604\" src=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/g.jpg\" alt=\"Amazon Athena\" width=\"850\" height=\"444\" srcset=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/g.jpg 310w, https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/g-300x157.jpg 300w\" sizes=\"auto, (max-width: 850px) 100vw, 850px\" \/><\/p>\n<p><strong>Benefits of AWS Glue:<\/strong><\/p>\n<p>AWS Glue is integrated with a wide range of AWS services, which means less hassle for you while onboarding.<\/p>\n<p>AWS Glue is serverless, i.e. there is no infrastructure to provision or manage.<\/p>\n<p>You pay only for the resources used to run your jobs.<\/p>\n<h3><span style=\"color: #ff6600;\"><strong>Amazon QuickSight<\/strong><\/span><\/h3>\n<p>Amazon QuickSight is a fast, cloud-powered BI service that makes it easy to deliver insights to everyone in the organization. As a fully managed service, QuickSight lets you easily create and publish interactive dashboards that include ML insights. Dashboards can be accessed from any device and embedded into your applications, websites, and portals. With pay-per-session pricing, you can give everyone access to the data they need while paying only for what you use.<\/p>\n<p><strong>Some of the major benefits provided by Amazon QuickSight are listed below:<\/strong><\/p>\n<p>Pay only for what you use<\/p>\n<p>Scale from 10 users to 10,000<\/p>\n<p>Embed self-service data analytics<\/p>\n<p>Build end-to-end BI solutions<\/p>\n<h3><span style=\"color: #ff6600;\">How does AWS Athena work?<\/span><\/h3>\n<p>Athena works directly with S3 data. It uses a distributed SQL engine, Presto, for running queries. 
It uses\u00a0<strong>Apache Hive<\/strong>\u00a0to create and alter tables and partitions.<\/p>\n<p>Let\u2019s have a look at the prerequisites to start working with Athena:<\/p>\n<p>You must have an AWS account.<\/p>\n<p>Enable your account to export your cost and usage data into an S3 bucket.<\/p>\n<p>Prepare buckets for Athena to connect to.<\/p>\n<p>AWS creates manifest files using metadata every time it writes to the bucket. It creates a folder named Athena inside the technology-aws-billing-data bucket, which contains only the data.<\/p>\n<p>To simplify the setup, we use a single region: us-west-2.<\/p>\n<p>The final step is downloading the credentials for the new IAM user. These credentials map directly to the database credentials used to connect:<\/p>\n<table width=\"583\">\n<tbody>\n<tr>\n<td width=\"292\"><strong>Parameter<\/strong><\/td>\n<td width=\"291\"><strong>Value<\/strong><\/td>\n<\/tr>\n<tr>\n<td width=\"292\">Database host<\/td>\n<td width=\"291\">athena.us-west-2.amazonaws.com<\/td>\n<\/tr>\n<tr>\n<td width=\"292\">Database username<\/td>\n<td width=\"291\">IAM username<\/td>\n<\/tr>\n<tr>\n<td width=\"292\">Database password<\/td>\n<td width=\"291\">Secret Access Key<\/td>\n<\/tr>\n<tr>\n<td width=\"292\">Database name<\/td>\n<td width=\"291\">Access Key ID<\/td>\n<\/tr>\n<tr>\n<td width=\"292\">Database port<\/td>\n<td width=\"291\">443<\/td>\n<\/tr>\n<tr>\n<td width=\"292\">S3 staging directory<\/td>\n<td width=\"291\">s3:\/\/aws-athena-query-results-technology\/<\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n<h3><span style=\"color: #0000ff;\"><strong>Conclusion<\/strong><\/span><\/h3>\n<p>With public cloud providers offering service-based analytics such as Amazon Athena, businesses can do more analysis without the expensive complications that come with home-built analytics tools.<\/p>\n<p>With its serverless architecture and ANSI SQL support, Amazon Athena makes data queries quick to set up, easy to use, and fast to run. 
The pay-per-use model of Amazon Athena makes it affordable to run analytics. Since Athena works with Amazon Simple Storage Service (S3), and therefore comes with the scalability, durability, and reliability of object storage, it is well suited to running analytics workloads.<\/p>\n<p style=\"text-align: center;\"><span style=\"color: #0000ff;\"><strong>#Last but not least, always ask for help!<\/strong><\/span><\/p>\n<p><strong>\u00a0<\/strong><\/p>\n","protected":false},"excerpt":{"rendered":"<p>What is AWS Athena? AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation, Snappy compression, partitioning, and more. It is an interactive query service to analyze\u00a0Amazon Simple Storage Service (S3)\u00a0data using standard SQL. Introduction Amazon launched Athena on November 20, 2016, and this serverless [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":3588,"comment_status":"closed","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[1661,31],"tags":[390],"class_list":["post-3587","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-amazon-athena","category-aws","tag-amazon-athena"],"acf":[],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v25.7 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>What is AWS Athena? What to Know More About it?<\/title>\n<meta name=\"description\" content=\"AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation.\" \/>\n<meta name=\"robots\" content=\"noindex, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"What is AWS Athena? 
What to Know More About it?\" \/>\n<meta property=\"og:description\" content=\"AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation.\" \/>\n<meta property=\"og:url\" content=\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/\" \/>\n<meta property=\"og:site_name\" content=\"Prwatech\" \/>\n<meta property=\"article:publisher\" content=\"https:\/\/www.facebook.com\/prwatech.in\/\" \/>\n<meta property=\"article:published_time\" content=\"2019-11-26T08:46:06+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2024-03-29T06:08:34+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png\" \/>\n\t<meta property=\"og:image:width\" content=\"500\" \/>\n\t<meta property=\"og:image:height\" content=\"500\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/png\" \/>\n<meta name=\"author\" content=\"Prwatech\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:creator\" content=\"@Eduprwatech\" \/>\n<meta name=\"twitter:site\" content=\"@Eduprwatech\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Prwatech\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"5 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/\",\"url\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/\",\"name\":\"What is AWS Athena? 
What to Know More About it?\",\"isPartOf\":{\"@id\":\"https:\/\/prwatech.in\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png\",\"datePublished\":\"2019-11-26T08:46:06+00:00\",\"dateModified\":\"2024-03-29T06:08:34+00:00\",\"author\":{\"@id\":\"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3\"},\"description\":\"AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation.\",\"breadcrumb\":{\"@id\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#primaryimage\",\"url\":\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png\",\"contentUrl\":\"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png\",\"width\":500,\"height\":500},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/prwatech.in\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Amazon Athena\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/prwatech.in\/blog\/#website\",\"url\":\"https:\/\/prwatech.in\/blog\/\",\"name\":\"Prwatech\",\"description\":\"Share Ideas, Start Something 
Good.\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/prwatech.in\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3\",\"name\":\"Prwatech\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/prwatech.in\/blog\/#\/schema\/person\/image\/\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g\",\"caption\":\"Prwatech\"},\"url\":\"https:\/\/prwatech.in\/blog\/author\/prwatech123\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"What is AWS Athena? What to Know More About it?","description":"AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation.","robots":{"index":"noindex","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"og_locale":"en_US","og_type":"article","og_title":"What is AWS Athena? 
What to Know More About it?","og_description":"AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation.","og_url":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/","og_site_name":"Prwatech","article_publisher":"https:\/\/www.facebook.com\/prwatech.in\/","article_published_time":"2019-11-26T08:46:06+00:00","article_modified_time":"2024-03-29T06:08:34+00:00","og_image":[{"width":500,"height":500,"url":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png","type":"image\/png"}],"author":"Prwatech","twitter_card":"summary_large_image","twitter_creator":"@Eduprwatech","twitter_site":"@Eduprwatech","twitter_misc":{"Written by":"Prwatech","Est. reading time":"5 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/","url":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/","name":"What is AWS Athena? 
What to Know More About it?","isPartOf":{"@id":"https:\/\/prwatech.in\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#primaryimage"},"image":{"@id":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#primaryimage"},"thumbnailUrl":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png","datePublished":"2019-11-26T08:46:06+00:00","dateModified":"2024-03-29T06:08:34+00:00","author":{"@id":"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3"},"description":"AWS Athena is a code-free, fully automated, fully managed, data pipeline that performs database automation, Parquet file conversion, table creation.","breadcrumb":{"@id":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#primaryimage","url":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png","contentUrl":"https:\/\/prwatech.in\/blog\/wp-content\/uploads\/2019\/11\/White-and-Grey-Laurel-Hipster-Logo.png","width":500,"height":500},{"@type":"BreadcrumbList","@id":"https:\/\/prwatech.in\/blog\/aws\/amazon-athena\/amazon-athena\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/prwatech.in\/blog\/"},{"@type":"ListItem","position":2,"name":"Amazon Athena"}]},{"@type":"WebSite","@id":"https:\/\/prwatech.in\/blog\/#website","url":"https:\/\/prwatech.in\/blog\/","name":"Prwatech","description":"Share Ideas, Start Something 
Good.","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/prwatech.in\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/prwatech.in\/blog\/#\/schema\/person\/db90baff7744090b2288bbc98fea87f3","name":"Prwatech","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/prwatech.in\/blog\/#\/schema\/person\/image\/","url":"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/c00bafc1b04045f31eda917de39891456c44fa47c092b9bb6be0f860a3a30a2f?s=96&d=mm&r=g","caption":"Prwatech"},"url":"https:\/\/prwatech.in\/blog\/author\/prwatech123\/"}]}},"_links":{"self":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts\/3587","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/comments?post=3587"}],"version-history":[{"count":19,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts\/3587\/revisions"}],"predecessor-version":[{"id":11170,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/posts\/3587\/revisions\/11170"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/media\/3588"}],"wp:attachment":[{"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/media?parent=3587"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v2\/categories?post=3587"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/prwatech.in\/blog\/wp-json\/wp\/v
2\/tags?post=3587"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}