<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Sotaro Hikita &#8211; Noise</title>
	<atom:link href="https://noise.getoto.net/author/sotaro-hikita/feed/" rel="self" type="application/rss+xml" />
	<link>https://noise.getoto.net</link>
	<description>The collective thoughts of the interwebz</description>
	<lastBuildDate>Fri, 09 May 2025 15:50:07 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.2</generator>
	<item>
		<title>Accelerate lightweight analytics using PyIceberg with AWS Lambda and an AWS Glue Iceberg REST endpoint</title>
		<link>https://noise.getoto.net/2025/05/09/accelerate-lightweight-analytics-using-pyiceberg-with-aws-lambda-and-an-aws-glue-iceberg-rest-endpoint/</link>
		
		<dc:creator><![CDATA[Sotaro Hikita]]></dc:creator>
		<pubDate>Fri, 09 May 2025 15:50:07 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[AWS Big Data]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<category><![CDATA[AWS Lambda]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=c32dd913e60aa7e4db151d1bdfacdc70</guid>

					<description><![CDATA[In this post, we demonstrate how PyIceberg, integrated with the AWS Glue Data Catalog and AWS Lambda, provides a lightweight approach to harness Iceberg’s powerful features through intuitive Python interfaces. We show how this integration enables teams to start working with Iceberg tables with minimal setup and infrastructure dependencies.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Manage concurrent write conflicts in Apache Iceberg on the AWS Glue Data Catalog</title>
		<link>https://noise.getoto.net/2025/04/08/manage-concurrent-write-conflicts-in-apache-iceberg-on-the-aws-glue-data-catalog/</link>
		
		<dc:creator><![CDATA[Sotaro Hikita]]></dc:creator>
		<pubDate>Tue, 08 Apr 2025 16:51:54 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[AWS Big Data]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=9108ddd7d874df3b91bb40fc7e67b4b7</guid>

					<description><![CDATA[This post demonstrates how to implement reliable concurrent write handling mechanisms in Iceberg tables. We will explore Iceberg’s concurrency model, examine common conflict scenarios, and provide practical implementation patterns of both automatic retry mechanisms and situations requiring custom conflict resolution logic for building resilient data pipelines. We will also cover the pattern with automatic compaction through AWS Glue Data Catalog table optimization.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Use open table format libraries on AWS Glue 5.0 for Apache Spark</title>
		<link>https://noise.getoto.net/2024/12/04/use-open-table-format-libraries-on-aws-glue-5-0-for-apache-spark/</link>
		
		<dc:creator><![CDATA[Sotaro Hikita]]></dc:creator>
		<pubDate>Wed, 04 Dec 2024 19:04:31 +0000</pubDate>
				<category><![CDATA[AWS Big Data]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=49be600f3d32f1628be585b47349cf4d</guid>

					<description><![CDATA[Open table formats are emerging in the rapidly evolving domain of big data management, fundamentally altering the landscape of data storage and analysis. In earlier posts, we discussed AWS Glue 5.0 for Apache Spark. In this post, we highlight notable updates on Iceberg, Hudi, and Delta Lake in AWS Glue 5.0.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Introducing AWS Glue Data Catalog automation for table statistics collection for improved query performance on Amazon Redshift and Amazon Athena</title>
		<link>https://noise.getoto.net/2024/12/04/introducing-aws-glue-data-catalog-automation-for-table-statistics-collection-for-improved-query-performance-on-amazon-redshift-and-amazon-athena/</link>
		
		<dc:creator><![CDATA[Sotaro Hikita]]></dc:creator>
		<pubDate>Tue, 03 Dec 2024 23:18:24 +0000</pubDate>
				<category><![CDATA[Amazon Redshift]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[announcements]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=30bb5fbfd52bf1327e35259c34a40f23</guid>

					<description><![CDATA[The AWS Glue Data Catalog now automates generating statistics for new tables. These statistics are integrated with the cost-based optimizer (CBO) from Amazon Redshift Spectrum and Amazon Athena, resulting in improved query performance and potential cost savings. In this post, we discuss how the Data Catalog automates table statistics collection and how you can use it to enhance your data platform’s efficiency.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Accelerate query performance with Apache Iceberg statistics on the AWS Glue Data Catalog</title>
		<link>https://noise.getoto.net/2024/07/10/accelerate-query-performance-with-apache-iceberg-statistics-on-the-aws-glue-data-catalog/</link>
		
		<dc:creator><![CDATA[Sotaro Hikita]]></dc:creator>
		<pubDate>Tue, 09 Jul 2024 21:42:44 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[announcements]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=39044e142c2ad3c38b1d8d259710fc35</guid>

					<description><![CDATA[Today, we are pleased to announce a new capability for the AWS Glue Data Catalog: generating column-level aggregation statistics for Apache Iceberg tables to accelerate queries. These statistics are utilized by cost-based optimizer (CBO) in Amazon Redshift Spectrum, resulting in improved query performance and potential cost savings. Apache Iceberg is an open table format that […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 29/93 objects using Memcached
Page Caching using Disk: Enhanced 
Lazy Loading (feed)
Database Caching using Memcached

Served from: noise.getoto.net @ 2026-02-09 02:45:13 by W3 Total Cache
-->