<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Big Data &#8211; Noise</title>
	<atom:link href="https://noise.getoto.net/tag/big-data/feed/" rel="self" type="application/rss+xml" />
	<link>https://noise.getoto.net</link>
	<description>The collective thoughts of the interwebz</description>
	<lastBuildDate>Fri, 17 Oct 2025 18:42:37 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.2</generator>
	<item>
		<title>How and Why Netflix Built a Real-Time Distributed Graph: Part 1 — Ingesting and Processing Data…</title>
		<link>https://noise.getoto.net/2025/10/17/how-and-why-netflix-built-a-real-time-distributed-graph-part-1-ingesting-and-processing-data/</link>
		
		<dc:creator><![CDATA[Netflix Technology Blog]]></dc:creator>
		<pubDate>Fri, 17 Oct 2025 18:42:37 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data-engineering]]></category>
		<category><![CDATA[software-architecture]]></category>
		<guid isPermaLink="false">https://medium.com/p/80113e124acc</guid>

					<description><![CDATA[How and Why Netflix Built a Real-Time Distributed Graph: Part 1 — Ingesting and Processing Data Streams at Internet ScaleAuthors: Adrian Taruc and James DaltonThis is the first entry of a multi-part blog series describing how we built a Real-Time Distr...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Scaling Muse: How Netflix Powers Data-Driven Creative Insights at Trillion-Row Scale</title>
		<link>https://noise.getoto.net/2025/09/23/scaling-muse-how-netflix-powers-data-driven-creative-insights-at-trillion-row-scale/</link>
		
		<dc:creator><![CDATA[Netflix Technology Blog]]></dc:creator>
		<pubDate>Mon, 22 Sep 2025 21:24:20 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data-engineering]]></category>
		<category><![CDATA[Druid]]></category>
		<guid isPermaLink="false">https://medium.com/p/aa9ad326fd77</guid>

					<description><![CDATA[By Andrew Pierce, Chris Thrailkill, Victor ChiapaikeoAt Netflix, we prioritize getting timely data and insights into the hands of the people who can act on them. One of our key internal applications for this purpose is Muse. Muse’s ultimate goal is to ...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Export JMX metrics from Kafka connectors in Amazon Managed Streaming for Apache Kafka Connect with a custom plugin</title>
		<link>https://noise.getoto.net/2025/08/15/export-jmx-metrics-from-kafka-connectors-in-amazon-managed-streaming-for-apache-kafka-connect-with-a-custom-plugin/</link>
		
		<dc:creator><![CDATA[Jaydev Nath]]></dc:creator>
		<pubDate>Fri, 15 Aug 2025 15:51:01 +0000</pubDate>
				<category><![CDATA[Advanced (300)]]></category>
		<category><![CDATA[Amazon Managed Streaming for Apache Kafka (Amazon MSK)]]></category>
		<category><![CDATA[Amazon MSK]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[AWS Analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[MSK Connect]]></category>
		<category><![CDATA[Technical How-to]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=716ec3970d49f2451367c21417de7033</guid>

					<description><![CDATA[In this post, we demonstrate how you can export the JMX metrics for Debezium connector when used with Amazon MSK Connect.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Express brokers for Amazon MSK: Turbo-charged Kafka scaling with up to 20 times faster performance</title>
		<link>https://noise.getoto.net/2025/03/07/express-brokers-for-amazon-msk-turbo-charged-kafka-scaling-with-up-to-20-times-faster-performance/</link>
		
		<dc:creator><![CDATA[Masudur Rahaman Sayem]]></dc:creator>
		<pubDate>Fri, 07 Mar 2025 16:49:21 +0000</pubDate>
				<category><![CDATA[Amazon Managed Streaming for Apache Kafka (Amazon MSK)]]></category>
		<category><![CDATA[Amazon MSK]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Expert (400)]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=94368d817efcd50510497f2fe9ad93be</guid>

					<description><![CDATA[In this post, we walk you through the implementation of MSK Express brokers, highlighting their core features, benefits, and best practices for rapid Kafka scaling.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Let’s Architect! Modern data architectures</title>
		<link>https://noise.getoto.net/2024/11/06/lets-architect-modern-data-architectures-2/</link>
		
		<dc:creator><![CDATA[Luca Mezzalira]]></dc:creator>
		<pubDate>Tue, 05 Nov 2024 22:31:27 +0000</pubDate>
				<category><![CDATA[Amazon Machine Learning]]></category>
		<category><![CDATA[Amazon Simple Storage Service (S3)]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[AWS Big Data]]></category>
		<category><![CDATA[Best practices]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data-mesh]]></category>
		<category><![CDATA[database]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[Let's Architect]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[storage]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=3d228dcaac93c6af4fc415343971de2b</guid>

					<description><![CDATA[Data is the fuel for AI; modern data is even more important for generative AI and advanced data analytics, producing more accurate, relevant, and impactful results. Modern data comes in various forms: real-time, unstructured, or user-generated. Each form requires a different solution. AWS’s data journey began with Amazon Simple Storage Service (Amazon S3) in 2006, […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Modernize your legacy databases with AWS data lakes, Part 2: Build a data lake using AWS DMS data on Apache Iceberg</title>
		<link>https://noise.getoto.net/2024/10/30/modernize-your-legacy-databases-with-aws-data-lakes-part-2-build-a-data-lake-using-aws-dms-data-on-apache-iceberg/</link>
		
		<dc:creator><![CDATA[Shaheer Mansoor]]></dc:creator>
		<pubDate>Wed, 30 Oct 2024 20:15:02 +0000</pubDate>
				<category><![CDATA[Amazon Simple Queue Service (SQS)]]></category>
		<category><![CDATA[Amazon Simple Storage Service (S3)]]></category>
		<category><![CDATA[Apache Iceberg]]></category>
		<category><![CDATA[AWS Big Data]]></category>
		<category><![CDATA[AWS Database Migration Service]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<category><![CDATA[AWS Lambda]]></category>
		<category><![CDATA[AWS Step Functions]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Data Catalog]]></category>
		<category><![CDATA[Data Lake]]></category>
		<category><![CDATA[Events]]></category>
		<category><![CDATA[python]]></category>
		<category><![CDATA[Technical How-to]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=3bd27ad09e40947c19d2f0eab552f51a</guid>

					<description><![CDATA[This is part two of a three-part series where we show how to build a data lake on AWS using a modern data architecture. This post shows how to load data from a legacy database (SQL Server) into a transactional data lake (Apache Iceberg) using AWS Glue. We show how to build data pipelines using AWS Glue jobs, optimize them for both cost and performance, and implement schema evolution to automate manual tasks. To review the first part of the series, where we load SQL Server data into Amazon Simple Storage Service (Amazon S3) using AWS Database Migration Service (AWS DMS), see Modernize your legacy databases with AWS data lakes, Part 1: Migrate SQL Server using AWS DMS.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>How Getir unleashed data democratization using a data mesh architecture with Amazon Redshift</title>
		<link>https://noise.getoto.net/2024/10/23/how-getir-unleashed-data-democratization-using-a-data-mesh-architecture-with-amazon-redshift/</link>
		
		<dc:creator><![CDATA[Asser Moustafa]]></dc:creator>
		<pubDate>Wed, 23 Oct 2024 15:52:23 +0000</pubDate>
				<category><![CDATA[Amazon Redshift]]></category>
		<category><![CDATA[Amazon Redshift data sharing]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[AWS Analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Customer Solutions]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=a2589fe3b9e394784646322e1a26e9ee</guid>

					<description><![CDATA[In this post, we explain how ultrafast delivery pioneer, Getir, unleashed the power of data democratization on a large scale through their data mesh architecture using Amazon Redshift. We start by introducing Getir and their vision—to seamlessly, securely, and efficiently share business data across different teams within the organization for BI, extract, transform, and load (ETL), and other use cases. We’ll then explore how Amazon Redshift data sharing powered the data mesh architecture that allowed Getir to achieve this transformative vision.]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Building a scalable streaming data platform that enables real-time and batch analytics of electric vehicles on AWS</title>
		<link>https://noise.getoto.net/2024/07/17/building-a-scalable-streaming-data-platform-that-enables-real-time-and-batch-analytics-of-electric-vehicles-on-aws/</link>
		
		<dc:creator><![CDATA[Ayush Agrawal]]></dc:creator>
		<pubDate>Wed, 17 Jul 2024 16:53:05 +0000</pubDate>
				<category><![CDATA[Amazon Managed Streaming for Apache Kafka]]></category>
		<category><![CDATA[Amazon Managed Streaming for Apache Kafka (Amazon MSK)]]></category>
		<category><![CDATA[Amazon OpenSearch]]></category>
		<category><![CDATA[Amazon OpenSearch Service]]></category>
		<category><![CDATA[Amazon QuickSight]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Best practices]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Internet of Things]]></category>
		<category><![CDATA[kafka]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=0443fcefa879a8753d6533c3f7f87beb</guid>

					<description><![CDATA[The automobile industry has undergone a remarkable transformation because of the increasing adoption of electric vehicles (EVs). EVs, known for their sustainability and eco-friendliness, are paving the way for a new era in transportation. As environmental concerns and the push for greener technologies have gained momentum, the adoption of EVs has surged, promising to reshape […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Detect and handle data skew on AWS Glue</title>
		<link>https://noise.getoto.net/2024/05/01/detect-and-handle-data-skew-on-aws-glue/</link>
		
		<dc:creator><![CDATA[Salim Tutuncu]]></dc:creator>
		<pubDate>Wed, 01 May 2024 16:27:24 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Apache Spark]]></category>
		<category><![CDATA[AWS Analytics]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<category><![CDATA[Best practices]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Expert (400)]]></category>
		<category><![CDATA[Optimization]]></category>
		<category><![CDATA[Spark]]></category>
		<category><![CDATA[Technical How-to]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=ac6e0a0c43720a34b439d0d7b7faf802</guid>

					<description><![CDATA[AWS Glue is a fully managed, serverless data integration service provided by Amazon Web Services (AWS) that uses Apache Spark as one of its backend processing engines (as of this writing, you can use Python Shell, Spark, or Ray). Data skew occurs when the data being processed is not evenly distributed across the Spark cluster, […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>How the GoDaddy data platform achieved over 60% cost reduction and 50% performance boost by adopting Amazon EMR Serverless</title>
		<link>https://noise.getoto.net/2024/03/12/how-the-godaddy-data-platform-achieved-over-60-cost-reduction-and-50-performance-boost-by-adopting-amazon-emr-serverless/</link>
		
		<dc:creator><![CDATA[Brandon Abear]]></dc:creator>
		<pubDate>Tue, 12 Mar 2024 16:01:54 +0000</pubDate>
				<category><![CDATA[Advanced (300)]]></category>
		<category><![CDATA[Amazon EMR]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[AWS Big Data]]></category>
		<category><![CDATA[Best practices]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Case Study]]></category>
		<category><![CDATA[Customer Solutions]]></category>
		<category><![CDATA[EMR Serverless]]></category>
		<category><![CDATA[serverless]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=62d43bbbe4df3f5053adcaf9f98af6ff</guid>

					<description><![CDATA[This is a guest post co-written with Brandon Abear, Dinesh Sharma, John Bush, and Ozcan IIikhan from GoDaddy. GoDaddy empowers everyday entrepreneurs by providing all the help and tools to succeed online. With more than 20 million customers worldwide, GoDaddy is the place people come to name their ideas, build a professional website, attract customers, […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Sliding window rate limits in distributed systems</title>
		<link>https://noise.getoto.net/2023/12/14/sliding-window-rate-limits-in-distributed-systems/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Thu, 14 Dec 2023 00:00:10 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[distributed-systems]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Frequency capping]]></category>
		<category><![CDATA[Rate-limiting]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/frequency-capping</guid>

					<description><![CDATA[Like many other companies, Grab uses marketing communications to notify users of promotions or other news. If a user receives these notifications from multiple companies, it would be a form of information overload and they might even start considering ...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Road localisation in GrabMaps</title>
		<link>https://noise.getoto.net/2023/11/17/road-localisation-in-grabmaps/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Fri, 17 Nov 2023 00:00:10 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Data processing]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[GrabMaps]]></category>
		<category><![CDATA[hyperlocalisation]]></category>
		<category><![CDATA[Maps]]></category>
		<category><![CDATA[Product]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/road-localisation-grabmaps</guid>

					<description><![CDATA[Introduction

In 2022, Grab achieved self-sufficiency in its Geo services. As part of this transition, one crucial step was moving towards using an internally-developed map tailored specifically to the market in which Grab operates. Now that we have fu...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Streaming SQL in Data Mesh</title>
		<link>https://noise.getoto.net/2023/11/03/streaming-sql-in-data-mesh/</link>
		
		<dc:creator><![CDATA[Netflix Technology Blog]]></dc:creator>
		<pubDate>Fri, 03 Nov 2023 21:48:50 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data-mesh]]></category>
		<category><![CDATA[event-streaming]]></category>
		<category><![CDATA[flink]]></category>
		<category><![CDATA[stream-processing]]></category>
		<guid isPermaLink="false">https://medium.com/p/0d83f5a00d08</guid>

					<description><![CDATA[Democratizing Stream Processing @ NetflixBy Guil Pires, Mark Cho, Mingliang Liu, Sujay JainData powers much of what we do at Netflix. On the Data Platform team, we build the infrastructure used across the company to process data at scale.In our last bl...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Building hyperlocal GrabMaps</title>
		<link>https://noise.getoto.net/2023/08/30/building-hyperlocal-grabmaps/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Wed, 30 Aug 2023 00:00:10 +0000</pubDate>
				<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data]]></category>
		<category><![CDATA[Data processing]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[GrabMaps]]></category>
		<category><![CDATA[hyperlocalisation]]></category>
		<category><![CDATA[Maps]]></category>
		<category><![CDATA[navigation]]></category>
		<category><![CDATA[Product]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/building-hyperlocal-grabmaps</guid>

					<description><![CDATA[Introduction

Southeast Asia (SEA) is a dynamic market, very different from other parts of the world. When travelling on the road, you may experience fast-changing road restrictions, new roads appearing overnight, and high traffic congestion. To addres...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Let’s Architect! Architecting a data mesh</title>
		<link>https://noise.getoto.net/2023/03/08/lets-architect-architecting-a-data-mesh/</link>
		
		<dc:creator><![CDATA[Luca Mezzalira]]></dc:creator>
		<pubDate>Wed, 08 Mar 2023 16:20:53 +0000</pubDate>
				<category><![CDATA[Architecture]]></category>
		<category><![CDATA[Architecture Center]]></category>
		<category><![CDATA[AWS Well-Architected Framework]]></category>
		<category><![CDATA[AWS Well-Architected Tool]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[data-mesh]]></category>
		<category><![CDATA[Let's Architect]]></category>
		<category><![CDATA[Thought Leadership]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=5df3da01476c11f60bbc316bae21b176</guid>

					<description><![CDATA[Data architectures were mainly designed around technologies rather than business domains in the past. This changed in 2019, when Zhamak Dehghani introduced the data mesh. Data mesh is an application of the Domain-Driven-Design (DDD) principles to data architectures: Data is organized into data domains and the data is the product that the team owns and […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>AWS Local Zones and AWS Outposts, choosing the right technology for your edge workload</title>
		<link>https://noise.getoto.net/2022/12/01/aws-local-zones-and-aws-outposts-choosing-the-right-technology-for-your-edge-workload/</link>
		
		<dc:creator><![CDATA[Sheila Busser]]></dc:creator>
		<pubDate>Thu, 01 Dec 2022 18:25:40 +0000</pubDate>
				<category><![CDATA[automation]]></category>
		<category><![CDATA[AWS Lambda]]></category>
		<category><![CDATA[AWS Local Zones]]></category>
		<category><![CDATA[AWS Outposts]]></category>
		<category><![CDATA[Best practices]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Customer Solutions]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=09e750b259a663ae88f8a46978130535</guid>

					<description><![CDATA[This blog post is written by Joe Sacco, Senior Technical Account Manager. The AWS Global Cloud Infrastructure includes 30 Launched Regions, 96 Availability Zones (AZs), 410+ Points of Presence with 400+ Edge Locations, and 13 Regional Edge Caches.  With over 200 AWS services, most customer workloads can run in the AWS Regions. However, for some […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Let’s Architect! Modern data architectures</title>
		<link>https://noise.getoto.net/2022/09/07/lets-architect-modern-data-architectures/</link>
		
		<dc:creator><![CDATA[Luca Mezzalira]]></dc:creator>
		<pubDate>Wed, 07 Sep 2022 14:54:48 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<category><![CDATA[AWS Lake Formation]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[cloud]]></category>
		<category><![CDATA[data architecture]]></category>
		<category><![CDATA[data-mesh]]></category>
		<category><![CDATA[distributed-systems]]></category>
		<category><![CDATA[Let's Architect]]></category>
		<category><![CDATA[ml]]></category>
		<category><![CDATA[software-architecture]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=3bd16bce59692c54fbcaab401f0dc042</guid>

					<description><![CDATA[With the rapid growth in data coming from data platforms and applications, and the continuous improvements in state-of-the-art machine learning algorithms, data are becoming key assets for companies. Modern data architectures include data mesh—a recent style that represents a paradigm shift, in which data is treated as a product and data architectures are designed around […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Interactively develop your AWS Glue streaming ETL jobs using AWS Glue Studio notebooks</title>
		<link>https://noise.getoto.net/2022/09/01/interactively-develop-your-aws-glue-streaming-etl-jobs-using-aws-glue-studio-notebooks/</link>
		
		<dc:creator><![CDATA[Arun A K]]></dc:creator>
		<pubDate>Thu, 01 Sep 2022 18:43:52 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[AWS Glue]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[jupyternotebook]]></category>
		<category><![CDATA[Technical How-to]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=0fd808210ee571f307fcd2a01403004f</guid>

					<description><![CDATA[Enterprise customers are modernizing their data warehouses and data lakes to provide real-time insights, because having the right insights at the right time is crucial for good business outcomes. To enable near-real-time decision-making, data pipelines need to process real-time or near-real-time data. This data is sourced from IoT devices, change data capture (CDC) services like […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>From centralized architecture to decentralized architecture: How data sharing fine-tunes Amazon Redshift workloads</title>
		<link>https://noise.getoto.net/2022/08/16/from-centralized-architecture-to-decentralized-architecture-how-data-sharing-fine-tunes-amazon-redshift-workloads/</link>
		
		<dc:creator><![CDATA[Jingbin Ma]]></dc:creator>
		<pubDate>Tue, 16 Aug 2022 17:53:16 +0000</pubDate>
				<category><![CDATA[Amazon Redshift]]></category>
		<category><![CDATA[Amazon Redshift data sharing]]></category>
		<category><![CDATA[Analytics]]></category>
		<category><![CDATA[AWS Analytics]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[Optimization]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=2ba6cdbea875751a5afa9a8b02c8f263</guid>

					<description><![CDATA[Amazon Redshift is a fully managed, petabyte-scale, massively parallel data warehouse that offers simple operations and high performance. It makes it fast, simple, and cost-effective to analyze all your data using standard SQL and your existing business intelligence (BI) tools. Today, Amazon Redshift has become the most widely used cloud data warehouse. With the significant […]]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Bias in the machine: How can we address gender bias in AI?</title>
		<link>https://noise.getoto.net/2022/03/08/bias-in-the-machine-how-can-we-address-gender-bias-in-ai/</link>
		
		<dc:creator><![CDATA[Sue Sentance]]></dc:creator>
		<pubDate>Tue, 08 Mar 2022 09:42:15 +0000</pubDate>
				<category><![CDATA[artificial intelligence]]></category>
		<category><![CDATA[Big Data]]></category>
		<category><![CDATA[computing education]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Equity]]></category>
		<category><![CDATA[Gender Balance in Computing]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[research]]></category>
		<guid isPermaLink="false">https://www.raspberrypi.org/?p=78629</guid>

					<description><![CDATA[<p>At the Raspberry Pi Foundation, we’ve been thinking about questions relating to artificial intelligence (AI) education and data science education for several months now, inviting experts to share their perspectives in a series of very well-attended seminars. At the same time, we’ve been running a programme of research trials to find out what interventions in…</p>
<p>The post <a rel="nofollow" href="https://www.raspberrypi.org/blog/gender-bias-in-ai-machine-learning-biased-data/">Bias in the machine: How can we address gender bias in AI?</a> appeared first on <a rel="nofollow" href="https://www.raspberrypi.org/">Raspberry Pi</a>.</p>]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 77/443 objects using Memcached
Page Caching using Disk: Enhanced 
Lazy Loading (feed)
Database Caching using Memcached

Served from: noise.getoto.net @ 2025-12-10 08:38:20 by W3 Total Cache
-->