<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Data Science &#8211; Noise</title>
	<atom:link href="https://noise.getoto.net/tag/data-science/feed/" rel="self" type="application/rss+xml" />
	<link>https://noise.getoto.net</link>
	<description>The collective thoughts of the interwebz</description>
	<lastBuildDate>Wed, 26 Nov 2025 00:00:10 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.2</generator>
	<item>
		<title>Real-time data quality monitoring: Kafka stream contracts with syntactic and semantic test</title>
		<link>https://noise.getoto.net/2025/11/26/real-time-data-quality-monitoring-kafka-stream-contracts-with-syntactic-and-semantic-test/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Wed, 26 Nov 2025 00:00:10 +0000</pubDate>
				<category><![CDATA[data]]></category>
		<category><![CDATA[Data processing]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[kafka]]></category>
		<category><![CDATA[Performance]]></category>
		<category><![CDATA[Real-time streaming]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/real-time-data-quality-monitoring</guid>

					<description><![CDATA[Introduction

In today’s data-driven landscape, monitoring data quality has become a critical need for ensuring reliable and efficient data usage across domains. High-quality data is the backbone of AI innovation, driving efficiency and unlocking new o...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>What should be included in a data science curriculum for schools?</title>
		<link>https://noise.getoto.net/2025/10/30/what-should-be-included-in-a-data-science-curriculum-for-schools/</link>
		
		<dc:creator><![CDATA[Jan Ander]]></dc:creator>
		<pubDate>Thu, 30 Oct 2025 11:24:44 +0000</pubDate>
				<category><![CDATA[AI]]></category>
		<category><![CDATA[AI education]]></category>
		<category><![CDATA[AI literacy]]></category>
		<category><![CDATA[data literacy]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[research]]></category>
		<guid isPermaLink="false">https://www.raspberrypi.org/?p=91761</guid>

					<description><![CDATA[<p>Current artificial intelligence (AI) methods, especially machine learning (ML), rely heavily on data. To complement our work on AI literacy, we have been investigating what data science teaching resources and education research are currently available. Our goal is to work out what data science concepts should be taught in a data science curriculum for schools.…</p>
<p>The post <a href="https://www.raspberrypi.org/blog/what-should-be-included-in-a-data-science-curriculum-for-schools/">What should be included in a data science curriculum for schools?</a> appeared first on <a href="https://www.raspberrypi.org/">Raspberry Pi Foundation</a>.</p>]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Machine-learning predictive autoscaling for Flink</title>
		<link>https://noise.getoto.net/2025/10/30/machine-learning-predictive-autoscaling-for-flink/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Thu, 30 Oct 2025 00:00:10 +0000</pubDate>
				<category><![CDATA[data]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Performance]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/ml-predictive-autoscaling-for-flink</guid>

					<description><![CDATA[Introduction

As Grab transitions to derive more valuable insights from our wealth of operational data, we are witnessing a steep increase in stream-processing applications. Over the past year, the number of Flink applications grew 2.5 times, driven by...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Modernising Grab’s model serving platform with NVIDIA Triton Inference Server</title>
		<link>https://noise.getoto.net/2025/10/21/modernising-grabs-model-serving-platform-with-nvidia-triton-inference-server/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Tue, 21 Oct 2025 00:00:10 +0000</pubDate>
				<category><![CDATA[data]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Performance]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/modernising-grab-model-serving-platform</guid>

					<description><![CDATA[Introduction

Catwalk is Grab’s machine learning (ML) model serving platform, designed to enable data scientists and engineers in deploying production-ready inference APIs. Currently, Catwalk powers hundreds of ML models and online deployments. To acco...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Bringing data science to life for K–12 students with the ‘API Can Code’ curriculum</title>
		<link>https://noise.getoto.net/2025/06/12/bringing-data-science-to-life-for-k-12-students-with-the-api-can-code-curriculum/</link>
		
		<dc:creator><![CDATA[Diana Kirby]]></dc:creator>
		<pubDate>Thu, 12 Jun 2025 11:23:57 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[research seminar]]></category>
		<guid isPermaLink="false">https://www.raspberrypi.org/?p=90472</guid>

					<description><![CDATA[<p>As data and data-driven technologies become a bigger part of everyday life, it’s more important than ever to make sure that young people are given the chance to learn data science concepts and skills. In our April research seminar, David Weintrop, Rotem Israel-Fishelson, and Peter Moon from the University of Maryland introduced API Can Code,…</p>
<p>The post <a href="https://www.raspberrypi.org/blog/bringing-data-science-to-life-for-k-12-students-with-the-api-can-code-curriculum/">Bringing data science to life for K–12 students with the ‘API Can Code’ curriculum</a> appeared first on <a href="https://www.raspberrypi.org/">Raspberry Pi Foundation</a>.</p>]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Join our free data science education workshop for teachers</title>
		<link>https://noise.getoto.net/2025/06/09/join-our-free-data-science-education-workshop-for-teachers/</link>
		
		<dc:creator><![CDATA[Jan Ander]]></dc:creator>
		<pubDate>Mon, 09 Jun 2025 10:32:50 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[research]]></category>
		<guid isPermaLink="false">https://www.raspberrypi.org/?p=90377</guid>

					<description><![CDATA[<p>Are you a teacher who is interested in data science education for key stage 5 (age 16 to 18)? Then we invite you to join our free, in-person workshop exploring the topic, taking place in Cambridge, UK on 10 July 2025. You will be among the very first educators to see some of our first…</p>
<p>The post <a href="https://www.raspberrypi.org/blog/join-our-free-data-science-education-workshop-for-teachers/">Join our free data science education workshop for teachers</a> appeared first on <a href="https://www.raspberrypi.org/">Raspberry Pi Foundation</a>.</p>]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Streamlining RiskOps with the SOP agent framework</title>
		<link>https://noise.getoto.net/2025/05/08/streamlining-riskops-with-the-sop-agent-framework/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Thu, 08 May 2025 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Analytics]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Experiment]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/streamlining-riskops-with-sop</guid>

					<description><![CDATA[Introduction

In the blog our previous introduction to the SOP-driven LLM Agent Framework, we the potential of LLM agent framework to revolutionise business operations was discussed. Now, we’re excited to explore a compelling use case: automating Accou...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Introducing the SOP-driven LLM agent frameworks</title>
		<link>https://noise.getoto.net/2025/04/25/introducing-the-sop-driven-llm-agent-frameworks/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Fri, 25 Apr 2025 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Analytics]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Experiment]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/introducing-the-sop-drive-llm-agent-framework</guid>

					<description><![CDATA[Introduction

We’re excited to introduce an innovative Large Language Model (LLM) agent framework that reimagines how enterprises can harness the power of AI to streamline operations and boost productivity. At its core, this framework leverages Standar...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Grab AI Gateway: Connecting Grabbers to Multiple GenAI Providers</title>
		<link>https://noise.getoto.net/2025/02/17/grab-ai-gateway-connecting-grabbers-to-multiple-genai-providers/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Mon, 17 Feb 2025 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[Optimisation]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/grab-ai-gateway</guid>

					<description><![CDATA[The transformative world of Generative AI (GenAI), which refers to artificial intelligence systems capable of creating new content such as text, images, or music that is similar to human-generated content, has become integral to innovation, powering th...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Teaching about AI in K–12 education: Thoughts from the USA</title>
		<link>https://noise.getoto.net/2025/02/13/teaching-about-ai-in-k-12-education-thoughts-from-the-usa/</link>
		
		<dc:creator><![CDATA[Katharine Childs]]></dc:creator>
		<pubDate>Thu, 13 Feb 2025 11:55:09 +0000</pubDate>
				<category><![CDATA[AI in education]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[research seminar]]></category>
		<guid isPermaLink="false">https://www.raspberrypi.org/?p=89462</guid>

					<description><![CDATA[<p>As artificial intelligence continues to shape our world, understanding how to teach about AI has never been more important. Our new research seminar series brings together educators and researchers to explore approaches to AI and data science education. In the first seminar, we welcomed Shuchi Grover, Director of AI and Education Research at Looking Glass…</p>
<p>The post <a href="https://www.raspberrypi.org/blog/teaching-about-ai-in-k-12-education-thoughts-from-the-usa/">Teaching about AI in K–12 education: Thoughts from the USA</a> appeared first on <a href="https://www.raspberrypi.org/">Raspberry Pi Foundation</a>.</p>]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>How can we teach students about AI and data science? Join our 2025 seminar series to learn more about the topic</title>
		<link>https://noise.getoto.net/2024/12/12/how-can-we-teach-students-about-ai-and-data-science-join-our-2025-seminar-series-to-learn-more-about-the-topic/</link>
		
		<dc:creator><![CDATA[Jane Waite]]></dc:creator>
		<pubDate>Thu, 12 Dec 2024 09:54:06 +0000</pubDate>
				<category><![CDATA[AI in education]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[research]]></category>
		<category><![CDATA[research seminar]]></category>
		<guid isPermaLink="false">https://www.raspberrypi.org/?p=89069</guid>

					<description><![CDATA[<p>AI, machine learning (ML), and data science infuse our daily lives, from the recommendation functionality on music apps to technologies that influence our healthcare, transport, education, defence, and more. What jobs will be affected by AL, ML, and data science remains to be seen, but it is increasingly clear that students will need to learn…</p>
<p>The post <a href="https://www.raspberrypi.org/blog/how-can-we-teach-students-about-ai-and-data-science-2025-seminar-series/">How can we teach students about AI and data science? Join our 2025 seminar series to learn more about the topic</a> appeared first on <a href="https://www.raspberrypi.org/">Raspberry Pi Foundation</a>.</p>]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>How we seamlessly migrated high volume real-time streaming traffic from one service to another with zero data loss and duplication</title>
		<link>https://noise.getoto.net/2024/12/05/how-we-seamlessly-migrated-high-volume-real-time-streaming-traffic-from-one-service-to-another-with-zero-data-loss-and-duplication/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Thu, 05 Dec 2024 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[data streaming]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Optimisation]]></category>
		<category><![CDATA[Real-time streaming]]></category>
		<category><![CDATA[Service]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/seamless-migration</guid>

					<description><![CDATA[At Grab, we continuously enhance our systems to improve scalability, reliability and cost-efficiency. Recently, we undertook a project to split the read and write functionalities of one of our backend services into separate services. This was motivated...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Supercharging LLM Application Development with LLM-Kit</title>
		<link>https://noise.getoto.net/2024/11/29/supercharging-llm-application-development-with-llm-kit/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Fri, 29 Nov 2024 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/supercharging-llm-application-development-with-llm-kit</guid>

					<description><![CDATA[Introduction

At Grab, we are committed to leveraging the power of technology to deliver the best services to our users and partners. As part of this commitment, we have developed the LLM-Kit, a comprehensive framework designed to supercharge the setup...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Metasense V2: Enhancing, improving and productionisation of LLM powered data governance</title>
		<link>https://noise.getoto.net/2024/11/14/metasense-v2-enhancing-improving-and-productionisation-of-llm-powered-data-governance/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Thu, 14 Nov 2024 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/metasense-v2</guid>

					<description><![CDATA[Introduction

In the initial article, LLM Powered Data Classification, we addressed how we integrated Large Language Models (LLM) to automate governance-related metadata generation. The LLM integration enabled us to resolve challenges in Gemini, such a...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>LLM-assisted vector similarity search</title>
		<link>https://noise.getoto.net/2024/10/23/llm-assisted-vector-similarity-search/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Wed, 23 Oct 2024 00:00:10 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Experiment]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/llm-assisted-vector-similarity-search</guid>

					<description><![CDATA[Introduction

As the complexity of data retrieval requirements continue to grow, traditional search methods often struggle to provide relevant and accurate results, especially for nuanced or conceptual queries. Vector similarity search has emerged as a...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Leveraging RAG-powered LLMs for Analytical Tasks</title>
		<link>https://noise.getoto.net/2024/10/09/leveraging-rag-powered-llms-for-analytical-tasks-2/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Wed, 09 Oct 2024 00:00:10 +0000</pubDate>
				<category><![CDATA[Analytics]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Experiment]]></category>
		<category><![CDATA[generative AI]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[machine learning]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/transforming-the-analytics-landscape-with-RAG-powered-LLM</guid>

					<description><![CDATA[Introduction

Retrieval-Augmented Generation (RAG) is a powerful process that is designed to integrate direct function calling to answer queries more efficiently by retrieving relevant information from a broad database. In the rapidly evolving business...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Evolution of Catwalk: Model serving platform at Grab</title>
		<link>https://noise.getoto.net/2024/10/01/evolution-of-catwalk-model-serving-platform-at-grab/</link>
		
		<dc:creator><![CDATA[Grab Tech]]></dc:creator>
		<pubDate>Tue, 01 Oct 2024 00:00:50 +0000</pubDate>
				<category><![CDATA[Data Science]]></category>
		<category><![CDATA[Docker]]></category>
		<category><![CDATA[Engineering]]></category>
		<category><![CDATA[Kubernetes]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[Models]]></category>
		<category><![CDATA[TensorFlow]]></category>
		<guid isPermaLink="false">https://engineering.grab.com/catwalk-evolution</guid>

					<description><![CDATA[Introduction

As Southeast Asia’s leading super app, Grab serves millions of users across multiple countries every day. Our services range from ride-hailing and food delivery to digital payments and much more. The backbone of our operations? Machine Le...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Improve Your Next Experiment by Learning Better Proxy Metrics From Past Experiments</title>
		<link>https://noise.getoto.net/2024/08/26/improve-your-next-experiment-by-learning-better-proxy-metrics-from-past-experiments/</link>
		
		<dc:creator><![CDATA[Netflix Technology Blog]]></dc:creator>
		<pubDate>Mon, 26 Aug 2024 15:46:24 +0000</pubDate>
				<category><![CDATA[a-b-testing]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[experimentation]]></category>
		<category><![CDATA[machine learning]]></category>
		<category><![CDATA[Statistics]]></category>
		<guid isPermaLink="false">https://medium.com/p/64c786c2a3ac</guid>

					<description><![CDATA[By Aurélien Bibaut, Winston Chou, Simon Ejdemyr, and Nathan KallusWe are excited to share our work on how to learn good proxy metrics from historical experiments at KDD 2024. This work addresses a fundamental question for technology companies and acade...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>A Recap of the Data Engineering Open Forum at Netflix</title>
		<link>https://noise.getoto.net/2024/06/20/a-recap-of-the-data-engineering-open-forum-at-netflix/</link>
		
		<dc:creator><![CDATA[Netflix Technology Blog]]></dc:creator>
		<pubDate>Thu, 20 Jun 2024 15:01:27 +0000</pubDate>
				<category><![CDATA[data]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[data-engineering]]></category>
		<category><![CDATA[software engineering]]></category>
		<category><![CDATA[Technology]]></category>
		<guid isPermaLink="false">https://medium.com/p/6b4d4410b88f</guid>

					<description><![CDATA[A summary of sessions at the first Data Engineering Open Forum at Netflix on April 18th, 2024The Data Engineering Open Forum at Netflix on April 18th, 2024.At Netflix, we aspire to entertain the world, and our data engineering teams play a crucial role...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
		<item>
		<title>Round 2: A Survey of Causal Inference Applications at Netflix</title>
		<link>https://noise.getoto.net/2024/06/06/round-2-a-survey-of-causal-inference-applications-at-netflix/</link>
		
		<dc:creator><![CDATA[Netflix Technology Blog]]></dc:creator>
		<pubDate>Thu, 06 Jun 2024 20:10:54 +0000</pubDate>
				<category><![CDATA[causal-inference]]></category>
		<category><![CDATA[Data Science]]></category>
		<category><![CDATA[experimentation]]></category>
		<category><![CDATA[Netflix]]></category>
		<category><![CDATA[Technology]]></category>
		<guid isPermaLink="false">https://medium.com/p/fd78328ee0bb</guid>

					<description><![CDATA[At Netflix, we want to ensure that every current and future member finds content that thrills them today and excites them to come back for more. Causal inference is an essential part of the value that Data Science and Engineering adds towards this miss...]]></description>
		
		
		<enclosure url="" length="0" type="" />

			</item>
	</channel>
</rss>

<!--
Performance optimized by W3 Total Cache. Learn more: https://www.boldgrid.com/w3-total-cache/

Object Caching 54/315 objects using Memcached
Page Caching using Disk: Enhanced 
Lazy Loading (feed)
Database Caching using Memcached

Served from: noise.getoto.net @ 2025-12-10 17:49:44 by W3 Total Cache
-->