<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>data mining &#8211; Noise</title>
	<atom:link href="https://noise.getoto.net/tag/data-mining/feed/" rel="self" type="application/rss+xml" />
	<link>https://noise.getoto.net</link>
	<description>The collective thoughts of the interwebz</description>
	<lastBuildDate>Mon, 26 Jun 2023 15:36:17 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.2</generator>
	<item>
		<title>Excel Data Forensics</title>
		<link>https://noise.getoto.net/2023/06/26/excel-data-forensics/</link>
		
		<dc:creator><![CDATA[Bruce Schneier]]></dc:creator>
		<pubDate>Mon, 26 Jun 2023 15:36:17 +0000</pubDate>
				<category><![CDATA[academic]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[microsoft]]></category>
		<category><![CDATA[plagiarism]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<guid isPermaLink="false">https://www.schneier.com/?p=67485</guid>

					<description><![CDATA[This detailed article about academic plagiarism includes some interesting details about how to do data forensics on Excel files. The analysis really needs the graphics to be understood, so see the description at the link.
(And, yes, an author of a paper on dishonesty...]]></description>
		
		

			</item>
		<item>
		<title>Serverless Architecture for a Structured Data Mining Solution</title>
		<link>https://noise.getoto.net/2021/10/01/serverless-architecture-for-a-structured-data-mining-solution/</link>
		
		<dc:creator><![CDATA[Uri Rotem]]></dc:creator>
		<pubDate>Fri, 01 Oct 2021 17:21:50 +0000</pubDate>
				<category><![CDATA[Advanced (300)]]></category>
		<category><![CDATA[Amazon DynamoDB]]></category>
		<category><![CDATA[Amazon SageMaker Ground Truth]]></category>
		<category><![CDATA[Architecture]]></category>
		<category><![CDATA[AWS Lambda]]></category>
		<category><![CDATA[AWS Step Functions]]></category>
		<category><![CDATA[data cleaning]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[Manufacturing]]></category>
		<guid isPermaLink="false">http://noise.getoto.net/?guid=38cf93258ca3b750b4cb1cf88459d519</guid>

					<description><![CDATA[Many businesses need structured data stored in their own database to support their operations and offerings. For example, a company that produces electronics may want to store a structured dataset of parts, where each part has properties such as color, weight, connector type, and more. This data may already be available from external sources. […]]]></description>
		
		

			</item>
		<item>
		<title>Commercial Location Data Used to Out Priest</title>
		<link>https://noise.getoto.net/2021/07/23/commercial-location-data-used-to-out-priest/</link>
		
		<dc:creator><![CDATA[Bruce Schneier]]></dc:creator>
		<pubDate>Fri, 23 Jul 2021 13:58:33 +0000</pubDate>
				<category><![CDATA[cell phones]]></category>
		<category><![CDATA[data collection]]></category>
		<category><![CDATA[data mining]]></category>
		<category><![CDATA[de-anonymization]]></category>
		<category><![CDATA[geolocation]]></category>
		<category><![CDATA[Privacy]]></category>
		<category><![CDATA[surveillance]]></category>
		<category><![CDATA[Uncategorized]]></category>
		<guid isPermaLink="false">https://www.schneier.com/?p=63489</guid>

					<description><![CDATA[<p>A Catholic priest was outed through commercially available surveillance data. Vice has a <a href="https://www.vice.com/en/article/pkbxp8/grindr-location-data-priest-weaponization-app">good analysis</a>:</p>
<blockquote><p>The news starkly demonstrates not only the inherent power of location data, but how the chance to wield that power has trickled down from corporations and intelligence agencies to essentially any sort of disgruntled, unscrupulous, or dangerous individual. A growing market of data brokers that collect and sell data from countless apps has made it so that anyone with a bit of cash and effort can figure out which phone in a so-called anonymized dataset belongs to a target, and abuse that information...</p></blockquote>]]></description>
		
		

			</item>
	</channel>
</rss>
