<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Thomas LaRock</title>
	<atom:link href="https://thomaslarock.com/feed/?max-results=5" rel="self" type="application/rss+xml" />
	<link>https://thomaslarock.com/</link>
	<description>Thomas LaRock is an author, speaker, data expert, and SQLRockstar. He helps people connect, learn, and share. Along the way he solves data problems, too.</description>
	<lastBuildDate>Thu, 11 Jul 2024 21:35:40 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	<generator>https://wordpress.org/?v=6.8.1</generator>

<image>
	<url>https://thomaslarock.com/wp-content/uploads/2015/07/gravatar.jpg</url>
	<title>Thomas LaRock</title>
	<link>https://thomaslarock.com/</link>
	<width>32</width>
	<height>32</height>
</image> 
<site xmlns="com-wordpress:feed-additions:1">18470099</site>	<item>
		<title>Microsoft Fabric is the New Office</title>
		<link>https://thomaslarock.com/2024/07/microsoft-fabric-is-the-new-office/</link>
					<comments>https://thomaslarock.com/2024/07/microsoft-fabric-is-the-new-office/#respond</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Thu, 11 Jul 2024 21:35:38 +0000</pubDate>
				<category><![CDATA[Azure]]></category>
		<category><![CDATA[Data Analytics]]></category>
		<category><![CDATA[SQL MVP]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=29269</guid>

					<description><![CDATA[<p>At Microsoft Build in 2023 the world first heard about a new offering from Microsoft called Microsoft Fabric. Reactions to the announcement ranged from “meh” to “what is this?” To be fair, this is the typical reaction most people have when you talk data with them. Many of us had no idea what to make ... <a title="Microsoft Fabric is the New Office" class="read-more" href="https://thomaslarock.com/2024/07/microsoft-fabric-is-the-new-office/" aria-label="Read more about Microsoft Fabric is the New Office">Read more</a></p>
<p>The post <a href="https://thomaslarock.com/2024/07/microsoft-fabric-is-the-new-office/">Microsoft Fabric is the New Office</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>At Microsoft Build in 2023 the world first heard about a new offering from Microsoft called <a href="https://learn.microsoft.com/en-us/fabric/get-started/microsoft-fabric-overview?WT.mc_id=twitter&amp;sharingId=DP-MVP-4025219" target="_blank" rel="noreferrer noopener">Microsoft Fabric</a>. Reactions to the announcement ranged from “meh” to “what is this?” To be fair, this is the typical reaction most people have when you talk data with them.</p>



<p>Many of us had no idea what to make of Fabric. To me, it seemed as if Microsoft were doing a rebranding of sorts. They changed the name of Azure Synapse Analytics, also called a Dedicated SQL Pool, and previously known as Azure SQL Data Warehouse. Microsoft excels (ha!) at renaming products every 18 months, keeping customers guessing if anyone is leading product marketing.</p>



<p>Microsoft Fabric also came with this thing called <a href="https://learn.microsoft.com/en-us/fabric/onelake/onelake-overview?WT.mc_id=twitter&amp;sharingId=DP-MVP-4025219" target="_blank" rel="noreferrer noopener">OneLake</a>, a place for all your company data. Folks with an eye on data security, privacy, and governance thought the idea of OneLake was madness. The idea of combining all your company data into one big bucket seemed like a lot of administrative overhead. But OneLake also offers a way to separate storage and compute, allowing for greater scalability. This is a must-have when you are competing with companies like Databricks and Snowflake, and other cloud service providers such as AWS and Google.</p>



<h2 class="wp-block-heading">After Some Thought&#8230;</h2>



<p>After the dust had settled and time passed, the launch and concept of Fabric started to make more sense. For the past 15+ years, Microsoft has been building the individual pieces of Fabric. Here’s a handful of features and services Fabric contains:</p>



<ul class="wp-block-list">
<li>Data Warehouse/Lakehouse – the storing of large volumes of structured and unstructured data in OneLake, which separates storage and compute</li>



<li>Real-time analytics – the ability to stream data into OneLake, or pull data from external sources such as Snowflake</li>



<li>Data Engineering – the ability to extract, load, and transform data including the use of notebooks</li>



<li>Data Science – leverage machine learning to gain insights from your data</li>



<li>Power BI – create interactive reports and dashboards</li>
</ul>



<p>Many of these services were built to support traditional data storage, retrieval, and analytical processing. This type of data processing focuses on data at rest, as opposed to streaming event data. This is not to say you couldn’t use these services for streaming; you could try if you wanted. After all, the building blocks for real-time analytics go back to SQL Server 2008 R2, with the release of <a href="https://learn.microsoft.com/en-us/archive/msdn-magazine/2012/march/microsoft-streaminsight-building-the-internet-of-things?WT.mc_id=twitter&amp;sharingId=DP-MVP-4025219" target="_blank" rel="noreferrer noopener">StreamInsight</a>, a fancy way to build pipelines for refreshing dashboards with up-to-date data.</p>



<p>Streaming event data is where the real data race is taking place today. According to IDC, by 2025 <a href="https://www.zdnet.com/article/by-2025-nearly-30-percent-of-data-generated-will-be-real-time-idc-says/">nearly 30% of data will need real-time processing</a>. This is the market Microsoft, among others, is targeting, and it is roughly 54 ZB in size.</p>



<figure class="wp-block-image size-large"><img fetchpriority="high" decoding="async" width="600" height="389" src="https://thomaslarock.com/wp-content/uploads/2024/07/image-600x389.png?x65915" alt="" class="wp-image-29270" srcset="https://thomaslarock.com/wp-content/uploads/2024/07/image-600x389.png 600w, https://thomaslarock.com/wp-content/uploads/2024/07/image-486x315.png 486w, https://thomaslarock.com/wp-content/uploads/2024/07/image-768x498.png 768w, https://thomaslarock.com/wp-content/uploads/2024/07/image.png 975w" sizes="(max-width: 600px) 100vw, 600px" /></figure>



<p>So, it seems the more data collected, the more likely it is to be used for real-time processing. Therefore, if you are a cloud company, it is rather important to your bottom line to find a way to make it easy for your customers to store their data in your cloud. The next best thing, of course, is making it easy for your customers to use your tools and services to work with data stored elsewhere. This is part of the brilliance of Fabric, as it allows ease of access to real-time data you are already using in places like Databricks, Confluent, and Snowflake.</p>



<h2 class="wp-block-heading">The Bundle</h2>



<p>Now, if you are Microsoft, with a handful of data services ready to meet the needs of a growing market, you have some choices to make. You could continue to do what you have done for 15+ years and keep selling individual products and services and hope you earn some of the market going forward. Or you could bundle the products and services, unifying them into one platform, and make it easy for users to ingest, transform, analyze, and report on their data.</p>



<p>Well, if you want to gain market share, bundling makes the most sense. And Microsoft is uniquely positioned to pull this off for two reasons. First, they have a comprehensive data platform which is second to none. Sure, you can point to other companies who might do one of those services better, but there is no company on Earth, or in the Cloud, which offers a complete end-to-end data platform like Fabric.</p>



<p>Second, bundling software is something Microsoft has a history of doing, and doing it quite well in some cases. People reading this post in 2024 may not be old enough to recall a time when you purchased individual software products like Excel and Word. But I do recall the time before Microsoft Office existed. Bundling everything into Fabric allows users to work with their data anywhere and, most importantly to Microsoft’s bottom line, the result is more data flowing to Azure servers.</p>



<p>I am not here to tell you everything is perfect with Fabric. In the past year I have seen a handful of negative comments about Fabric, most of them nitpicking about things like brand names, data type support, and file formats. There is always going to be a person upset about how Widget X isn’t the Most Perfect Thing For Them at This Moment and They Need to Tell the World. I think most people believe when a product is released, even if it is marked as “Preview”, it should be able to meet the demands of every possible user. It is just not practical.</p>



<h2 class="wp-block-heading">Summary</h2>



<p>Microsoft Fabric was announced at Build this year to be GA, which also makes users believe it should meet the demands of every possible user. The fastest way for Microsoft to grab as much market share as possible is to focus on the customer experience and remove those barriers. You can find roadmap details <a href="https://learn.microsoft.com/en-us/fabric/release-plan/overview?WT.mc_id=twitter&amp;sharingId=DP-MVP-4025219" target="_blank" rel="noreferrer noopener">here</a>, giving you an idea about the effort going on behind the scenes with Fabric today. For example, for everyone who has raised issues with security and governance, you can see the list of what has shipped and what is planned <a href="https://learn.microsoft.com/en-us/fabric/release-plan/admin-governance?WT.mc_id=twitter&amp;sharingId=DP-MVP-4025219" target="_blank" rel="noreferrer noopener">here</a>.</p>



<p>It is clear Microsoft is investing in Fabric, much like they invested in Office 30+ years ago. If there is one thing Microsoft knows how to do, it is creating value for shareholders:</p>



<figure class="wp-block-image size-large"><img decoding="async" width="600" height="513" src="https://thomaslarock.com/wp-content/uploads/2024/07/image-1-600x513.png?x65915" alt="" class="wp-image-29271" srcset="https://thomaslarock.com/wp-content/uploads/2024/07/image-1-600x513.png 600w, https://thomaslarock.com/wp-content/uploads/2024/07/image-1-368x315.png 368w, https://thomaslarock.com/wp-content/uploads/2024/07/image-1-768x657.png 768w, https://thomaslarock.com/wp-content/uploads/2024/07/image-1.png 975w" sizes="(max-width: 600px) 100vw, 600px" /></figure>



<p>Since the announcement of Fabric last May, Microsoft’s stock is up over 25%. I am not going to say the increase is the direct result of Fabric. What I am saying is Microsoft might have an idea about what they are doing, and why.</p>



<p>Microsoft Fabric is the new Office – it is a bundle of data products, meant to boost productivity for data professionals and dominate the data analytics landscape, much in the same way Office dominates the business world.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2024/07/microsoft-fabric-is-the-new-office/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">29269</post-id>	</item>
		<item>
		<title>Book Review: The AI Playbook</title>
		<link>https://thomaslarock.com/2024/02/book-review-the-ai-playbook/</link>
					<comments>https://thomaslarock.com/2024/02/book-review-the-ai-playbook/#respond</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Tue, 27 Feb 2024 13:38:58 +0000</pubDate>
				<category><![CDATA[Book Reviews]]></category>
		<category><![CDATA[Data Analytics]]></category>
		<category><![CDATA[Professional Development]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=28750</guid>

					<description><![CDATA[<p>Imagine you conceive an idea which will save your company millions of dollars, reduce workplace injuries, and increase sales. Now imagine company executives dislike the idea because it seems difficult to implement, and the implementation details are not well understood. Despite the stated benefits of saving money, reducing injuries, and increasing sales your idea hits ... <a title="Book Review: The AI Playbook" class="read-more" href="https://thomaslarock.com/2024/02/book-review-the-ai-playbook/" aria-label="Read more about Book Review: The AI Playbook">Read more</a></p>
<p>The post <a href="https://thomaslarock.com/2024/02/book-review-the-ai-playbook/">Book Review: The AI Playbook</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>Imagine you conceive an idea which will save your company millions of dollars, reduce workplace injuries, and increase sales. Now imagine company executives dislike the idea because it seems difficult to implement, and the implementation details are not well understood. Despite the stated benefits of saving money, reducing injuries, and increasing sales, your idea hits a brick wall and falls flat.</p>



<p>Welcome to the world of artificial intelligence (AI) and machine learning (ML), where the struggle is real.</p>



<p>At some point in your career, you have experienced a failed project. If not, don’t worry, you will. Projects fail for all sorts of reasons. Unclear objectives. Unrealistic expectations. Poor planning. Lack of resources. Scope creep. Just to name a few of the more common reasons.</p>



<p>When it comes to projects with AI/ML at the core, all those same reasons apply, plus a few new ones. AI/ML is perhaps the most important piece of general-purpose technology today, which means we are bombarded with AI/ML solutions to solve random or ill-defined problems in much the same way we are bombarded by blockchain solutions for tracking fruit trucks or <a href="https://www.digitaltrends.com/computing/dentacoin-bitcoin-for-dentistry-patients/" target="_blank" rel="noreferrer noopener">visiting the dentist</a>.</p>



<p>The overhype of AI/ML has left people skeptical regarding the promises made through project proposals. Even if you manage to get a project funded, the initial results produced by your model may be difficult to explain, leading to apprehension about deploying solutions which cannot be understood. Nobody wants to blindly follow the decisions and predictions produced by machine learning models no one understands.</p>



<p>It is clear the business world needs a way to build, deploy, and maintain AI/ML models in a consistent manner, with a higher rate of success than failure, and completed on time and within budget.</p>



<h2 class="wp-block-heading" id="h-bizml" style="text-transform:none">bizML</h2>



<p>Thankfully, there exists a modern approach to AI/ML projects. It is called <a href="https://bizml.com" target="_blank" rel="noreferrer noopener">bizML</a>, and it is the core subject inside the new book by Dr. Eric Siegel – <a href="https://amzn.to/3uFR632" target="_blank" rel="noreferrer noopener"><em>The AI Playbook</em></a>.</p>



<p>For any project, not just AI/ML projects, to succeed there must be a rigorous and systematic approach for real-world deployments. Every successful project has similar characteristics &#8211; measurable goals, stakeholder involvement, risk management, resource allocation, fighting scope creep, effective communication, and monitoring project progress before, during, and after deployment.</p>



<p>The AI Playbook breaks this down into digestible sections for anyone with business experience to understand. It outlines bizML as a six-step process for guiding AI/ML projects from conception to deployment: define, measure, act, learn, iterate, and deploy. Using stories from familiar companies such as UPS, FICO, and various dot-coms, <a href="https://www.linkedin.com/in/predictiveanalytics/" target="_blank" rel="noreferrer noopener">Dr. Siegel</a> leans on his experience to help the reader understand how and why even the best ideas often fail.</p>



<p>I don’t want to give away the surprise ending, so I will just say the real secret behind bizML is starting with the end state in mind. Many projects fail because stakeholders are not aligned on the reality of deployment versus expectations. bizML attempts to remove this roadblock by getting everyone aligned on what the end state will look like, and then building toward the agreed-upon state.</p>



<p>I read through the book in less than a couple of days, absorbing the material as fast as possible. The use of personal stories made it easier to read than a purely technical book focusing on code and examples. I cannot emphasize enough how this book is <strong>not a technical manual</strong>, but a business guide for business professionals, executives, managers, consultants, and anyone else wanting to learn how to capitalize on AI/ML tech and collaborate with data professionals.</p>



<h2 class="wp-block-heading" id="h-summary">Summary</h2>



<p>As AI/ML solutions continue to gain traction in the market, this book provides the right framework (bizML) for successful AI/ML deployments at the right time. Anyone, or any company, looking to deploy AI/ML projects (or that has already deployed them) should buy copies of this book for all stakeholders.</p>



<p>I’m putting this onto my <a href="https://thomaslarock.com/sqlserverbooks/" target="_blank" rel="noreferrer noopener">bookshelf</a> and 15/10 would recommend.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2024/02/book-review-the-ai-playbook/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">28750</post-id>	</item>
		<item>
		<title>Export to CSV in Azure ML Studio</title>
		<link>https://thomaslarock.com/2024/01/export-to-csv-in-azure-ml-studio/</link>
					<comments>https://thomaslarock.com/2024/01/export-to-csv-in-azure-ml-studio/#comments</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Wed, 17 Jan 2024 20:48:02 +0000</pubDate>
				<category><![CDATA[Azure]]></category>
		<category><![CDATA[Data Analytics]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=28511</guid>

					<description><![CDATA[<p>The most popular feature in any application is an easy-to-find button saying &#8220;Export to CSV.&#8221; If this button is not visibly available, a simple right-click of your mouse should present such an option. You really should not be forced to spend any additional time on this Earth looking for a way to export your data ... <a title="Export to CSV in Azure ML Studio" class="read-more" href="https://thomaslarock.com/2024/01/export-to-csv-in-azure-ml-studio/" aria-label="Read more about Export to CSV in Azure ML Studio">Read more</a></p>
<p>The post <a href="https://thomaslarock.com/2024/01/export-to-csv-in-azure-ml-studio/">Export to CSV in Azure ML Studio</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>The most popular feature in any application is an easy-to-find button saying &#8220;Export to CSV.&#8221; If this button is not visibly available, a simple right-click of your mouse should present such an option. You really should not be forced to spend any additional time on this Earth looking for a way to export your data to a CSV file. </p>



<p>Well, in Azure ML Studio, exporting to a CSV file should be simple, but is not, unless you already know what you are doing and where to look. I was reminded of this recently, and decided to write a quick post in case a person new to ML Studio was wondering how to export data to a CSV file. </p>



<p>When you are working inside the ML Studio designer, it is likely you will want to export data or outputs from time to time. If you are starting from a blank template, the designer does not make it easy for you to know what module you need (<a href="https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/" target="_blank" rel="noreferrer noopener">similar to my last post on finding sample data</a>). It would be great if Copilot were available!</p>



<p>Now, if you are similar to 99% of data professionals in the world, you will navigate to the section named Data Input and Output, because that’s what you are trying to do: export data from the designer. The description even says “Writes a dataset to…”, so it seems very clear what will happen.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img decoding="async" width="416" height="600" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-data-input-and-output-416x600.png?x65915" alt="" class="wp-image-28535" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-data-input-and-output-416x600.png 416w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-data-input-and-output-218x315.png 218w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-data-input-and-output.png 462w" sizes="(max-width: 416px) 100vw, 416px" /></figure></div>


<p>So, using <a href="https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/" target="_blank" rel="noreferrer noopener">the imdb sample data</a>, we add a module to select all columns, then attach the module to the Export Data module. So easy!</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="578" height="573" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data.png?x65915" alt="" class="wp-image-28536" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data.png 578w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-318x315.png 318w" sizes="auto, (max-width: 578px) 100vw, 578px" /></figure></div>


<p>When you attach it, you need to configure some details for the module. Again, so easy!</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="526" height="600" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-config-526x600.png?x65915" alt="" class="wp-image-28537" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-config-526x600.png 526w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-config-276x315.png 276w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-config.png 582w" sizes="auto, (max-width: 526px) 100vw, 526px" /></figure></div>


<p>We save our configuration options and submit the job to run. When the job is complete, we navigate to view the dataset.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="482" height="369" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-output.png?x65915" alt="" class="wp-image-28538" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-output.png 482w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-export-data-output-411x315.png 411w" sizes="auto, (max-width: 482px) 100vw, 482px" /></figure></div>


<p>Uh-oh, I was expecting a different set of options here. Viewing the log and various outputs does not reveal any CSV file either. Maybe I need to choose the select columns module:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="600" height="470" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output-600x470.png?x65915" alt="" class="wp-image-28539" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output-600x470.png 600w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output-402x315.png 402w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output.png 618w" sizes="auto, (max-width: 600px) 100vw, 600px" /></figure></div>


<p>Ah, that’s better. </p>



<p>Except it isn’t. Instead of showing me the location of the expected CSV file, what I find is this:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="472" height="373" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output-2.png?x65915" alt="" class="wp-image-28540" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output-2.png 472w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-select-columns-output-2-399x315.png 399w" sizes="auto, (max-width: 472px) 100vw, 472px" /></figure></div>


<p>I can preview the data from the select columns module, but there isn’t a way to access the CSV file I was expecting. I suspect this export module is really meant to pass data between pipelines or services. But the purpose and description of the export module is not clear, and a novice user would be unhappy to head down this path only to be disappointed and frustrated. </p>



<p>What we really want to use here is the Convert to CSV module:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full is-resized"><img loading="lazy" decoding="async" width="490" height="228" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv.png?x65915" alt="" class="wp-image-28541" style="width:490px;height:auto"/></figure></div>

<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="558" height="600" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-module-558x600.png?x65915" alt="" class="wp-image-28542" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-module-558x600.png 558w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-module-293x315.png 293w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-module-768x826.png 768w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-module.png 800w" sizes="auto, (max-width: 558px) 100vw, 558px" /></figure></div>


<p>Viewing the results will display this:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="363" height="335" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-output.png?x65915" alt="" class="wp-image-28543" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-output.png 363w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-output-341x315.png 341w" sizes="auto, (max-width: 363px) 100vw, 363px" /></figure></div>


<p>Which has what we are looking for, a download button:</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="600" height="199" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-download-600x199.png?x65915" alt="" class="wp-image-28544" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-download-600x199.png 600w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-download-560x186.png 560w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-convert-csv-download.png 683w" sizes="auto, (max-width: 600px) 100vw, 600px" /></figure></div>


<p>Selecting Download will either default to your browser settings, or you can do a Save As. </p>
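
<p>If you are curious what a conversion to CSV amounts to outside the designer, here is a minimal sketch in Python using only the standard library. To be clear, this is not how the Convert to CSV module is implemented, and it does not call any Azure ML API. The function name <code>rows_to_csv</code> and the sample rows are made up for illustration; the real imdb sample data has its own columns.</p>

```python
import csv
import io


def rows_to_csv(rows, fieldnames):
    """Serialize a list of dict rows to CSV text, header line first."""
    buffer = io.StringIO()
    # lineterminator is set explicitly so the output is the same on every OS.
    writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buffer.getvalue()


# Hypothetical rows for illustration only; not the real sample dataset.
sample_rows = [
    {"title": "Movie A", "year": 1999, "rating": 8.1},
    {"title": "Movie B, Part 2", "year": 2004, "rating": 7.4},
]

csv_text = rows_to_csv(sample_rows, ["title", "year", "rating"])
print(csv_text)
```

<p>Notice the second title contains a comma, so the writer wraps that field in double quotes. That quoting is the sort of detail you want a tool to handle for you, which is why a dedicated conversion step beats gluing strings together by hand.</p>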



<p>As I wrote at the beginning of this post, exporting to a CSV file from within Azure ML Studio is easy to do, if you already know what you are doing. If you are new to Azure ML Studio, you may find yourself frustrated if you expect the Export Data module to produce a CSV file. You will want to use the Convert to CSV module instead. </p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2024/01/export-to-csv-in-azure-ml-studio/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">28511</post-id>	</item>
		<item>
		<title>Azure ML Studio Sample Data</title>
		<link>https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/</link>
					<comments>https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/#comments</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Mon, 08 Jan 2024 20:17:48 +0000</pubDate>
				<category><![CDATA[Azure]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=28471</guid>

					<description><![CDATA[<p>This is one of those posts you write as a note to &#8220;future you&#8221;, when you&#8217;ll forget something, do a search, and find your own post. Recently I was working inside of Azure ML Studio and wanted to browse the sample datasets provided. Except I could not find them. I *knew* they existed, having used ... <a title="Azure ML Studio Sample Data" class="read-more" href="https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/" aria-label="Read more about Azure ML Studio Sample Data">Read more</a></p>
<p>The post <a href="https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/">Azure ML Studio Sample Data</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>This is one of those posts you write as a note to &#8220;future you&#8221;, when you&#8217;ll forget something, do a search, and find your own post. </p>



<p>Recently I was working inside of Azure ML Studio and wanted to browse the sample datasets provided. Except I could not find them. I *knew* they existed, having used them previously, but could not remember if that was in the original ML Studio (classic) or not. </p>



<p>After some trial and error, I found them and decided to write this post in case anyone else is wondering where to find the sample datasets. You&#8217;re welcome, future Tom!</p>



<p>First, you need to log in to Azure ML Studio: <a href="https://ml.azure.com/" target="_blank" rel="noreferrer noopener">https://ml.azure.com/</a>. Once logged in, you will create a workspace. Once the workspace is ready, open it and you will see a splash screen with a lot of interesting widgets, but alas no sample datasets to select.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="600" height="377" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-studio-workspace-600x377.png?x65915" alt="" class="wp-image-28480" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-studio-workspace-600x377.png 600w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-studio-workspace-501x315.png 501w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-studio-workspace-768x483.png 768w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-studio-workspace.png 1364w" sizes="auto, (max-width: 600px) 100vw, 600px" /></figure></div>


<p>To locate the sample datasets you must create a Pipeline. You can start from either the Designer or the Pipeline menu on the left of the workspace screen; both lead to the same place, because selecting Pipeline | New Pipeline opens the Designer.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="265" height="600" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-designer-265x600.png?x65915" alt="" class="wp-image-28481" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-designer-265x600.png 265w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-designer-139x315.png 139w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-designer.png 350w" sizes="auto, (max-width: 265px) 100vw, 265px" /></figure></div>


<p>Once inside the Designer, create a Pipeline either by selecting the pre-defined samples or by selecting the upper-left tile:</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="369" height="320" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-new-pipeline.png?x65915" alt="" class="wp-image-28482" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-new-pipeline.png 369w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-new-pipeline-363x315.png 363w" sizes="auto, (max-width: 369px) 100vw, 369px" /></figure></div>


<p>Now you are in the Authoring screen, and here is where you will find the sample data. However, your default portal experience could have the left-hand menu collapsed. You can expand the menu by clicking on the two brackets (WTH is this really called, a vertical chevron? No idea.) This was not intuitive for me; it took me a bit of time to understand I needed to click on this to view a menu.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="561" height="225" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-designer-authoring-menu.png?x65915" alt="" class="wp-image-28483"/></figure></div>


<p>Once opened, you’ll find sample data as well as some other goodies.</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="276" height="600" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data-276x600.png?x65915" alt="" class="wp-image-28484" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data-276x600.png 276w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data-145x315.png 145w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data.png 523w" sizes="auto, (max-width: 276px) 100vw, 276px" /></figure></div>


<p>Expand the Sample data option and view the full list of datasets.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="504" height="444" src="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data-census.png?x65915" alt="" class="wp-image-28485" srcset="https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data-census.png 504w, https://thomaslarock.com/wp-content/uploads/2024/01/azure-ml-sample-data-census-358x315.png 358w" sizes="auto, (max-width: 504px) 100vw, 504px" /></figure></div>


<p>I don’t know how often the sample data is refreshed, but my guess is “likely never”. So, if you are looking for up-to-date census data, or IMDb movie data, you should consider a different source than the sample datasets provided through Azure ML Studio.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2024/01/azure-ml-studio-sample-data/feed/</wfw:commentRss>
			<slash:comments>2</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">28471</post-id>	</item>
		<item>
		<title>Microsoft Data Platform MVP &#8211; Fifteen Years</title>
		<link>https://thomaslarock.com/2023/08/microsoft-data-platform-mvp-fifteen-years/</link>
					<comments>https://thomaslarock.com/2023/08/microsoft-data-platform-mvp-fifteen-years/#respond</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Thu, 17 Aug 2023 19:53:36 +0000</pubDate>
				<category><![CDATA[Azure]]></category>
		<category><![CDATA[SQL MVP]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=27668</guid>

					<description><![CDATA[<p>I am happy, honored, and humbled to receive the Microsoft Data Platform MVP award for the fifteenth (15th) straight year. Receiving the MVP award during my unforced sabbatical this summer was a bright spot, no question. It reinforced the belief I have in myself &#8211; my contributions have value. Microsoft puts this front and center ... <a title="Microsoft Data Platform MVP &#8211; Fifteen Years" class="read-more" href="https://thomaslarock.com/2023/08/microsoft-data-platform-mvp-fifteen-years/" aria-label="Read more about Microsoft Data Platform MVP &#8211; Fifteen Years">Read more</a></p>
<p>The post <a href="https://thomaslarock.com/2023/08/microsoft-data-platform-mvp-fifteen-years/">Microsoft Data Platform MVP &#8211; Fifteen Years</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>I am happy, honored, and humbled to receive the Microsoft Data Platform MVP award for the fifteenth (15th) straight year.</p>



<p>Receiving the MVP award during my unforced sabbatical this summer was a bright spot, no question. It reinforced the belief I have in myself &#8211; my contributions have value. Microsoft puts this front and center on the award by stating (emphasis mine):</p>



<p class="has-text-align-left"><em>&#8220;We <strong>recognize </strong>and <strong>value </strong>your exceptional contributions to technical communities worldwide.&#8221;</em></p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="449" height="600" src="https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-449x600.jpg?x65915" alt="" class="wp-image-27669" srcset="https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-449x600.jpg 449w, https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-236x315.jpg 236w, https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-768x1026.jpg 768w, https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-1149x1536.jpg 1149w, https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-1532x2048.jpg 1532w, https://thomaslarock.com/wp-content/uploads/2023/08/IMG_9272-scaled.jpg 1915w" sizes="auto, (max-width: 449px) 100vw, 449px" /><figcaption class="wp-element-caption">I&#8217;m running out of room.</figcaption></figure></div>


<p>I recall the aftermath of my first award, when I was told I was the &#8220;least technical SQL Server MVP ever awarded&#8221;. Talk about feeling you have no value! And that was certainly the feeling I had two months ago. </p>



<p>It&#8217;s amazing how something as simple as being recognized by your peers can go so far in making a person feel valued. We should all strive to go out of our way daily to help another human feel valued. </p>



<p>There are plenty of people in the world who are recognized as experts in the Microsoft Data Platform. I&#8217;d like to think I am one of them. I also happen to be fortunate enough to know Microsoft recognizes me as one as well. </p>



<p>But MVPs advocate for Microsoft because we want to, not because we want an award. After all these years I’m still crazy for Microsoft, and I am happy to help promote the best data platform on the planet.</p>



<p>For my fellow MVPs renewed this year, I offer this suggestion &#8211; say thank you. Then say it again. Email the person on the product team who made the widget you enjoy using over and over and tell them how much you appreciate their effort. Email your MVP lead(s) and thank them for all their hard work as well.</p>



<p>A little kindness goes a long way. You never know how much reaching out could mean to that person at that moment. </p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2023/08/microsoft-data-platform-mvp-fifteen-years/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">27668</post-id>	</item>
		<item>
		<title>Pro SQL Server 2022 Wait Statistics Book</title>
		<link>https://thomaslarock.com/2022/10/pro-sql-server-2022-wait-statistics-book/</link>
					<comments>https://thomaslarock.com/2022/10/pro-sql-server-2022-wait-statistics-book/#comments</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Mon, 10 Oct 2022 20:26:54 +0000</pubDate>
				<category><![CDATA[Azure]]></category>
		<category><![CDATA[SQL Server 2022]]></category>
		<category><![CDATA[SQL Server Performance]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=24908</guid>

					<description><![CDATA[<p>After many months of editing, revising, and writing, my new book Pro SQL Server 2022 Wait Statistics is ready for print!</p>
<p>The post <a href="https://thomaslarock.com/2022/10/pro-sql-server-2022-wait-statistics-book/">Pro SQL Server 2022 Wait Statistics Book</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>After many months of editing, revising, and writing, my new book <em>Pro SQL Server 2022 Wait Statistics: A Practical Guide to Analyzing Performance in SQL Server and Azure SQL Database</em> is ready for print!</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><a href="https://amzn.to/3fQr7hz" target="_blank" rel="noreferrer noopener"><img loading="lazy" decoding="async" width="420" height="598" src="https://thomaslarock.com/wp-content/uploads/2022/10/pro_sql_server_2022_wait_statictics.jpg?x65915" alt="Pro SQL Server 2022 Wait Statistics" class="wp-image-24909" srcset="https://thomaslarock.com/wp-content/uploads/2022/10/pro_sql_server_2022_wait_statictics.jpg 420w, https://thomaslarock.com/wp-content/uploads/2022/10/pro_sql_server_2022_wait_statictics-221x315.jpg 221w" sizes="auto, (max-width: 420px) 100vw, 420px" /></a></figure></div>


<p>You can pre-order here: <a href="https://amzn.to/3fQr7hz" target="_blank" rel="noreferrer noopener">https://amzn.to/3fQr7hz</a></p>



<p>I thoroughly enjoyed this project, and I want to thank Apress and Jonathan Gennick for giving me the opportunity to update the previous edition. It felt good to be writing again, something I have not been doing enough of lately. And many thanks to Enrico van de Laar (<a href="https://twitter.com/evdlaar" target="_blank" rel="noreferrer noopener">@evdlaar</a>) for giving me amazing content to start with.</p>



<p>The book is an effort to help explain how, why, and when wait events happen. Of course, I also want to show how to solve issues when they arise. Specific wait events are broken down into parts: definition, remediation, and an example. There are plenty of code examples, allowing the reader to duplicate the scenarios to help understand the wait events better. </p>



<p>It is my understanding we will have a GitHub repository for the sample code. This will make it easy for a reader to access the code for their use. I am hoping to keep the repo up to date and expand upon the examples as I look towards the next version.</p>



<h2 class="wp-block-heading" id="h-pro-sql-server-2022-wait-statistics-at-live-360"><em>Pro SQL Server 2022 Wait Statistics</em> at Live 360!</h2>



<p>I will be presenting material from the book at <a href="https://live360events.com/events/orlando-2022/Home.aspx" target="_blank" rel="noreferrer noopener">SQL Server Live!</a> this November where I have the following sessions, panel discussion, and workshop:</p>



<ul class="wp-block-list"><li>Fast Focus: SQL Server Data Types and Performance</li><li>Locking, Blocking, and Deadlocks</li><li>Performance Tuning SQL Server using Wait Statistics</li><li>SQL Server Live! Panel Discussion: Azure Cloud Migration Discussion</li><li>Workshop: Introduction to Azure Data Platform for Data Professionals</li></ul>



<p>The workshop is a full day training session delivered with Karen Lopez (<a href="https://twitter.com/datachick" target="_blank" rel="noreferrer noopener">@DataChick</a>), and you can register for Live 360 here: <a href="https://na.eventscloud.com/ereg/index.php?eventid=666070" target="_blank" rel="noreferrer noopener">Live 360 Orlando 2022 &#8211; Choose Registration</a></p>



<p>I am hoping to have copies of <em>Pro SQL Server 2022 Wait Statistics</em> at SQL Server Live!. At the time of this post, I do not know the publication date. Amazon shows the book as available for pre-order <a href="https://amzn.to/3fQr7hz" target="_blank" rel="noreferrer noopener">right now</a>.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2022/10/pro-sql-server-2022-wait-statistics-book/feed/</wfw:commentRss>
			<slash:comments>1</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">24908</post-id>	</item>
		<item>
		<title>Stop Using Production Data For Development</title>
		<link>https://thomaslarock.com/2022/01/stop-using-production-refresh-development/</link>
					<comments>https://thomaslarock.com/2022/01/stop-using-production-refresh-development/#comments</comments>
		
		<dc:creator><![CDATA[Thomas LaRock]]></dc:creator>
		<pubDate>Mon, 31 Jan 2022 19:33:43 +0000</pubDate>
				<category><![CDATA[Data Security and Privacy]]></category>
		<category><![CDATA[SQL MVP]]></category>
		<guid isPermaLink="false">https://thomaslarock.com/?p=21592</guid>

					<description><![CDATA[<p>A common software development practice is to take data from a production system and restore it to a different environment, often called &#8220;test&#8221;, &#8220;development&#8221;, &#8220;staging&#8221;, or even &#8220;QA&#8221;. This allows for support teams to troubleshoot issues without making changes to the true production environment. It also allows for development teams to build new versions and ... <a title="Stop Using Production Data For Development" class="read-more" href="https://thomaslarock.com/2022/01/stop-using-production-refresh-development/" aria-label="Read more about Stop Using Production Data For Development">Read more</a></p>
<p>The post <a href="https://thomaslarock.com/2022/01/stop-using-production-refresh-development/">Stop Using Production Data For Development</a> appeared first on <a href="https://thomaslarock.com">Thomas LaRock</a>.</p>
]]></description>
										<content:encoded><![CDATA[
<p>A common software development practice is to take data from a production system and restore it to a different environment, often called &#8220;test&#8221;, &#8220;development&#8221;, &#8220;staging&#8221;, or even &#8220;QA&#8221;. This allows for support teams to troubleshoot issues without making changes to the true production environment. It also allows for development teams to build new versions and features of existing products in a non-production environment. Using production to refresh development is just one of those things everyone accepts and does, without question.</p>



<p>Of course the idea of testing in a non-production environment isn&#8217;t anything new. Consider Haggis. No way someone thought to themselves &#8220;let me just shove everything I can into this sheep&#8217;s stomach, boil it, and serve it for dinner tonight.&#8221; You know they first fed it to the neighbor nobody liked. Probably right after they shoved a carton of milk in their face and asked &#8220;does this smell bad to you?&#8221;</p>



<p>For decades software development has made it a standard practice to create copies of production data and restore them to other non-production environments. It was not without issues, however. For example, as data sizes grew so did the length of time to do a restore. This also clogged network bandwidth, not to mention the costs associated with storage. </p>



<p>And then there is this:</p>



<figure class="wp-block-embed is-type-rich is-provider-twitter wp-block-embed-twitter"><div class="wp-block-embed__wrapper">
<blockquote class="twitter-tweet" data-width="550" data-dnt="true"><p lang="en" dir="ltr">If you restore a production database to a development environment and don’t cleanse or mask the data, it’s still production data.</p>&mdash; Henge Witch (@HengeWitch) <a href="https://twitter.com/HengeWitch/status/1483500385180418048?ref_src=twsrc%5Etfw">January 18, 2022</a></blockquote><script async src="https://platform.twitter.com/widgets.js" charset="utf-8"></script>
</div></figure>



<p>If you read that tweet and thought &#8220;yeah, what&#8217;s your point?&#8221; then you are part of the problem. </p>



<p>As an industry we focus on access to specific environments, but not the assets in the environments. This is wrong. The royal family knows where the Crown Jewels are stored, but if they are moved to another location, you know the Jewels are heavily guarded at all times. <strong>Access to the jewels is important no matter where the jewels are located</strong>. The same should be true of your production data.</p>



<div class="wp-block-image"><figure class="aligncenter"><img decoding="async" src="https://upload.wikimedia.org/wikipedia/commons/e/eb/Crown_jewels_Poland_8.JPG" alt="Use production to refresh development."/><figcaption><em>Then again, that stick might be pointy enough to fend off any attacker.</em></figcaption></figure></div>



<p>Data is the most critical asset your company owns. <strong>If you make efforts to lock down production but allow production data to flow to less-secure environments, then you haven&#8217;t locked down production</strong>.</p>



<p>It is ludicrous to think about the billions of dollars spent to lock down physical access to data centers only to allow junior developers to stuff customer data on a laptop they will then leave behind on a bus. Or senior developers leaving S3 buckets open. Or forgetting they pushed credentials to a GitHub repo. </p>



<p>If you are still moving production data between environments you are a data breach waiting to happen. I don&#8217;t care what the auditors say, you are at an elevated and unnecessary risk. Like when Obi-Wan decides to protect baby Luke by keeping his name and taking him to Darth Vader&#8217;s home planet. Nice job, Ben, no way this ends up with you dying, naked, in front of a few dozen onlookers. </p>



<p>I think what frustrates me most is this entire system is unnecessary. You have options when moving production data. You can use data masking, obfuscation, and encryption in order to reduce your risk. But the best method is to <strong>not move your data at all</strong>.</p>
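


<p>If you truly have no choice but to copy rows out of production, mask them before they leave. Here is a rough Python sketch of one way that could look, using a keyed hash so the same input always maps to the same token. The key, the function names, and the column names are all made up for illustration; this is not a vetted masking tool.</p>



<pre class="wp-block-code"><code>import hashlib
import hmac

# Hypothetical secret; in practice it would live outside the source code.
MASK_KEY = b"replace-me"


def mask_value(value, key=MASK_KEY):
    """Replace a sensitive string with a stable, non-reversible token."""
    digest = hmac.new(key, value.encode("utf-8"), hashlib.sha256).hexdigest()
    return "masked_" + digest[:12]


def mask_row(row, sensitive_columns):
    """Return a copy of row with the named columns masked."""
    return {
        column: mask_value(str(value)) if column in sensitive_columns else value
        for column, value in row.items()
    }</code></pre>



<p>Because the token is stable, joins on a masked column still line up. A keyed hash is only as safe as the key, though, so treat this as a starting point rather than a guarantee.</p>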



<p>After years of being told &#8220;don&#8217;t test in production&#8221; it&#8217;s time to think about testing in production. <a href="https://www.infoworld.com/article/3271126/what-is-cicd-continuous-integration-and-continuous-delivery-explained.html" target="_blank" rel="noreferrer noopener">Continuous integration and continuous delivery/deployment</a> (CI/CD) allow for you to achieve this miracle. And for those that say &#8220;No, you dummy, CI/CD is what you do in test <strong>before</strong> you push to production,&#8221; I offer the following.</p>



<p>Use dummy data.</p>



<p>You don&#8217;t need production data, you need data that <strong>looks</strong> like production data. You don&#8217;t need actual customer names and addresses, you need similar names and addresses. And there are ways to <a href="https://thomaslarock.com/2015/07/how-to-recreate-sql-server-statistics-in-a-different-environment/" target="_blank" rel="noreferrer noopener">simulate the statistics in your database</a>, too, so your query plans have the same shape as production without the actual volume of data.</p>
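


<p>To give a feel for how little it takes to get started, here is a minimal Python sketch that builds made-up customer rows from small word lists. The name pools, the column names, and the function name are my own assumptions for illustration; your real schema will need more than this.</p>



<pre class="wp-block-code"><code>import random

# Hypothetical name and street pools; none of these come from a real system.
FIRST_NAMES = ["Alex", "Jordan", "Sam", "Taylor", "Morgan"]
LAST_NAMES = ["Rivera", "Chen", "Okafor", "Novak", "Haddad"]
STREETS = ["Maple St", "Oak Ave", "Pine Rd", "Cedar Ln"]


def make_fake_customers(count, seed=42):
    """Return a list of made-up customer rows shaped like production rows."""
    rng = random.Random(seed)  # seeded so every test run sees the same rows
    rows = []
    for customer_id in range(1, count + 1):
        rows.append({
            "customer_id": customer_id,
            "name": f"{rng.choice(FIRST_NAMES)} {rng.choice(LAST_NAMES)}",
            "address": f"{rng.randint(1, 9999)} {rng.choice(STREETS)}",
        })
    return rows</code></pre>



<p>Calling <code>make_fake_customers(1000)</code> gives you a thousand rows that no customer will ever complain about losing. The fixed seed is a deliberate choice: repeatable data makes failing tests easier to reproduce.</p>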



<p>It&#8217;s possible for you to develop software code against simulated production data, as opposed to actual production data. But doing so requires more work, and nobody likes more work.</p>



<p>Until you are breached, of course. Then the extra work won&#8217;t be optional. </p>
]]></content:encoded>
					
					<wfw:commentRss>https://thomaslarock.com/2022/01/stop-using-production-refresh-development/feed/</wfw:commentRss>
			<slash:comments>5</slash:comments>
		
		
		<post-id xmlns="com-wordpress:feed-additions:1">21592</post-id>	</item>
	</channel>
</rss>
