The SAS Data Science Blog

There is yet another AI productivity gap

Sophia Rowland — Mon, 22 Dec 2025 15:20:47 +0000

When I first started as a data scientist, there was a gap. I met with dozens of organizations who would invest time and resources into building accurate and tuned models and then ask, “What now?” They had a fantastic model in hand but couldn’t get it into a place and form where it could be used to make a better decision or improve a specific outcome. At the time, I called this chasm “the gap between finding insights and using insights.”

Earlier this year, research from MIT found that 95% of GenAI investments have a 0% return on investment. When you consider that enterprises have invested $30 to $40 billion in GenAI projects, you realize just how much money gets thrown into GenAI haphazardly. MIT has called this new gap “The GenAI Divide.” I call it “Yet Another AI Productivity Gap,” or YAAPG for short.

But within the YAAPG, there is a paradox. Personal GenAI tools are improving personal productivity. Anthropic recently reported that Claude says it speeds up individual tasks by about 80%. Yes, Claude, the LLM. The LLM that also said that it could deliver products in person at one point, so I won’t fault you if you take Claude’s statements with a pinch of salt. Yet, when used thoughtfully, many individuals report productivity gains from GenAI. Power users understand how to use GenAI in their personal work to create value and avoid AI Slop. In fact, All Things Open released a whole guide that features tips and use cases for improving personal productivity using GenAI.

As we look across the GenAI Divide, what prevents organizations from seeing a return on their investments? Is it poor data processing capabilities? Is it GenAI systems that can’t learn? Is it a lack of trust in these systems? Is it a lack of AI Governance? Or are these just the same problems we’ve always seen when organizations operationalize AI? And what can we learn from personal productivity gains?

Lessons from Russian Novelists

In the opening line of Anna Karenina, Leo Tolstoy wrote, “All happy families are alike; each unhappy family is unhappy in its own way.” This line inspired the Anna Karenina principle: success relies on several key factors coming together correctly, while failure can occur in any number of ways. Operationalizing AI likewise requires many things to be done correctly.

And when we look at power users of GenAI for personal productivity, they’re often doing several things right. First, they know what problem they’re trying to solve. Second, they understand the benefits of solving that problem. They find the right tool for the problem, make adjustments to the tool, and understand how to feed their inputs to that tool. They observe the outputs, judge their quality, and make adjustments. They’ll try new things, learn how best to work with the tool, and trust their usage of the tool.

Let’s start with a personal productivity example to demonstrate those points. Many people use generative AI tools as coding assistants. These assistants may write comments to document code, answer questions, debug problems, suggest changes, write unit tests, and more. Each time a software developer uses their coding assistant, they use it to address a specific problem — and they understand the time-saving benefits of the tool. Software developers may be limited to using the tools their organization purchases, but these tools are often vetted and provided context through the organization’s code base. The input to the tool is often just the developer’s questions and relevant code. Importantly, the programmer is in the loop. The software developer sees what the coding assistant suggests, determines if it is satisfactory, and either approves it or makes adjustments. And, over time, the developer learns the ins and outs of using the coding assistant. The programmers begin to understand how they can ask better questions or create better prompts to get more satisfactory answers from the coding assistant. Through this experimentation, the software developer learns how to better use the GenAI tool, and not just trusts the tool, but trusts their use of the tool.

Scaling to Enterprise Problems

How does this scale to enterprise projects? First, before any project starts, you should know what you’re trying to accomplish. Many organizations see GenAI as a hammer where everything is a nail. I’ve spoken with individuals across organizations that were tasked with creating a project where they could use an LLM, GenAI tool, or Agent. An indiscriminate use of AI does not create ROI. A better approach would be to find a pain point or something the organization can do better. If you don’t know what your pain points or gaps are, ask. Employees, customers, or users are a great place to start.

By having a clear problem to solve or an outcome you’re trying to achieve, you can next state the benefits of solving that problem. If you can prevent customers from abandoning their carts in your online store, you can increase your revenue. If you can help customers answer simple questions or perform basic tasks using an online chatbot, you can give your employees time to work on more productive tasks, improving output. Now that you know the benefit of solving the problem, you can create a metric to measure project success. At this stage you can estimate if this problem is worth solving by weighing the expected benefit against the estimated costs.

If you’re ready to move forward, now you need to find the right tool for the job. Sometimes this is a shiny new AI tool, but sometimes it’s just an ETL pipeline feeding to a dashboard. Sometimes you need to look at the simplest solution that will get the job done. AI is more expensive and less predictable than business rules and code. If this problem truly is best solved using AI, next determine which AI tool is best for the job. There are many choices among AI tools with options like building in-house, getting access to a generalized model, acquiring a model fine-tuned for a specific task, or even getting a tool that wraps a model and other enhancements. Here you have to weigh the purchase cost, hosting costs, and employee time to implement the tools, with the expected benefit of the model towards the task.
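The weighing of expected benefit against cost described above can be sketched in a few lines. This is only an illustration: the `ProjectEstimate` class, its field names, and every dollar figure are hypothetical and would be replaced by your own estimates and success metric.

```python
from dataclasses import dataclass


@dataclass
class ProjectEstimate:
    """Hypothetical yearly figures for one candidate project."""
    expected_benefit: float  # e.g. revenue recovered from fewer abandoned carts
    purchase_cost: float     # licences or usage fees for the tool
    hosting_cost: float      # infrastructure needed to run the tool
    staff_cost: float        # employee time to implement and maintain it

    @property
    def total_cost(self) -> float:
        return self.purchase_cost + self.hosting_cost + self.staff_cost

    @property
    def roi(self) -> float:
        """Net benefit expressed as a fraction of total cost."""
        return (self.expected_benefit - self.total_cost) / self.total_cost


def worth_pursuing(estimate: ProjectEstimate, minimum_roi: float = 0.0) -> bool:
    """True when the estimated ROI clears the chosen threshold."""
    return estimate.roi > minimum_roi


# Made-up numbers for an abandoned-cart project.
cart_recovery = ProjectEstimate(
    expected_benefit=250_000,
    purchase_cost=60_000,
    hosting_cost=20_000,
    staff_cost=120_000,
)
print(f"ROI: {cart_recovery.roi:.0%}, pursue: {worth_pursuing(cart_recovery)}")
```

Running the same comparison for a simpler option, such as business rules or a dashboard, makes it easier to see whether the AI tool actually earns its extra cost.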

Before you can productionalize your new tool, you need to understand and set up the upstream processes that feed data to the tool and the downstream processes that monitor and observe the tool’s outputs. For some use cases, the user will provide their prompts or questions directly to the tool. In others, the tool may operate as part of an automated pipeline. In that case, your organization may give the tool access to a variety of documents, data files, or a specific, fine-tuned prompt. Additionally, if any information is collected from a user, you may need to clean that input to remove Personally Identifiable Information (PII), Intellectual Property (IP), toxic input, or attempts at prompt injection.

Beyond the upstream data, the tool’s outputs should also be monitored, with guardrails in place to evaluate, approve, reject, or adjust outputs from the GenAI tool. Again, you want to ensure that the tool doesn’t return PII, IP, or toxic responses, but you may also want to check that the response is formatted correctly, is on task, and references accurate information. If a response has a flaw, there could be logic that explains why the response was incorrect and lets the GenAI tool try again. Upstream and downstream processes can be performed manually, as part of an automated pipeline, or as a hybrid approach where manual intervention takes place when specific conditions are met. But before investing in a GenAI project, it’s important to have some idea of the level of effort needed to integrate the tool into a wider production pipeline.
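To make the input cleaning and output guardrails more concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the two regular expressions cover only e-mail addresses and US-style phone numbers, `fake_tool` stands in for a real GenAI tool, and a production guardrail would need far broader checks.

```python
import re
from typing import Callable, Optional

# Deliberately narrow, illustrative PII patterns.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.-]+")
PHONE = re.compile(r"\b\d{3}[-.\s]\d{3}[-.\s]\d{4}\b")


def redact_pii(text: str) -> str:
    """Replace e-mail addresses and phone numbers with placeholders."""
    text = EMAIL.sub("[EMAIL]", text)
    return PHONE.sub("[PHONE]", text)


def check_output(response: str, max_chars: int = 500) -> Optional[str]:
    """Return the reason a response is rejected, or None if it passes."""
    if not response.strip():
        return "response was empty"
    if len(response) > max_chars:
        return f"response exceeded {max_chars} characters"
    if redact_pii(response) != response:
        return "response contained PII"
    return None


def guarded_call(generate: Callable[[str], str], user_input: str,
                 max_attempts: int = 3) -> Optional[str]:
    """Clean the input, call the tool, and retry with feedback on failure."""
    clean_input = redact_pii(user_input)
    prompt = clean_input
    for _ in range(max_attempts):
        response = generate(prompt)
        problem = check_output(response)
        if problem is None:
            return response
        # Explain why the last attempt was rejected and let the tool try again.
        prompt = f"{clean_input}\n\nPrevious answer rejected: {problem}"
    return None  # no acceptable answer: hand off to a human reviewer


def fake_tool(prompt: str) -> str:
    """Stand-in for a GenAI tool that leaks an address until corrected."""
    if "rejected" in prompt:
        return "Your order has shipped."
    return "Your order has shipped. Contact jane.doe@example.com for help."


result = guarded_call(fake_tool, "Where is my order? Call me at 919-555-0100.")
print(result)
```

Returning `None` after the last attempt is one way to implement the hybrid approach: the pipeline runs automatically, and a person steps in only when the checks keep failing.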

Finally, you should encourage teams to experiment with their tools in a responsible manner. By having a safe space to experiment, teams can learn what works and what doesn’t, adjust their inputs, report problems, and come to trust their usage of the tool to improve outcomes for the organization.

Conclusion

In conclusion, before enterprises can see ROI on their GenAI investments, they must master the following:

Know what problems they’re trying to solve
Understand the benefits of solving that problem
Find the right tool for solving the problem
Know their data pipeline
Monitor, observe, and adjust outputs from the tool
Experiment, learn, and trust their usage of the tool

Traversing the GenAI divide will take a deliberate and thoughtful approach to solve a real problem rather than an indiscriminate use of AI.

The post There is yet another AI productivity gap appeared first on The SAS Data Science Blog.

The rise of small language models for information extraction

William Nadolski — Wed, 17 Dec 2025 14:11:47 +0000

Part 2 in the multimodal transformers: AI foundation models series

In the previous post, we explored how transformer-based models became the foundation for the modern wave of multimodal AI. This post continues that conversation but shifts the focus from architecture to application. Specifically, it looks at how organizations extract structured information from unstructured text by using named entity recognition (NER) and related text analytics tasks.

Until recently, enterprises have had to rely on one of two very different approaches: classic NLP systems built on rules and statistical models, or large language models (LLMs) that can generalize across domains with little to no training. Although both approaches are valuable, they represent opposite ends of a spectrum. Traditional NLP is fast, deterministic, and easy to audit, but rigid and difficult to maintain. LLMs, on the other hand, are astonishingly flexible but computationally expensive, unpredictable, and challenging to operationalize in regulated environments.

Now, a new class of lightweight transformer models, known as small language models (SLMs), is emerging as a promising middle ground. In this post, we explore how an exciting family of SLMs, the Generalist and Lightweight Model for Named Entity Recognition (GLiNER), combines the strengths of traditional NLP and LLM-based approaches.

Traditional NLP versus LLMs: Two ends of the spectrum

Before the rise of LLMs, developers built most NER systems by using regular expressions, dictionaries, and heuristic rules. These approaches remain incredibly useful. They run efficiently on CPUs, scale to millions of documents, and produce fully deterministic outputs. For tasks where auditability matters—such as identifying sensitive information in clinical records or regulatory filings—these methods still shine. They provide the literal match text, exact character offsets, and clear justification for why a match occurred. They are also easy to deploy in locked-down, privacy-conscious environments because they require no external APIs or specialized hardware.
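As an illustration of the rule-based style described above, the following sketch combines a small dictionary with a regular expression. The medication list and the dosage pattern are hypothetical examples, not a production rule set.

```python
import re
from typing import Dict, List

# Hypothetical rules: a tiny medication dictionary and a dosage pattern.
MEDICATIONS = ["metformin", "insulin", "lisinopril"]
RULES = {
    "medication": re.compile(
        r"\b(?:" + "|".join(MEDICATIONS) + r")\b", re.IGNORECASE),
    "dosage": re.compile(r"\b\d+(?:\.\d+)?\s?(?:mg|ml|mcg)\b", re.IGNORECASE),
}


def extract(text: str) -> List[Dict[str, object]]:
    """Return every rule match with its text, offsets, and justification."""
    matches = []
    for label, pattern in RULES.items():
        for m in pattern.finditer(text):
            matches.append({
                "label": label,
                "text": m.group(0),       # literal match text
                "start": m.start(),       # exact character offsets
                "end": m.end(),
                "rule": pattern.pattern,  # why the match occurred
            })
    return sorted(matches, key=lambda e: e["start"])


note = "The patient was prescribed 500mg metformin."
for entity in extract(note):
    print(entity["label"], repr(entity["text"]), entity["start"], entity["end"])

# A small change in wording is enough to slip past the dosage rule.
print([e["label"] for e in extract("Take 500 milligrams of Metformin")])
```

The first call finds both the dosage and the medication. The second finds only the medication, because the dosage rule never anticipated the spelled-out unit.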

But their reliability comes at the cost of flexibility. Traditional NLP is only as good as the rules or dictionaries behind it. Even small variations in wording or formatting can break those rules. Maintaining these systems often becomes a never-ending cycle of patching and re-patching, especially in domains where terminology shifts frequently. Because they are tied to predefined labels, traditional methods make it difficult to identify new concepts without significant re-engineering.

The rise of LLMs promised a cure for these pain points. Models like GPT and Gemini can recognize new entities or categories on the fly, even when those categories were never part of their original training. A simple instruction, such as “extract all construction activities” or “find all mentions of medications,” is often enough to get meaningful results without writing a single rule. The flexibility and generalization ability of these models have made them extremely appealing for information extraction tasks.

However, they introduce a different set of challenges. Running a large transformer model locally is slow and resource-intensive, often requiring expensive GPU hardware. Cloud-based inference is faster but raises cost, latency, and data privacy concerns. Most importantly, LLMs are non-deterministic: the same input might produce different outputs at times, making auditing and validation harder. They can also hallucinate, confidently generating entities or facts that do not appear in the source text. These factors limit their usefulness in workflows requiring strict reproducibility and trust.

Given these trade-offs, many organizations are left wondering whether there is a practical middle path between rule-based systems and full-scale LLMs. This is where SLMs, and GLiNER specifically, enter the picture. Table 1 compares the strengths and weaknesses of different NER approaches.

Table 1: Comparison of the strengths and weaknesses of different NER approaches

GLiNER: A practical middle ground for modern NER

The GLiNER family was designed precisely to bridge the gap between traditional NLP and LLM-based extraction. GLiNER models are transformer-based, but much smaller than the generative models dominating the headlines. What makes them compelling is that they inherit many of LLMs' capabilities while avoiding many of their drawbacks by virtue of being discriminative rather than generative AI models.

GLiNER models run efficiently on CPUs, making them easy to deploy on laptops, servers, or edge devices without any specialized hardware (though they can still benefit from GPU acceleration). Because they operate deterministically, they produce stable, repeatable outputs. Unlike many LLM-based systems, GLiNER returns literal match text, character offsets, and confidence scores. This makes it highly auditable and suitable for regulated domains. And because they can be executed locally, they preserve privacy and avoid the cost and latency of cloud inference.

Despite their small size, GLiNER models support impressive zero-shot entity recognition. This means they can dynamically identify new entity categories from user-provided descriptions, which gives them a degree of flexibility that traditional NLP systems simply cannot match. At the same time, GLiNER avoids many of the pitfalls of LLMs: because it selects spans from the input rather than generating text, it cannot hallucinate entities that are absent from the source; it does not require heavy hardware infrastructure; and it behaves predictably across repeated runs.

The primary trade-off is that GLiNER still benefits from being tuned to the specific domain or task at hand. While zero-shot extraction works well for many general categories, domain-heavy environments such as transportation, finance, or clinical workflows typically see a quality boost when the model is calibrated with example documents or lightly fine-tuned. But this level of tuning is dramatically simpler than maintaining a full rules-based pipeline or training a large generative model from scratch.

How GLiNER works: A high-level view

At a conceptual level, GLiNER works by transforming both the input text and the user-provided labels into vector representations. Instead of relying on fixed, predefined labels, GLiNER dynamically compares the input against these label embeddings. If a span of text is semantically similar to a requested label (say, “disease,” “equipment failure,” or “construction activity”), GLiNER identifies that span as an entity. The model outputs the matched text, its character offsets, and a confidence score indicating how well the span corresponds to the label.
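The span-versus-label comparison can be illustrated with a toy sketch. This is not the GLiNER implementation or its API: the three-dimensional word vectors are hand-assigned, the labels are limited to the three defined here, and the scoring is a plain dot product. A real model learns its representations with a transformer encoder and accepts arbitrary label text. Only the overall flow (enumerate spans, score each against each label, keep confident non-overlapping matches) is meant to carry over.

```python
import re
from typing import Dict, List, Tuple

# Toy vectors; the axes stand for (disease, medication, dosage).
# The values are made up purely for illustration.
WORD_VECTORS: Dict[str, Tuple[float, float, float]] = {
    "type": (1.0, 0.0, 0.0), "2": (1.0, 0.0, 0.0), "diabetes": (1.0, 0.0, 0.0),
    "metformin": (0.0, 1.0, 0.0),
    "500mg": (0.0, 0.0, 1.0),
}
LABEL_VECTORS = {
    "disease": (1.0, 0.0, 0.0),
    "medication": (0.0, 1.0, 0.0),
    "dosage": (0.0, 0.0, 1.0),
}
ZERO = (0.0, 0.0, 0.0)


def span_vector(tokens: List[str]) -> Tuple[float, ...]:
    """Average the toy word vectors; unknown words contribute zeros."""
    vectors = [WORD_VECTORS.get(t.lower(), ZERO) for t in tokens]
    return tuple(sum(dim) / len(vectors) for dim in zip(*vectors))


def extract(text: str, labels: List[str], threshold: float = 0.8,
            max_width: int = 3) -> List[dict]:
    tokens = list(re.finditer(r"\w+", text))
    candidates = []
    # Enumerate every span of up to max_width tokens and score it per label.
    for i in range(len(tokens)):
        for j in range(i, min(i + max_width, len(tokens))):
            vec = span_vector([m.group(0) for m in tokens[i:j + 1]])
            for label in labels:
                score = sum(a * b for a, b in zip(vec, LABEL_VECTORS[label]))
                if score >= threshold:
                    start, end = tokens[i].start(), tokens[j].end()
                    candidates.append({"text": text[start:end], "label": label,
                                       "start": start, "end": end,
                                       "score": score})
    # Keep the best-scoring spans, preferring longer ones, and drop overlaps.
    candidates.sort(key=lambda c: (-c["score"], c["start"] - c["end"]))
    chosen: List[dict] = []
    for c in candidates:
        if all(c["end"] <= k["start"] or c["start"] >= k["end"] for k in chosen):
            chosen.append(c)
    return sorted(chosen, key=lambda c: c["start"])


sentence = ("The patient was diagnosed with Type 2 diabetes "
            "and prescribed 500mg metformin.")
entities = extract(sentence, ["disease", "medication", "dosage"])
for e in entities:
    print(e["label"], repr(e["text"]), e["start"], e["end"], e["score"])
```

Because each result is a slice of the input, the offsets always point at literal text in the source, which is what makes this style of extraction easy to audit.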

For example, given the sentence “The patient was diagnosed with Type 2 diabetes and prescribed 500mg metformin,” you could instruct GLiNER to extract concepts like “disease,” “medication,” or “dosage” without having pre-trained the model on these terms. GLiNER will identify Type 2 diabetes as a disease, metformin as a medication, and 500mg as a dosage, complete with offsets and confidence levels. This ability to generalize using dynamic labels makes GLiNER extremely powerful for workflows where new categories often emerge or where there are many permutations of a desired concept definition. Figure 1 shows a dashboard view of the output using a different, more complex example.

Figure 1: Example GLiNER dashboard

The same underlying mechanism enables GLiNER to perform lightweight document classification. Instead of looking for text spans, the model simply compares the overall document embedding to a set of category embeddings. It then returns the closest matches. The result is a fast, flexible classification engine that requires minimal setup.

This approach aligns with an important trend highlighted in NVIDIA’s recent paper, “Small Language Models Are the Future of Agentic AI.” The industry is recognizing that although large models provide broad capability, small models provide practical usability. They are faster, cheaper, and easier to deploy in environments where reliability, privacy, and control matter (for example, enterprise applications). For reference, the GLiNER architecture described within Figure 2 is sourced from here.

Figure 2: Illustration of the GLiNER model architecture

Where the industry is heading: LLM-Orchestrated SLMs and the rise of agentic AI

The past two years have shown that although LLMs excel at broad reasoning tasks, they are not always the best tool for executing specialized or repetitive functions such as NER, classification, retrieval, or data validation. As a result, a new architectural pattern has begun to dominate research and emerging commercial systems: using large, highly capable LLMs as orchestrators and planners while delegating most task execution to specialized SLMs.

Instead of relying on a single monolithic model to do everything (which is often needlessly expensive, computationally slow, and overkill for simpler tasks), the industry is moving toward distributed, tool-driven AI ecosystems in which:

The LLM acts as the “brain”, handling reasoning, decomposition of tasks, decision-making, and task orchestration.
SLMs and other specialized models act as “tools”, performing concrete actions such as extraction, classification, vision segmentation, retrieval, or structured data generation.

The paper “Small Language Models Are the Future of Agentic AI” underscores this trend. The authors argue that the most effective AI systems of the future will rely on large models not as all-purpose engines, but as generalist controllers that coordinate fleets of smaller, optimized components. This shift dramatically improves efficiency, reliability, and cost.

For information extraction specifically, this trend means that an LLM might eventually be responsible for interpreting the user’s intent (for example, “extract all construction activity mentions from these reports”), deciding which tools to call, and assembling the final structured output. But the actual extraction, the token-level work of identifying entities and capturing offsets, is performed by deterministic, efficient models like GLiNER.
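A rough sketch of this division of labor follows. All of the names are hypothetical, and both the planner and the tools are simple stand-ins so that the example runs without any model: a real system would ask an LLM to choose the tool and would call a small model such as GLiNER to do the extraction.

```python
import re
from typing import Callable, Dict, List


def extract_entities(text: str) -> List[dict]:
    """Stand-in for a small extraction model: finds capitalized words."""
    return [{"text": m.group(0), "start": m.start(), "end": m.end()}
            for m in re.finditer(r"\b[A-Z][a-z]+\b", text)]


def classify_document(text: str) -> List[dict]:
    """Stand-in for a small classification model."""
    label = "incident" if "failure" in text.lower() else "routine"
    return [{"label": label}]


# Registry of specialized tools the orchestrator may call.
TOOLS: Dict[str, Callable[[str], List[dict]]] = {
    "extract": extract_entities,
    "classify": classify_document,
}


def plan(request: str) -> str:
    """Stand-in for the LLM planner: maps a request to a tool name.

    Keyword matching is used here only so the sketch needs no model.
    """
    return "extract" if "extract" in request.lower() else "classify"


def run(request: str, documents: List[str]) -> dict:
    tool_name = plan(request)                  # reasoning and tool choice
    tool = TOOLS[tool_name]
    results = [tool(doc) for doc in documents]  # execution by the small model
    return {"request": request, "tool": tool_name, "results": results}


report = run("Extract all site names from these reports",
             ["Crew poured concrete at Raleigh."])
print(report["tool"], [e["text"] for e in report["results"][0]])
```

The orchestrator only decides and assembles. Each tool can be swapped, tested, and audited on its own, which is where the efficiency and reliability gains are expected to come from.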

This architecture combines the reasoning power of LLMs with the stability and speed of SLMs, producing outputs that are far more robust and cost-effective than those of a single model alone. It also reflects a broader convergence between symbolic AI (systems that value determinism and structure) and neural AI (systems that value generalization and flexibility). The next generation of enterprise AI systems will increasingly be hybrid, agentic, and tool-aware.

Where the SAS Applied AI and Modeling Division is heading next

As agentic AI and SLMs continue to mature, they are rapidly becoming the most practical option for enterprise-grade information extraction. They combine the stability and auditability of traditional NLP with the flexibility and intelligence of transformer-based systems, all without incurring the operational challenges of full-scale LLMs.

Recognizing this, the SAS Applied AI and Modeling division is actively working to incorporate GLiNER-based capabilities directly into our SAS Document Analysis offering. This will enable customers to perform zero-shot and domain-specific entity extraction locally, with deterministic behavior, full auditability, and minimal hardware requirements. It represents a significant step forward in making advanced information extraction more accessible, reliable, and affordable across industries.

In the next post in this series, we’ll shift from text to vision and explore the Segment Anything Model (SAM). This is another zero-shot foundation model reshaping how organizations approach image segmentation in computer vision.

Stay tuned—there’s a lot more to come!

LEARN MORE | GLiNER Python Implementation on GitHub

LEARN MORE | GLiNER Model Weights on HuggingFace

No one wants your AI Slop

Sophia Rowland — Tue, 16 Dec 2025 15:42:57 +0000

Generative AI has seen drastic improvement in image, video, audio, and text generation within the last few years. Humans, on the other hand, are still catching up on determining when it’s appropriate to use Generative AI and how to review the content generated by AI before sharing it with others. In September 2025, research found 40% of office workers received AI Slop in the last month.

AI Slop is AI-generated content that is poor-quality and low-effort. AI Slop can include research that cites made-up references, blogs with unnecessarily superfluous word choices, emails that are difficult to understand, slide decks that are mostly fluff, code with nonsensical design patterns, or images with strange visuals. In the workplace, individuals who receive AI Slop from their coworkers spend up to 2 hours fixing it. Individuals who receive AI Slop also report feeling annoyed, confused, and offended. An overreliance on AI tooling can not only frustrate your coworkers but can also erode critical thinking and reduce your own understanding.

AI Slop isn’t exclusively a workplace problem. AI Slop has been found in advertising, films, blogs, books, and social media. And one person’s AI Slop may be another's misinformation. For example, social media is rife with content where users argue in the comments about whether the post is AI Generated.

Crab Jesus is not real and cannot hurt you

Like AI Slop, Deepfakes are AI-generated content, but Deepfakes are created to depict real or realistic people, events, or environments. Deepfakes can be created for entertainment but they are often used to spread misinformation and disinformation. Deepfakes can be used to increase engagement for a social media post or cause divisions between communities. And while some attempts at Deepfakes are clear AI Slop, others are harder to spot. Creators of Deepfakes are often not transparent in how the image, video, or audio was generated as it is often not in their best interest to do so.

Are these bunnies really bouncing on a trampoline?

Best practices for using Generative AI thoughtfully as an individual

Generative AI can be a powerful productivity tool when used thoughtfully. Many of us don’t pick up Generative AI or Copilot tools with the intention of frustrating our coworkers or spreading disinformation. When used correctly, it can save time, help brainstorm, reduce busy work, and visualize ideas. The most effective users of Generative AI often use the content as a starting point, for brainstorming, or as a proof-of-concept, or they have the expertise to manually review and edit the content before sharing. Additionally, they appropriately inform others when content is AI-generated.

For personal use, AI is a great starting point for research as it can compile information from several sources. Recent enhancements in the Microsoft 365 Copilot and the Google AI Overview include links to source material. This can help you validate the information provided by AI as well as provide additional relevant information. To expand your knowledge of the area, be sure to include additional sources of information.

Copilots can also be helpful for visualizing ideas or creating mock-ups. One great use case is for home decorating. You can take pictures of a room in your home and use Generative AI to add furniture or try different paint colors. It’s a great way to see how something may look before spending anything. In software development, product teams can visualize how a new feature or enhancement could look in their user interface. They can even create simple, clickable prototypes to gather feedback on the enhancement before development begins.

Before you start sharing your AI-generated content with others, you should review it, understand it, and adjust as necessary. How detailed your review is and how much you adjust depends on how important the content is and how much you care about the opinions of those who will see it. Printing throw-away decorations for a birthday party? You can get by just making sure the people have a normal number of fingers. But if you’re pushing code to the production server, you’re going to need a much more robust review.

Context matters! This magical AI-generated jelly cake is fine as a decoration but will frustrate bakers if they expect the result of the online recipe they followed to look like that

With generated images, many of us can recognize when the image matches its subject and can make a judgment call on whether the image is acceptable or not. Authors who lean on AI to generate blogs read through the generated text and have enough experience in the area to determine if the content represents the subject area well. (Side note that my blogs are not AI generated because I personally enjoy writing, especially when I get to slip a meme into my posts).

Some code is more important than others. When developing an app for yourself, manual testing and a review of the generated code is often enough. By contrast, production software has very strong standards for a reason. One bad block of code can add security vulnerabilities, delete databases, and cause outages. Nonetheless, many programmers use AI as a brainstorming tool. You can bounce ideas off of Copilots (like a rubber duck), get suggestions of things to try, or ask for feedback. For production software, AI-generated code should be manually reviewed for accuracy and efficiency, robustly tested, and documented as generated by AI. AI can make programming accessible for many, but learning to program is still a worthwhile skill as it helps you understand the quality of AI-generated code.

Copilots are taking jobs from hard working rubber ducks!

Stop the Slop

In the workplace, AI Slop can create additional work for the coworkers who have to translate or clean up the poor work. Sending and posting AI Slop may cause others to think less of you. When applied without transparency, AI Slop can mislead or misinform others. If used as a replacement for learning, reliance on AI can reduce critical thinking skills.

If the personal, professional, or educational costs of AI Slop still haven’t convinced you, think of the poor AI models! AI Slop can end up in the training data for future generative models and lead to model collapse. Model collapse can lead to lower quality, less accurate, and less creative content generated by future models. Like the hammer, AI is a productive tool when wielded correctly but can be a destructive force when used carelessly.

If you have any tips or use cases for using Generative AI effectively, please share those in the comments! Or, if you want to commiserate and share the worst examples of AI Slop you’ve come across, that’s also welcome.

SAS Viya: Powering smarter decisions at lower cost and in shorter time

Dave Kessler — Mon, 08 Dec 2025 18:48:29 +0000

You have heard many sayings about time, money, or both. The phrase "Time is money" is frequently cited, as well as the complementary adage "You can get more money, but you cannot get more time." This is particularly true when conducting an analysis, as you are always on a tight schedule. If you are using shared resources, you are on everyone's clock, and those resources are everyone's money.

You know you need to save both time and money. In this post, you will learn a method to save time and possibly money when performing multiple repeated measures analyses with the LOGSELECT and GENSELECT procedures in SAS Viya. You will do this by using the APPLYROWORDER option and a data set that has a predefined organization.

Repeated measures… one more time

You might have used repeated measures analysis before, but it is always helpful to review the fundamentals. If your analysis involves making measurements of the same subject at separate times, you cannot always assume that these separate observations are independent. For example, it is reasonable to assume that a person has some latent qualities that tend to influence the observed characteristics of that person. When you use a generalized linear model, you can address this intra-subject correlation by using generalized estimating equations (GEE).

SAS Viya supports GEE in the GENSELECT and LOGSELECT procedures. In each of these procedures, you specify a subject effect, a working correlation type, and a within-subject ordering effect in the REPEATED statement. The subject effect identifies the individual subjects. The working correlation type specifies the assumed correlation structure between the repeated measurements. The within-subject effect specifies the order in which the measurements were taken; some correlation structures require this information.

The following SAS statements specify a logistic regression in PROC GENSELECT:

   proc genselect data=mycas.wheeze;
      class smoke subject visit;
      model wheeze(event='1') = age smoke / dist=binary;
      repeated subject=subject / type=UN within=visit;
   run;

This models the binary variable wheeze by using a continuous effect of age and a classification effect of smoke. In the REPEATED statement, the subject variable identifies the distinct subjects in the data set. The type=UN option specifies an unstructured working correlation structure. In this setting, unstructured means that there is no specific pattern in the correlation between pairs of observations. Because the order is important for the unstructured correlation type, the REPEATED statement also specifies that the visit variable identifies the order of the observations within each subject.

This is simple enough so far, but there are some hidden operations at work!

Hidden time

Repeated-measures analysis involves working with a set of observations for each individual subject. For computational efficiency, the procedure organizes the data set so that all observations for an individual subject are contiguous. This organization is called a partition of the data set. The procedure creates this partition for you before the analysis begins.

That is certainly a kind service the procedure provides, but what if you want to run more than one analysis using the same subject definition? For example, you might want to evaluate different working correlation structures, different mean effects, or different response distributions.

You might wonder, “What’s so bad about that?” Well, each analysis requires a re-partition. Because the data set might have observations for each subject scattered across different worker nodes, the partition process can be time-consuming. That time will show up in the overall time needed to complete the analysis.

Table 1 shows the maximum time required to complete the repeated-measures analysis for data sets of increasing size. The input data set wheeze has been randomly shuffled across five worker nodes before the analysis.

Notice how the time needed grows with the number of observations:

Number of Observations	Number of Subjects	Maximum run time (sec)
640,000	160,000	3.8
1,280,000	320,000	7.3
2,560,000	640,000	13.1
5,120,000	1,280,000	25.3
10,240,000	2,560,000	62.2
20,480,000	5,120,000	233.5
40,960,000	10,240,000	382.6

Table 1: Increasing run time as the number of subjects grows

This increase in run time with larger data sets is not surprising because there are more observations to process during the analysis. However, the time shown here also includes the time needed to partition the input data set. If you are performing several different analyses that use the same subject effect, then you are paying the price to repeat the same partition operation during each analysis. That does not feel like an effective use of your time and money – why repeat the partition for each analysis?

Time to relax - With a partition!

Beginning with the 2025.10 release of SAS Viya, you can use a pre-partitioned data set, removing the need for the procedure to re-partition the data set for each analysis. To do this, you will use PROC CAS to create the partition by using the partition action. The following SAS statements create a data set, wheezepart, that is a partitioned version of the wheeze data set:

   proc cas;
      table.partition /
        table={name="wheeze", groupBy={"subject"}, orderBy={"visit"}}, 
        casout={name="wheezepart", replace=true};
      run;
   quit;

The groupBy list specifies the variables that define the partition, and the orderBy list specifies the variables that define the order of the observations within each unique level of the groupBy list. The casout= option defines the output data set, named wheezepart. Notice that the groupBy list includes all the variables that define the subject effect.

Now that you have a pre-partitioned data set, you can proceed with many different analyses, all without having to wait for the partition process. You use the new wheezepart data set, and you specify the APPLYROWORDER option to the procedure:

   proc genselect data=mycas.wheezepart applyroworder;
      class smoke subject visit;
      model wheeze(event='1') = age smoke / dist=binary;
      repeated subject=subject / type=un within=visit;
   run;

Behind the scenes, the procedure verifies that the partition information in the wheezepart data set is compatible with the subject effect specified in the REPEATED statement. After the procedure verifies this, the analysis continues without the partitioning step. This feature is also available with the LOGSELECT procedure when you are analyzing repeated measures data in a logistic regression.

The results of another timing experiment that illustrates the benefit of pre-partitioning are shown in Table 2. The table compares the maximum time to complete the analysis for an unpartitioned input data set with the maximum time for a partitioned version of that data set. It also includes the maximum time for the partitioning step:

Number of Observations	Number of Subjects	Maximum partition time (sec)	Maximum run time *WITH* pre-partitioned data set (sec)	Maximum run time *WITHOUT* pre-partitioned data set (sec)
640,000	160,000	1.9	2.3	3.8
1,280,000	320,000	3.5	3.9	7.3
2,560,000	640,000	6.4	7.1	13.1
5,120,000	1,280,000	12.4	13.2	25.3
10,240,000	2,560,000	26.0	25.8	62.2
20,480,000	5,120,000	149.4	67.8	233.5
40,960,000	10,240,000	342.0	70.7	382.6

Table 2: Comparison of run time using a pre-partitioned data set to run time using an unpartitioned data set

The experiment aggregates results across multiple trials, so the sum of the partition time and the time for analysis with the pre-partitioned data set does not exactly equal the time for analysis without a pre-partitioned data set. However, the order of magnitude is clear. If you are using the data set with 40,960,000 observations and you run a dozen separate analyses, you could save an hour. If you are running in a cloud environment where you pay as you go for computation and storage, you might also save the costs associated with the redundant partitioning steps.
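The "save an hour" estimate can be checked with a few lines of arithmetic. The sketch below uses only the Table 2 timings for the 40,960,000-observation data set; the function names are illustrative, and because the table reports aggregated maxima, the result is a rough estimate rather than a benchmark.

```python
# Timings from Table 2 for the 40,960,000-observation data set (seconds).
PARTITION_SEC = 342.0          # one-time cost of the partition step
RUN_PREPARTITIONED_SEC = 70.7  # analysis on the pre-partitioned data set
RUN_UNPARTITIONED_SEC = 382.6  # analysis that re-partitions on every run


def total_seconds(n_analyses: int, prepartition: bool) -> float:
    """Total seconds for n_analyses runs under each strategy."""
    if prepartition:
        # Partition once, then every analysis skips the partition step.
        return PARTITION_SEC + n_analyses * RUN_PREPARTITIONED_SEC
    return n_analyses * RUN_UNPARTITIONED_SEC


def saved_minutes(n_analyses: int) -> float:
    """Minutes saved by partitioning once instead of on every run."""
    saved = total_seconds(n_analyses, False) - total_seconds(n_analyses, True)
    return saved / 60.0


if __name__ == "__main__":
    for n in (1, 2, 12):
        print(f"{n:>2} analyses: {saved_minutes(n):6.1f} minutes saved")
```

With these numbers, a dozen analyses save a little under an hour, and the benefit only appears from the second analysis onward, because the partition still has to be created once.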

Enjoy your time off

In this post, you learned how to save time and resources in repeated-measures analysis by using a pre-partitioned data set and the APPLYROWORDER option for the GENSELECT and LOGSELECT procedures in SAS Viya. Your next assignment is to decide what to do with the time you save!

The post SAS Viya: Powering smarter decisions at lower cost and in shorter time appeared first on The SAS Data Science Blog.

Revolutionizing industrial safety: How digital twins and AI are transforming PPE detection

John Campbell — Fri, 14 Nov 2025 14:15:16 +0000

Authors: John Campbell, Priti Upadhyay, and Jonny McElhinney

Digital twin technology, powered by advanced simulation platforms such as Unreal Engine, is revolutionizing the creation and management of training data for deep learning models. Traditional data collection and manual labeling for Personal Protective Equipment (PPE) detection are often time-consuming, costly, and prone to human error. Moreover, capturing diverse real-world conditions, such as varying lighting conditions, worker movements, and different types of PPE, can be challenging and sometimes hazardous.

To overcome these challenges, our digital twin approach generates synthetic, accurately labeled datasets in a fully controlled virtual environment. The platform allows extensive customization, including adjustable lighting conditions, diverse avatar animations, flexible PPE configurations, and dynamic camera placements that simulate multiple real-world perspectives. In this post, we do the following:

Discuss the advantages of using synthetic data for training and validation
Outline the architecture of our digital twin simulation platform
Describe the demo environment used for validating model performance
Highlight how this approach accelerates model development while improving safety and scalability

Background

In industrial environments, safety is the foundation of smooth and efficient operations. The safety of workers is directly linked to the overall safety and productivity of the industry itself. When workers are protected, processes run without disruption, equipment is used responsibly, and costly accidents are avoided. Among the many factors that contribute to industrial safety, the correct use of PPE stands out as one of the most critical.

However, maintaining consistent PPE compliance across large industrial sites can be challenging. Human error or time pressure can result in workers failing to wear essential protective gear. This includes items such as hard hats, gloves, safety glasses, or vests. These lapses not only endanger individuals but can also lead to serious incidents, production downtime, and regulatory penalties that affect the entire operation.

To address this challenge, industries are increasingly adopting PPE detection technology. Using artificial intelligence (AI) and computer vision, these systems automatically detect whether workers are wearing the required PPE in real time. They can alert supervisors, log compliance data, and help companies maintain a culture of safety and accountability.

Real-world data challenges

Collecting real-world data from industrial sites for training PPE detection models presents several challenges. Capturing images and videos of workers wearing PPE in different scenarios can be time-consuming, expensive, and potentially hazardous. There are also privacy concerns, as recording employees raises ethical and legal issues. Furthermore, real data often suffers from imbalances and biases. Certain types of PPE, worker demographics, or lighting conditions may be underrepresented, leading to models that perform poorly in these scenarios.

Another significant issue is human labeling inconsistency: manual annotation of PPE in images can be error-prone and subjective, which results in mislabeled or inconsistently labeled data that degrades model accuracy. Digital twin technology addresses these problems. By creating highly realistic virtual replicas of industrial environments, digital twins enable the generation of large volumes of diverse, labeled, and unbiased data safely and efficiently. Synthetic data from digital twins ensures balanced representation across PPE types, worker appearances, and environmental conditions, overcoming the limitations of real-world data while reducing risks, costs, and compliance challenges.

By integrating PPE detection into safety management systems, industries can move from reactive responses to proactive prevention. It ensures that worker protection becomes a continuous, automated process, reinforcing the idea that the safety of every worker is essential to the safety and success of the entire industry.

Synthetic data generation

A digital twin of a real-world industrial environment enables organizations to simulate numerous workplace scenarios safely and efficiently. Using advanced 3D modeling, physics engines, and visual rendering from gaming technology, synthetic data can be generated to represent workers wearing different types of PPE in various conditions, lighting, and poses. This synthetic data becomes a valuable training resource for deep learning (DL) models, enabling them to recognize PPE more accurately and reliably. By leveraging digital twins, industries can create realistic, labeled datasets at scale, accelerating the development of robust PPE detection algorithms without disrupting actual operations or exposing workers to risk.

At SAS, we have developed a digital twin platform that uses the advanced capabilities of Unreal Engine. It was specifically designed to support the creation and training of PPE detection models for industrial safety applications. This digital twin serves as a highly realistic virtual environment that mirrors real-world industrial settings, including factories, construction sites, and warehouses. The platform enables the simulation of diverse workplace conditions, such as dynamic lighting, realistic worker movements, and complex environmental factors that are difficult or hazardous to capture in real-world settings, as shown in Figure 1 below.

Figure 1: Simulation of an avatar preparing for a drilling operation

Our digital twin features customizable worker avatars. They can represent individuals of diverse body types, skin tones, ages, and genders, helping to eliminate demographic bias in data generation. Each avatar can be configured to wear or omit specific PPE items such as hard hats, gloves, safety glasses, and safety vests. As shown in Figure 2 below, this allows us to simulate countless combinations for accurate model training. The system also incorporates environmental dynamics, including lighting changes, camera perspectives, and background variations, ensuring that the synthetic data closely mimics real-world variability.

Figure 2: Customizable number of avatars and their PPE status

A key advantage of this approach lies in automated and consistent label generation. See Figure 3 below. The system takes a JSON input containing customized parameters for avatars and their PPE configurations, such as hard hats, vests, gloves, and safety glasses. With this flexible setup, users can easily create diverse and realistic scenarios using a random scenario generator. This approach enables efficient dataset balancing, allowing targeted data generation for underrepresented PPE classes or specific conditions where the detection model underperforms. Every synthetic image produced within the digital twin is automatically annotated with precise metadata. This includes the type, location, and state of PPE. Bounding boxes are generated solely for the visible portions of the objects within the rendered scene, with occluded regions excluded from annotation.
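To make the JSON-driven setup more concrete, here is a minimal sketch of a random scenario generator. The schema is hypothetical: the field names (`lighting`, `camera`, `avatars`, `ppe`) and the `wear_prob` parameter are illustrative assumptions, not the actual input format of the SAS digital twin platform.

```python
import json
import random

# Hypothetical schema: these field names are illustrative assumptions,
# not the actual input format of the digital twin platform.
PPE_ITEMS = ("hard_hat", "vest", "gloves", "safety_glasses")


def random_scenario(n_avatars, rng, wear_prob=None):
    """Build one scenario dict with a random PPE on/off state per avatar.

    wear_prob maps a PPE item to the probability that an avatar wears it;
    items that are not listed default to 0.5.
    """
    wear_prob = wear_prob or {}
    avatars = []
    for avatar_id in range(n_avatars):
        ppe = {item: rng.random() < wear_prob.get(item, 0.5) for item in PPE_ITEMS}
        avatars.append({"id": avatar_id, "ppe": ppe})
    return {
        "lighting": rng.choice(["day", "dusk", "indoor"]),
        "camera": rng.choice(["front", "overhead", "side"]),
        "avatars": avatars,
    }


if __name__ == "__main__":
    rng = random.Random(42)  # fixed seed so the scenario set is reproducible
    # Oversample the "gloves missing" case by lowering its wear probability.
    scenario = random_scenario(3, rng, wear_prob={"gloves": 0.2})
    print(json.dumps(scenario, indent=2))
```

Lowering the wear probability for one item is one simple way to target an underrepresented class, such as missing gloves, when balancing a dataset.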

This level of labeling accuracy and consistency is extremely difficult to achieve with manual annotation of real-world data, which is often prone to human error and subjective interpretation. By generating perfectly aligned and error-free labels, we ensure that our deep learning models are trained on reliable, standardized data. This significantly improves detection accuracy, reduces training time, and enhances the model's overall robustness.

Figure 3: Automatic generation of precise bounding box for detecting presence and absence of PPE

With these features, our digital twin not only accelerates data generation but also ensures fairness, consistency, and precision. This creates a solid foundation for developing advanced, bias-free, and dependable PPE detection systems that strengthen industrial safety.

PPE detection model

To validate and fine-tune our PPE detection solution, we have created a demo physical environment at SAS. This space closely mimics a real industrial workspace. See Figure 4 below. This controlled setup features actual machinery, workstations, and safety zones. It also includes strategically positioned cameras that capture worker movements and PPE usage from multiple angles. By deploying our AI models and digital twin–generated scenarios in this environment, we can observe system performance in real-world conditions, identify potential blind spots, and optimize detection accuracy. The demo environment also allows us to test alerts, workflows, and integration with SAS Event Stream Processing, ensuring that the solution functions seamlessly before scaling it to full industrial sites. This hands-on approach provides critical insights, bridging the gap between simulated training data and actual operational deployment.

Figure 4: Worker performing a safe scenario with use of PPE in a demo physical environment

Using the data generated from our digital twin and validated through the demo physical environment, we trained a DETR (DEtection TRansformer) based deep learning object detection model to accurately identify PPE compliance in industrial settings. DETR’s transformer architecture enables the model to directly predict object bounding boxes and classes without requiring complex post-processing. This makes it highly effective for detecting multiple PPE items in cluttered or dynamic environments. By training solely on synthetic data that includes diverse worker poses, lighting conditions, PPE types, and demographics, the model learned to generalize well across different scenarios, as shown in Figure 5 below. The trained DETR model demonstrated high accuracy in detecting PPE, accurately identifying compliance or violations in real time. These results confirm that synthetic data from digital twins can effectively train advanced AI models for reliable, bias-free PPE detection in industrial settings.

Figure 5: Detection of PPE using the deep learning model trained using synthetic data

SAS ESP for deployment

To bring our PPE detection solution from development to real-world deployment, we are leveraging SAS Event Stream Processing (ESP). SAS ESP enables real-time ingestion, processing, and analysis of high-volume data streams from industrial sites, such as video feeds from cameras monitoring worker activity. By integrating our deep learning models with SAS ESP, we can check PPE compliance instantly and generate alerts the moment a violation occurs. This streaming analytics approach enables the quick identification and resolution of safety incidents, which enhances both worker protection and operational efficiency. By combining advanced AI with SAS ESP, we are deploying a robust, real-time PPE detection system that turns data into actionable insights for safer industrial operations.

Summary

Ensuring industrial safety requires more than just providing protective equipment. It also depends on actively monitoring compliance and fostering a culture of accountability. PPE detection systems enhance industrial safety by verifying in real time that workers wear the necessary protective gear, which reduces accidents and operational disruptions. Digital twins and gaming technologies play a crucial role in generating large, diverse, and unbiased datasets for training machine learning models, supporting fair representation across demographics such as skin tone, age, and gender. The automated and consistent labeling provided by digital twins further strengthens model accuracy and reliability.

Finally, SAS Event Stream Processing (ESP) enables real-time deployment of these AI models on industrial sites, where they instantly detect PPE compliance and generate alerts to prevent hazards. Together, these technologies create a comprehensive, proactive approach to industrial safety, protecting workers, enhancing compliance, and improving overall operational efficiency. Interested in learning more? Read more about the SAS Industrial Safety PPE model.


Model Risk Management at your fingertips: Just ask!

Phoemphun Oothongsap — Tue, 04 Nov 2025 17:10:18 +0000

Authors: Phoemphun Oothongsap, Lili Li, Derya Biryol, and Artin Armagan

In modern banking, fraud detection models are the silent heroes, scoring billions of transactions annually and standing as the primary shield against catastrophic financial and reputational damage. But the models protecting the institution are only as strong as the system that supports them: Model Risk Management (MRM).

Historically, MRM has been an anchor of inefficiency. Risk teams are forced into a cycle of manual, arduous work, including waiting for scheduled reports, sifting through static dashboards, and chasing down data owners for model performance updates. This slow, backward-looking approach creates dangerous reporting delays, leaving institutions vulnerable to regulatory gaps and undetected model drift.

It doesn't have to be this way. Imagine moving beyond the confines of old-school Model Risk Management. Our new approach fundamentally streamlines your compliance and review process by delivering real-time model reporting and statistical outputs, finally eliminating the delays caused by static dashboards and manual reviews.

What if we could simply ask for the answers to our MRM reporting?

With the power of Large Language Models (LLMs), MRM preparations can be shifted to a more conversational style. Instead of combing through many spreadsheets, documents, and databases, you would simply type:

"Show me transaction volume by month and portfolio."
"What is the monthly fraud transaction percentage?"
"Show me the last quarter’s data drift."

Within seconds, you would receive a summary presented as clean tables and visual charts.

Another example would be typing:

"Has the fraud model been retrained since the last validation?"

The system responds this time with a timeline, links to documentation, and a summary of changes.

This could be the future of MRM reporting: conversational, intelligent, and on-demand. In this future, users interact with MRM reports and data through chat instead of a static dashboard. An MRM agent would understand context, retrieve relevant report sections, and perform calculations, so analysts, auditors, and regulators could query the system at any time and receive instant answers.

Why Generative AI is a game-changer for fraud detection

Fraud detection models are continually evolving to address new threats, shifts in customer behavior, and emerging technologies. To keep up, MRM needs to be equally adaptable. Here’s how conversational MRM, powered by generative AI, could revolutionize the process:

Real-Time Transparency: Instant visibility into model lineage, validation status, and performance metrics.
Audit-Ready Insights: Generate regulatory-compliant summary reports instantly.
LLM-Integrated Analysis: Use LLMs to process analyst requests, written in plain English, and convert them into actions within a statistical framework.
Dynamic Visualizations: Automatically highlight model trends and outliers as they occur with dynamic dashboards and charts.

Business use case

Transactional fraud detection models continuously monitor vast volumes of monetary and non-monetary transactions in real time to detect anomalous behavior. These models must themselves be continuously validated, monitored, and documented. Traditional MRM workflows often cannot keep pace, and risk teams struggle to keep up with the following:

Data monitoring
Model performance evaluations
Data drift
Shifts in fraud patterns
Regulatory documentation requirements

This is where an MRM AI Agent could become transformative.

Instant Data & Model Monitoring

The agent can automatically track model performance metrics, such as the fraud detection rate, false positives, and score distributions, in real time. For example:

"Show me the fraud detection rate by portfolio over the last quarter."
"Show me the bi-weekly score distribution by portfolio over the last quarter."

Data Drift Detection

The agent can analyze distributions of input variables and detect drift over time. For example, you could ask:

"Has the transaction amount distribution shifted since the first quarter?"
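One common way to quantify this kind of shift is the population stability index (PSI), which compares the share of observations per bin between a baseline period and a recent period. The sketch below is a plain-Python illustration under stated assumptions: the bin edges and sample amounts are made up, and the source does not say how the MRM agent computes drift.

```python
import math


def psi(expected, actual, edges, eps=1e-6):
    """Population stability index between two samples over shared bins.

    edges are sorted interior cut points; values <= edges[0] fall in the
    first bin and values > edges[-1] fall in the last one.
    """
    def shares(values):
        counts = [0] * (len(edges) + 1)
        for v in values:
            idx = sum(v > e for e in edges)  # number of cut points below v
            counts[idx] += 1
        total = len(values)
        # eps keeps empty bins from producing log(0) or division by zero
        return [max(c / total, eps) for c in counts]

    e_shares, a_shares = shares(expected), shares(actual)
    return sum((a - e) * math.log(a / e) for e, a in zip(e_shares, a_shares))


if __name__ == "__main__":
    q1_amounts = [12, 25, 40, 55, 80, 120, 150, 300]        # made-up baseline
    recent_amounts = [30, 60, 90, 140, 200, 260, 320, 500]  # made-up recent data
    print(round(psi(q1_amounts, recent_amounts, edges=[50, 100, 200]), 3))
```

A PSI of zero means the binned distributions match. A commonly cited rule of thumb treats values above roughly 0.25 as a significant shift, although thresholds are usually set per portfolio.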

Conversational Access to Insights to Update Business Rules

Instead of querying various databases, teams can simply ask questions and receive visual summaries, charts, and links to documentation. For example:

"Compare fraud scores for flagged transactions last month versus this month."
"Compare last week's score distribution to that of the preceding month."
"Have there been any significant shifts in the modeling fields of the data over the last six months?"
"What is the transaction detection rate of the model at an alert rate of 0.5% over the last 3 months?"
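The last question maps to a simple calculation. The sketch below assumes one common definition: alert on the highest-scoring fraction of transactions, then report the share of all fraudulent transactions that fall inside the alerted set. The function name and the definition are assumptions, since the source does not define the metric.

```python
def detection_rate_at_alert_rate(scores, is_fraud, alert_rate):
    """Share of fraudulent transactions caught when the top-scoring
    alert_rate fraction of all transactions is alerted.

    Assumes higher scores mean higher fraud risk and is_fraud holds 0/1 flags.
    """
    n_alerts = max(1, round(alert_rate * len(scores)))
    ranked = sorted(zip(scores, is_fraud), key=lambda pair: pair[0], reverse=True)
    caught = sum(flag for _, flag in ranked[:n_alerts])
    total_fraud = sum(is_fraud)
    return caught / total_fraud if total_fraud else 0.0


if __name__ == "__main__":
    scores = [0.9, 0.8, 0.7, 0.6, 0.5, 0.4, 0.3, 0.2, 0.1, 0.05]
    is_fraud = [1, 0, 1, 0, 0, 0, 0, 1, 0, 0]
    # A 20% alert rate is used only because this toy sample has ten rows.
    print(detection_rate_at_alert_rate(scores, is_fraud, alert_rate=0.2))
```

In production the alert rate would be far smaller, such as the 0.5% in the question, and the calculation would run over months of scored transactions.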

Scenario Simulation & Stress Testing

The agent can simulate edge cases. For example:

"Construct a scenario in which 1% of monthly transactions exhibit high-velocity fallback behavior, where the payment system rapidly and automatically switches to an alternative transaction method. Assess the impact of this scenario on model performance."
"Show me the score distribution shift over a week, assuming that 50% of transactions had their amounts doubled. Summarize the impact of this change on model performance."

Audit-Ready Documentation

The agent maintains a timeline of model changes, validations, and retraining events. For example, you could ask:

"Summarize model changes since last review."

Figures 1, 2, and 3 illustrate how the MRM Agent Console retrieves relevant report content and performs real-time statistical analysis in response to a user query. Figure 1 demonstrates a sample of an inquiry-and-response flow. The user inquires about the monthly transaction volume associated with the loaded card authorization data. The MRM agent then generates a plot and summarizes the findings.

Figure 1: MRM Agent console interface - monthly transaction volume for the loaded card authorization data in a plot graph with a summary

Another example of an inquiry-and-response flow is shown in Figure 2. A user asks about the monthly total transaction amount per portfolio for the loaded card authorization data. The MRM agent generates a plot and summarizes the findings.

Figure 2: MRM Agent console interface - monthly total transaction amount per portfolio for the loaded card authorization data in a plot graph with a summary

A third example of an inquiry-and-response flow is shown in Figure 3. A user inquires about the distribution of model scores over the last six months, and the MRM agent generates a population stability index (PSI) table and a summary.

Figure 3: MRM Agent console interface - the last six months of model score distribution in a PSI table with a summary

The Payoff: Generative AI’s impact on fraud detection

Banks that adopt conversational MRM for fraud detection can achieve more than just operational speed. They could secure clarity, control, and confidence across their model risk management processes and reporting.

Ultimately, adopting conversational MRM is a strategic move that can empower banks to lead with intelligence and resilience. By embracing Generative AI innovation, banks can position themselves not just to keep up, but to set the pace for the future of fraud models. This approach transforms model governance from a static compliance process into a dynamic, interactive one, where decisions are made instantly, confidently, and backed by real-time insights. By enabling conversational engagement, institutions shift from reactive reporting to proactive risk management, fostering transparency and agility across the entire lifecycle of fraud detection models.

Lili Li, PhD, Senior Data Scientist at SAS, specializes in statistical modeling, machine learning, and large-scale data integration. After earning her PhD from North Carolina State University, she played a pivotal role in developing JMP Clinical and JMP Genomics, advancing interactive visualization and exploratory analytics for life sciences. Since 2020, she has concentrated on advanced analytics consulting, leveraging predictive modeling, anomaly detection, and optimization techniques to support clients in financial risk assessment and fraud mitigation.

Derya Biryol, PhD, is a Senior Data Scientist at SAS. Since joining SAS in 2016, Derya has specialized in fraud detection modeling, model evaluation, and risk analytics. With a Ph.D. in Applied Mathematics, she brings deep expertise in advanced statistical methods and machine learning to develop and optimize real-time fraud detection solutions as part of the R&D Applied AI and Modeling team.

Artin Armagan, PhD, Sr Manager at SAS. Artin is a statistician working with a brilliant team of data scientists to build real-time transactional fraud-detection models for financial institutions. He has worked on analytical modeling across various business domains, including banking, insurance, and healthcare, throughout his tenure at SAS. Previously, he held postdoctoral positions at Duke University after completing his graduate studies at the University of Tennessee.

The post Model Risk Management at your fingertips: Just ask! appeared first on The SAS Data Science Blog.

Power loss prediction in solar farms with SAS

Shahrzad Azizzadeh — Wed, 08 Oct 2025 13:00:03 +0000

Authors: Shahrzad Azizzadeh, Kaustubh Khandwe, Bahar Biller, and Paul Venditti

On large-scale solar farms, power loss is the silent drain on profits. Unoptimized panels chip away at efficiency, causing hidden losses that people often overlook—but those losses are never insignificant. In this post, we’ll uncover how to spot and solve these inefficiencies. Drawing from a real-world U.S. case study, you’ll see how SAS machine learning algorithms turn vague estimates into accurate forecasts of power loss in solar farms. The result? A clear, data-driven roadmap for smarter operational decisions, improved system efficiency, and ultimately, stronger profitability.

Background

In the rapidly growing field of renewable energy, solar farms play a vital role in supplying clean electricity to the grid. Yet, even with advances in technology, installations can underperform due to subtle and often invisible issues. This includes issues such as misaligned panels, weather-related impacts, or gradual wear and tear. These inefficiencies are not always apparent in daily operations. They can accumulate over time, resulting in significant revenue loss and reduced energy output.

To put this in perspective, panel angle deviations from optimal positioning can reduce power output by 2-8%. Panel degradation typically causes 0.5-0.8% power loss annually, accumulating to 15-20% over a panel's 25-year lifespan. Perhaps most significantly, high surface temperatures during peak operating hours can cause temporary power losses of 10-20% when panels reach 60-70°C. Together, these losses can represent millions of dollars in lost revenue for utility-scale installations, which is why the ability to anticipate and quantify them is so important, especially at scale.

Multiple factors contribute to performance degradation, including suboptimal panel tilt angles, adverse conditions, and equipment aging. By using historical data and advanced modeling techniques, we demonstrate how SAS machine learning procedures can be employed to construct a predictive framework that quantifies power loss in relation to ideal operating conditions. This model enables operators to isolate inefficiencies, forecast degradation trends, and proactively manage maintenance.

The resulting insights are essential not only for maximizing energy yield but also for supporting financial planning and grid reliability. We will outline the end-to-end analytical process, including data preprocessing, predictor variable selection, model development, and validation. We also include visualizations by using the SGPLOT and GCHART procedures that highlight key insights and inverter-level performance findings.

Use case

Figure 1 illustrates an asset hierarchy in a solar farm with three inverters. Each is connected to a set of combiner boxes, which in turn are connected to a collection of solar panels. Each sub-system, consisting of an inverter, a set of combiner boxes, and a collection of solar panels, includes a solar tracker system that is responsible for adjusting panel tilts. All panels connected to a combiner box share the same design and orientation and therefore share the same optimal angle for maximum sun exposure. Several sensors monitor various environmental and operational variables at five-minute intervals.

Guided by discussions with industry experts, we established a set of assumptions to support the prediction of power loss in the solar energy system. Losses are evaluated at both the inverter and combiner box levels. The example system in this post consists of 15 combiner boxes, organized into three groups that supply direct current (DC) power to three solar inverters. Each combiner box connects to a set of identical solar panels, ensuring uniform performance characteristics.

For modeling purposes, we assume that the sun delivers consistent solar irradiance (defined as power per unit area) across all panels connected to the same combiner box. Although this assumption simplifies the modeling process, it does not fully reflect real-world conditions. Factors such as shading, dust accumulation, or panel soiling can cause non-uniform irradiance, which affects performance and can therefore limit model accuracy. Future work could explore the incorporation of spatial variability into irradiance measurements.

Figure 1: An illustration of an asset hierarchy in a solar farm

Input data

A snapshot of 10 sample rows from the analytics base table, used as the input data set for our project, is shown in Figure 2. In this table, the TimeStep column represents the timestamp at 5-minute intervals. amb_temp indicates the ambient air temperature at the site, measured in degrees Celsius at the time of the reading. wind refers to the wind speed at the solar farm, recorded in meters per second (m/s). irradiance refers to the solar irradiance on the panel surface, measured in watts per square meter (W/m²).

Panel_age denotes the age of the photovoltaic (PV) panel in years since installation. angle_deviation measures the difference between the panel’s actual tilt and its optimal orientation, in radians. temp_diff_with_STC represents the deviation of the panel surface temperature from the Standard Test Condition (25 °C), also in degrees Celsius. Finally, panel_type specifies the material used in the solar module. The objective is to understand how these factors affect the power loss experienced by each inverter shown in Figure 1.

Figure 2: Sample of the historical data set

Model

Based on data availability and the primary factors contributing to power loss, we focused our analysis on three key categories:

Performance degradation due to equipment aging
Sub-optimal panel tilt angles
Temperature-related losses caused by device overheating and thermal fluctuations.

The historical data set does not directly include power loss values. However, it contains several operational variables that influence and contribute to power loss. Table 1 summarizes the methods used to compute power loss for each of the three identified categories. In this context, P_current refers to the actual DC output power generated, and P_loss represents the estimated power loss.

Category	Loss Calculation Method
Equipment Aging	Linear degradation model (Jordan, Dirk C., and Sarah R. Kurtz) based on panel age (in years), with degradation rates varying by panel material: $P_{\text{loss}} = P_{\text{current}} \times \frac{\text{degradation\_rate} \times \text{average\_panel\_age}}{1 - \text{degradation\_rate} \times \text{average\_panel\_age}}$
Panel Angle Deviation	The amount of loss (Barbón, J., Fernández-Ibáñez, E., and Martínez-Alonso, M) is affected by the cosine of the difference between the optimal and current angles: $P_{\text{loss}} = \frac{P_{\text{current}}}{\cos(\text{angle\_deviation})} - P_{\text{current}}$
Temperature Effects	A coefficient $\gamma$ is applied (Dash, P. K., and N. C. Gupta) to represent the fractional power loss per °C increase above the Standard Test Condition (STC) temperature of 25 °C, where $\Delta\tau$ is the temperature deviation from STC: $P_{\text{loss}} = P_{\text{current}} \times \frac{\gamma \, \Delta\tau}{1 - \gamma \, \Delta\tau}$

Table 1: Power loss calculation method
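The three formulas in Table 1 can be sketched in a few lines. The block below is a minimal illustration in Python (not the SAS code used in the project), and the degradation rate, temperature coefficient, and input values are hypothetical numbers chosen only to exercise the formulas.

```python
import math

def aging_loss(p_current, degradation_rate, avg_panel_age):
    """Loss from equipment aging (linear degradation), per Table 1."""
    d = degradation_rate * avg_panel_age
    return p_current * d / (1.0 - d)

def angle_loss(p_current, angle_deviation):
    """Loss from tilt deviation; angle_deviation is in radians."""
    return p_current / math.cos(angle_deviation) - p_current

def temperature_loss(p_current, gamma, delta_t):
    """Loss from temperature; delta_t is degrees C above STC (25 C)."""
    g = gamma * delta_t
    return p_current * g / (1.0 - g)

# Hypothetical inputs: 100 kW output, 0.5% per year degradation, 10-year-old
# panels, 0.1 rad tilt deviation, gamma = 0.004 per degree C, 10 C above STC.
p = 100.0
losses = {
    "aging": aging_loss(p, 0.005, 10),
    "angle": angle_loss(p, 0.1),
    "temperature": temperature_loss(p, 0.004, 10),
}
for name, value in losses.items():
    print(f"{name}: {value:.2f} kW")
```

All three formulas share one shape: if a factor removes a fraction f of the ideal output, then the ideal output is P_current / (1 - f) and the loss is P_current × f / (1 - f). For the tilt case, f = 1 - cos(angle_deviation).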

The target variable for the predictive model is P_current, which represents the actual power output. Once this value is predicted, power loss (P_loss) is calculated for each observation using the methods outlined in Table 1. The model uses ambient temperature, wind speed, and solar irradiance as predictor variables. Panel age, angle deviation, and temperature deviation from standard test conditions are excluded as predictors; they are applied after prediction to calculate power loss. Because the original data set was recorded at 5-minute intervals, it was aggregated to hourly intervals to improve the clarity of insights and visualizations related to power loss.
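The aggregation from 5-minute readings to hourly values could look like the following Python sketch. It assumes the hourly value is the mean of the readings in that hour, which the post does not state, and the readings are made-up numbers.

```python
from collections import defaultdict
from datetime import datetime
from statistics import mean

def aggregate_hourly(rows):
    """Average (timestamp, value) readings into hourly buckets."""
    buckets = defaultdict(list)
    for ts, value in rows:
        hour = ts.replace(minute=0, second=0, microsecond=0)
        buckets[hour].append(value)
    return {hour: mean(values) for hour, values in sorted(buckets.items())}

# Hypothetical readings: three in the 10:00 hour, one in the 11:00 hour.
readings = [
    (datetime(2025, 6, 1, 10, 0), 90.0),
    (datetime(2025, 6, 1, 10, 5), 100.0),
    (datetime(2025, 6, 1, 10, 10), 110.0),
    (datetime(2025, 6, 1, 11, 0), 80.0),
]
hourly = aggregate_hourly(readings)
```

A sum or an energy integral might be more appropriate than a mean for some variables; the choice depends on how the hourly figures are used downstream.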

We split the data into training and testing sets using a 70:30 ratio. Power output predictions were generated by using the FOREST and GRADBOOST procedures, with the model achieving the lowest Average Squared Error (ASE) on the test set selected for final deployment. To enhance interpretability, in addition to the variable importance scores, Shapley values were calculated using the TreeSHAP option in the ASTORE procedure. We shared the results through both data sets and visualizations.
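The champion selection described above reduces to computing the ASE of each candidate on the test set and keeping the smallest. Here is a minimal Python sketch of that comparison; the predictions and model names are hypothetical placeholders, not output from the FOREST or GRADBOOST procedures.

```python
def average_squared_error(actual, predicted):
    """Mean of squared differences between actual and predicted values."""
    if len(actual) != len(predicted):
        raise ValueError("actual and predicted must have the same length")
    return sum((a - p) ** 2 for a, p in zip(actual, predicted)) / len(actual)

def select_champion(actual, predictions_by_model):
    """Return (model_name, ase) for the model with the lowest test-set ASE."""
    scores = {name: average_squared_error(actual, preds)
              for name, preds in predictions_by_model.items()}
    best = min(scores, key=scores.get)
    return best, scores[best]

# Hypothetical test-set outputs (kW) and predictions from two candidates.
actual = [100.0, 120.0, 90.0, 110.0]
candidates = {
    "forest": [98.0, 123.0, 91.0, 108.0],
    "gradboost": [101.0, 119.0, 90.0, 111.0],
}
champion, champion_ase = select_champion(actual, candidates)
```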

Results

Figure 3 illustrates the relative importance of each predictor variable for the target variable P_current, which represents the sum of the direct currents emanating from all the combiner boxes connected to each inverter. Irradiance is clearly the primary variable affecting power generation, followed by ambient temperature and wind speed.

Figure 3: Relative importance of predictor variables for the target variable P_current

Figure 4 illustrates the power loss by category for each inverter over a 10-day period. For Inverter 1, the percentage of power loss due to panel angle deviation is small, indicating that the tracker system is working as expected. For Inverter 2, however, the tracker system appears to be experiencing issues during the first three days. Temperature-related power loss plays a major role for Inverters 1 and 3, indicating that the panels are overheating and that the cooling systems need to be monitored.

Figure 4: Power loss over 10 days by category for each inverter

Finally, the pie charts of Figure 5 summarize the share of each power loss over the same 10-day period, for each inverter system.

Figure 5: Total power loss breakdown by category and inverter

Conclusion

This post presents a practical, end-to-end framework for forecasting power loss in utility-scale solar farms using SAS Viya. By integrating domain expertise, such as panel degradation behavior, tilt misalignment effects, and temperature sensitivity, with advanced machine learning methods, we achieved a comprehensive understanding of power loss dynamics. Our approach enabled us to quantify power loss drivers at both the inverter and combiner‐box levels, forecast actual power output, and translate those predictions into interpretable, category-specific loss estimates. By using Shapley-value analysis, we also identified the environmental and operational variables that have the most significant impact on performance deviations.

Insights from our case study revealed that temperature fluctuations were the leading cause of power loss in two of the three inverters, while the solar tracking systems generally maintained optimal alignment. These findings empower operators to prioritize cooling system maintenance, optimize tracker calibration, and schedule targeted inspections, ultimately enhancing energy yield and minimizing downtime. The economic impact of these insights is substantial: with individual loss factors capable of reducing power output by 2-20% depending on the issue, our predictive framework addresses inefficiencies that could otherwise cost utility-scale solar farms millions of dollars annually in lost energy production. By combining predictive accuracy with interpretability, this framework lays a strong foundation for proactive, data-driven solar asset management.

Paul Venditti

Paul Venditti is a seasoned industry consultant with over three decades of experience in industrial analytics and digital transformation. His career began in heavy equipment services and extended through a decade at GE Research, where he patented advanced analytics and digital twin solutions. At SAS, as an Advisory Industry Consultant, Paul helps organizations leverage AI, IoT, and machine learning to enhance manufacturing quality, minimize downtime, and drive operational resilience.

Kaustubh Khandwe

Kaustubh Khandwe is a Senior Data Scientist in the SAS Pune Applied AI and Modeling (AAIM) Division. He has developed solutions across various industries, including manufacturing, retail, and IoT. With AAIM, he has contributed to projects such as Preventive Maintenance for Solar and Wind Farms and Auto Damage Fraud Detection in the Insurance industry. Future goals include building industry-impactful models and driving their integration with cutting-edge technologies, such as Agentic AI, ultimately providing a value-driven software experience to customers.

The post Power loss prediction in solar farms with SAS appeared first on The SAS Data Science Blog.

Streamlining public health analytic software costs in a time of budgetary challenge

Meg Schaeffer — Mon, 29 Sep 2025 10:59:52 +0000

Over the course of the last several decades, public health has experienced dramatic fluctuations in funding. After the 9/11 terrorist attacks, new emergency preparedness departments were created within agencies to draft plans, develop training, and execute responses to disasters impacting human health. Funding dwindled in the years that followed, up until the emergence of SARS-CoV-2. A rapid infusion of funding into public health budgets enabled a mass expansion of staff and equipment, followed by efforts to fortify and modernize public health infrastructure. Many agencies successfully revised outdated software, data stores, and duplicative (and often slow) network connections. Agencies were also left to evaluate which procurements made during COVID were worth keeping.

The landscape has shifted again, and in response to shortened grant periods and reductions in federal funding, agencies are critically reassessing their analytic needs. SAS has supported several agencies in these evaluations and developed a set of recommendations to streamline and simplify enterprise analytic solutions.

Execute a comprehensive user survey

Targeted to all employees, the survey should assess which statistical, visual, and peripheral tools are being used, how frequently they are used, what data they access and its respective size, whether and how open-source code is utilized, and what challenges or barriers hinder efficient work.

Capture license costs for all software tools

SAS licenses can come from several sources: some are provided by the CDC, others may be managed by a centralized IT organization, and still others may reside on SAS servers or exist as stand-alone desktop licenses. Your SAS Account Executive (accessible via the SAS Customer Service Portal) can help determine the types and counts of licenses assigned to your agency. The same approach may be needed to assess licenses for other analytic or visualization software, such as SPSS, RStudio (Posit) in its individual, small business, or enterprise editions, Tableau, ArcGIS, Power BI, Stata, ActivePython, and others.

Identify cost and labor-intensive constraints on infrastructure

Are there redundant servers supporting specific analytic processes? Are there security concerns with users interacting with protected data using open-source tools? Are excessive processing times contributing to increased cloud computing costs?

Crosswalk software functionality and identify redundancy

The following questions may help further identify potential cost savings:
- Are employees clustering use of certain tools without collaborating or exploring the option of streamlining to fewer tools?
- Are users publishing dashboards and reports in multiple visualization tools (e.g., Tableau, Power BI, ArcGIS)?
- Are there options to migrate burdensome processing jobs into a more efficient statistical software tool?
- Can software be migrated to a single hosted environment?

Explore potential changes by requesting trial environments

SAS Viya offers supported trial environments for agencies seeking to compare its cloud-based version with traditional desktop configurations. These trials allow users to load de-identified data, build dashboards, test SAS code, create workflows, and publish and share reports.

As part of the SAS trial experience, a content assessment is typically included, where SAS desktop code is evaluated to determine which procedures are in use and how they align with SAS Viya license levels.

Conclusion

SAS Viya is one of the few statistical platforms capable of replacing and expanding upon SAS 9.4 or Enterprise Guide functions, offering a drag-and-drop interface, integration with open-source tools, and powerful visualizations that match or exceed those of Tableau, Power BI, or R Shiny. Investing in SAS Viya can streamline and simplify statistical and visualization tools, improve software management, boost productivity, reduce cloud computing costs, and foster collaboration within an agency.

Learn more

The post Streamlining public health analytic software costs in a time of budgetary challenge appeared first on The SAS Data Science Blog.

Real-time computer vision for worker safety using SAS Event Stream Processing

Alexandru Bobe — Thu, 25 Sep 2025 13:08:05 +0000

In high-risk industries like construction and manufacturing, worker safety isn’t just a priority; it’s a constant challenge. Fast-moving environments, heavy machinery, and human unpredictability make it incredibly tough to monitor compliance and catch dangerous behavior before it leads to injury.

As data scientists, we wanted to tackle that challenge head-on. What if we could use computer vision to monitor safety in real time—right at the edge?

In this post, I’ll guide you through a project where we developed a modular, reusable safety monitoring component utilizing advanced AI techniques. By combining YOLO-based models with SAS Event Stream Processing (ESP) on NVIDIA’s Jetson Orin edge device, we created a system capable of real-time hazard detection that can help save lives.

Let’s dive into how this cutting-edge blend of AI and edge computing is reshaping workplace safety—one frame at a time.

Why real-time monitoring matters

Every year, countless workplace injuries and fatalities occur. Many of these could have been avoided with better safety oversight. According to the U.S. Bureau of Labor Statistics, from 2022 to 2023, there were 3.5 fatalities per 100,000 full-time workers. Although traditional methods such as manual inspections and CCTV cameras help, they often lack the ability to provide actionable, real-time insights.

For example, a worker forgetting to wear gloves while handling dangerous equipment might go unnoticed until an incident occurs. Similarly, a worker operating machinery without a helmet could be at risk of serious head injuries from falling objects. Someone entering a hazardous zone without a high-visibility vest might go unnoticed, increasing the likelihood of accidents involving moving vehicles. Unsafe postures, such as crouching under unstable machinery or standing too close to heavy loads being lifted, are also hard to monitor continuously. These gaps in oversight highlight the critical need for automated, real-time safety monitoring.

Real-time computer vision bridges this gap by delivering instant detection and alerts. By automating safety monitoring, organizations can mitigate risks, improve compliance, and ultimately protect their workforce.

The solution: AI meets safety

Our solution focuses on two main capabilities:
Personal Protective Equipment (PPE) Detection: Using You Only Look Once X (YOLOX), the system identifies whether workers are wearing required safety gear such as helmets and vests.
Pose Estimation: Leveraging YOLOv7 Pose, it analyzes worker movements and postures to flag potentially dangerous behaviours.

These models are deployed on SAS ESP for real-time inference, leveraging its native support for ONNX Runtime. This enables us to run YOLO-based models in the Open Neural Network Exchange (ONNX) format, ensuring compatibility and efficient execution across various platforms.
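To make the PPE capability more concrete, the following Python sketch shows one possible way to turn object detections into a per-worker compliance decision. It is not the project's implementation: the class names (`person`, `helmet`, `vest`), the detection format, and the rule that a PPE box belongs to a person when its center lies inside the person box are all assumptions made for illustration.

```python
REQUIRED_PPE = ("helmet", "vest")  # assumed class names

def center(box):
    """Center point of an (x1, y1, x2, y2) box."""
    x1, y1, x2, y2 = box
    return ((x1 + x2) / 2.0, (y1 + y2) / 2.0)

def contains(outer, point):
    """True when point lies inside the (x1, y1, x2, y2) box."""
    x1, y1, x2, y2 = outer
    px, py = point
    return x1 <= px <= x2 and y1 <= py <= y2

def missing_ppe(detections):
    """For each detected person, list the required PPE items not found."""
    people = [d["box"] for d in detections if d["label"] == "person"]
    report = []
    for person_box in people:
        worn = {d["label"] for d in detections
                if d["label"] in REQUIRED_PPE
                and contains(person_box, center(d["box"]))}
        report.append([item for item in REQUIRED_PPE if item not in worn])
    return report

# Hypothetical detections for one frame: two workers, one helmet, one vest.
frame = [
    {"label": "person", "box": (100, 50, 200, 400)},
    {"label": "helmet", "box": (130, 50, 170, 90)},
    {"label": "person", "box": (300, 60, 400, 410)},
    {"label": "vest", "box": (310, 150, 390, 260)},
]
violations = missing_ppe(frame)
```

In this frame the first worker is missing a vest and the second is missing a helmet. A production rule would likely also need confidence thresholds and smoothing across frames to avoid alerting on a single missed detection.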

Building the system

Model Training and Optimization
- YOLOX for PPE Detection: YOLOX is a state-of-the-art object detection framework. To tailor it for this project, we trained it on a custom data set of workers in various industrial settings. The data set included images with and without PPE under different lighting conditions, ensuring the model performs robustly in real-world scenarios.
- YOLOv7 Pose for Posture Monitoring: YOLOv7 Pose excels at key point detection and tracking, making it ideal for recognizing body parts, such as shoulders, elbows, hips, knees, and ankles. By applying heuristics to the model output, we can identify risks such as improper lifting techniques or workers engaging in hazardous actions.

Real-Time Deployment with SAS ESP

Deploying the models through SAS ESP permitted us to use both SAS and open-source models, which enabled us to achieve:
- Event-Driven Alerts: The system immediately notifies workers and supervisors when it detects non-compliance or unsafe behaviours.
- Scalability: SAS ESP’s architecture supports the simultaneous processing of multiple video feeds, making it suitable for large facilities.
- Low-Latency Inference: Processing occurs within milliseconds, ensuring alerts are timely and actionable.

Edge Deployment on Jetson Orin

To ensure real-time performance and data privacy, we chose NVIDIA’s Jetson Orin. This edge-first design keeps all video and PII on-device, which is critical for regulated industries that demand auditability and compliance. A standard webcam serves as the video input, making the solution cost-effective and easy to deploy across diverse environments.

Physical setup: Bringing real-time AI to life

The physical setup for this system was designed to be both practical and scalable for industrial environments, illustrated in Figure 1. At its core is the NVIDIA Jetson Orin, a compact yet powerful edge computing device capable of running advanced AI models with low latency.

Figure 1: Schema of the physical setup

A standard webcam serves as the video input, positioned strategically to capture activity within a designated monitoring zone, where a manufacturing belt is placed, as shown in Figure 2.

Figure 2: Worker safety demo area in the SAS warehouse

A video of the demo in action, showing the workspace and some employees wearing Personal Protective Equipment (PPE) and simulating some safe and unsafe leaning behaviors with real-time alerts, can also be viewed here.

Although this setup demonstrates the concept, in client deployments we use IP cameras to stream footage in real time, which provides higher resolution, flexibility in placement, and better network connectivity for robust performance in industrial environments. In the demo, the area is delimited by crowd management belts. The live stream, with real-time detections and alerts, is visible on the monitor. Alerts also trigger an audible signal.

How it works: A use case

Scenario: A manufacturing plant requires workers to wear helmets and vests while operating heavy machinery.

Monitoring PPE Compliance
The system continuously analyses video feeds. If it detects a worker without a vest, it sends an immediate alert to their supervisor and displays a warning on-site.
Tracking Unsafe Postures
Workers’ movements are tracked throughout the camera’s field of view. If a worker bends over incorrectly to lift a heavy load, the pose estimation model flags the posture and an alert is sent, enabling the supervisor to intervene before an injury occurs.
Real-Time Alerts and Reports
All detections are logged, enabling managers to analyse trends and improve overall safety protocols.
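A posture heuristic of the kind used in the second step might be as simple as measuring how far the torso leans away from vertical. The Python sketch below is an assumption about what such a rule could look like, not the rule used in the project; the key point format and the 45-degree threshold are hypothetical.

```python
import math

def torso_lean_degrees(shoulder, hip):
    """Angle between the hip-to-shoulder line and vertical, in degrees.

    Points are (x, y) in image pixels, with y increasing downward.
    """
    dx = shoulder[0] - hip[0]
    dy = hip[1] - shoulder[1]  # positive when the shoulder is above the hip
    return math.degrees(math.atan2(abs(dx), dy))

def is_unsafe_lean(shoulder, hip, threshold_degrees=45.0):
    """Flag a posture whose torso lean exceeds the (assumed) threshold."""
    return torso_lean_degrees(shoulder, hip) > threshold_degrees

upright = torso_lean_degrees((100, 100), (100, 200))  # shoulder above hip
bent = torso_lean_degrees((200, 190), (100, 200))     # torso near horizontal
```

A single-angle rule cannot tell a safe squat from an unsafe bend, so a practical version would probably combine several joint angles (for example, knee flexion) before raising an alert.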

Results and insights

The system was tested in simulated industrial settings, yielding impressive results:

PPE Detection Accuracy: Great performance even under challenging conditions like poor lighting or partial occlusions.
Latency: Sub-100ms inference times, ensuring near-instantaneous feedback.
Ease of Deployment: The solution was operational within hours, requiring only minimal hardware setup and configuration.

The feedback from test users highlighted the system’s ability to proactively prevent accidents, reducing both human and financial costs.

Benefits of real-time safety monitoring

This project demonstrates how AI can fundamentally enhance workplace safety. Key advantages include:

Proactive Risk Mitigation: By identifying risks in real-time, the system allows for immediate corrective actions.
Improved Compliance: Continuous monitoring ensures adherence to safety protocols without requiring constant human supervision.
Scalability and Cost Efficiency: The edge-based architecture reduces infrastructure costs, making it accessible for organizations of all sizes.

Challenges and Future Directions

No system is without its challenges. For this project, the primary hurdles included:

Data set Diversity: Capturing a sufficient number of varied examples of PPE and unsafe behaviors to ensure robust model performance was a significant challenge. To address this, we tackled data set diversity in-house by collecting a custom data set in a warehouse environment. This setup enabled us to simulate real-world conditions, including varied lighting, angles, and worker movements, ensuring the models were robust and adaptable to diverse scenarios.
Edge Hardware Optimization: Balancing model complexity with inference speed on Jetson Orin. We fine-tuned the YOLOX-s model to maintain high accuracy while ensuring that inference times remained within acceptable limits for real-time applications.

Looking ahead, we can further augment the deployed system to:

Integrate IoT Sensors: Combine video data with environmental sensors (such as noise levels and temperature) to create a comprehensive safety solution. This extends to PLC (Programmable Logic Controller) integration, enabling the implementation of kill switches for electronic machinery.
Enhance Predictive Capabilities: Utilize historical data to forecast and mitigate future risks.
Expand Deployment: Scale the system to handle multi-camera setups across larger facilities.
Expand Use-cases: Improve the system to handle other classes of protective equipment and detect other unsafe behaviours (for example, forklift violations)

Call to Action

If you’re interested in leveraging AI to improve workplace safety, explore the following resources:

Want to see the solution in action? Contact us to explore how real-time computer vision can enhance safety within your organization.

Final Thoughts

As data scientists, we have the unique privilege of transforming complex AI algorithms into practical solutions that make a tangible impact in the real world. This project showcases how cutting-edge computer vision technologies, combined with real-time processing, can proactively save lives and enhance workplace safety.

This is more than a proof of concept; it’s a foundation for scaling AI-driven safety monitoring across entire industries, standardizing compliance, and preventing injuries on a massive scale.

The post Real-time computer vision for worker safety using SAS Event Stream Processing appeared first on The SAS Data Science Blog.

Simulated Annealing (SA) Metaheuristic in SAS Optimization

Subbu Pazhani — Fri, 05 Sep 2025 12:45:22 +0000

Authors: Subbu Pazhani and Rob Pratt

Large-scale real-world optimization problems with advanced business rules are often difficult to solve with standalone traditional optimization algorithms. Metaheuristic algorithms often complement these traditional optimization techniques. These are a class of powerful and flexible algorithms designed to address complex optimization problems, which traditional methods often struggle to solve optimally. They act as a framework for guiding the search process by using problem-specific search methods. These algorithms provide a flexible way to search large solution spaces and quickly identify feasible options. They also refine the most promising ones to find good initial solutions within a reasonable computational time. Traditional optimization techniques further improve and optimize this solution. In this post, we demonstrate a Simulated Annealing (SA) metaheuristic algorithm to solve the Traveling Salesman Problem (TSP) in SAS Optimization.

Researchers have applied the SA metaheuristic algorithm, a popular search method, to various combinatorial optimization problems. It is especially useful for nonlinear functions and large search spaces. The SA algorithm draws inspiration from the physical annealing process, in which a material is heated to a high temperature so that its atoms can rearrange and is then cooled slowly to reach a more stable, lower-energy state. SA applies this concept to optimization problems: at higher temperatures, a probabilistic rule lets the search explore the solution space widely, and as the temperature gradually decreases, the search settles toward a stable and better solution. A key advantage of SA is its ability to escape local optima by accepting an inferior solution according to a probabilistic rule. This enables it to explore large search spaces to find better solutions.
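The probabilistic acceptance rule can be illustrated with a short Python sketch. It mirrors the relative-gap form that appears in the OPTMODEL code later in this post, exp(-((candidate - best) / candidate) / temp), where the reference is the best objective found so far; the function names and the sample objective values are hypothetical.

```python
import math
import random

def acceptance_probability(candidate_obj, reference_obj, temp):
    """Probability of accepting a candidate that is worse than the reference.

    Uses the relative gap (candidate - reference) / candidate, scaled by temp.
    """
    if candidate_obj <= reference_obj:
        return 1.0
    relative_gap = (candidate_obj - reference_obj) / candidate_obj
    return math.exp(-relative_gap / temp)

def accept(candidate_obj, reference_obj, temp, rng=random):
    """Draw a uniform number and accept when the probability exceeds it."""
    return acceptance_probability(candidate_obj, reference_obj, temp) > rng.random()

# The same inferior candidate (500 units worse) at a high and a low temperature.
p_hot = acceptance_probability(10500.0, 10000.0, temp=1.0)
p_cold = acceptance_probability(10500.0, 10000.0, temp=0.01)
```

At temperature 1.0 the inferior candidate is accepted roughly 95% of the time, while at 0.01 the probability drops below 1%, which is how the search moves from exploration to refinement as it cools.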

Pseudocode of the algorithm

This section details the pseudocode of the algorithm along with the details on the variables used in the algorithm, initialization parameters, initialization step, and the steps in the iterations.

best_obj: best upper bound
best_iter: iteration identifier corresponding to the best upper bound
sa_best_obj: best accepted SA solution
sa_best_iter: iteration identifier corresponding to the best accepted SA solution
obj_step: objective of that SA iteration

Initialization parameters

temp: Temperature
MAX_STEPS: maximum iterations
fraction_sa: cooling scheme
max_cons_obj: maximum allowable consecutive solutions where the best_obj of this iteration is the same as the best_obj of the previous iteration

Initialization step

Generate an initial solution string randomly or by using any heuristic and compute its obj_step.
Set best_obj, best_iter, sa_best_obj, sa_best_iter based on the initial solution.

Iterations:

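done = 0
iter = 1
do until (done = 1)
   iter = iter + 1
   /* Update temp according to the cooling scheme (fraction_sa) */
   /* Generate a new neighbor solution from the string corresponding to sa_best_iter
      and compute its objective obj_step */
   if obj_step < best_obj
      /* Improving solution */
      Update best_obj, best_iter, sa_best_obj, sa_best_iter
   else
      /* Compute acceptance probability to accept or reject inferior solution */
      if acceptance probability > rand(0,1)
         /* Accept the inferior neighbor solution */
         Update sa_best_obj, sa_best_iter
   /* Compute number of consecutive iterations without improvement (sa_cons_obj_count) */
   if no improvement in this iteration
      sa_cons_obj_count = sa_cons_obj_count + 1
   else
      sa_cons_obj_count = 0
   if sa_cons_obj_count > max_cons_obj
      /* Restart the search from the best solution found so far */
      sa_best_iter = best_iter
      sa_best_obj = best_obj
 
   /* Stopping criterion: maximum number of iterations reached */
   if iter >= MAX_STEPS
      done = 1
end;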
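For readers who want to experiment outside SAS, the pseudocode can be condensed into a small runnable Python sketch. It is a simplified stand-in and not a port of the OPTMODEL program: it starts from an index-order tour instead of a nearest neighbor tour, uses only segment reversal as the neighborhood move, counts every non-improving step toward the restart limit, and resets that counter after a restart. The eight points are made-up coordinates.

```python
import math
import random

def tour_length(tour, dist):
    """Length of the closed tour, including the edge back to the start."""
    return sum(dist[tour[i]][tour[(i + 1) % len(tour)]] for i in range(len(tour)))

def reverse_segment(tour, rng):
    """Neighbor move: reverse the nodes between two random positions."""
    i, j = sorted(rng.sample(range(len(tour)), 2))
    return tour[:i] + tour[i:j + 1][::-1] + tour[j + 1:]

def simulated_annealing(dist, max_steps=2000, max_cons_obj=10, min_temp=0.01, seed=1):
    rng = random.Random(seed)
    current = list(range(len(dist)))      # initial solution: index order
    best, best_obj = current, tour_length(current, dist)
    stalled = 0
    for step in range(2, max_steps + 1):
        temp = max(min_temp, 1 - step / max_steps)   # linear cooling
        candidate = reverse_segment(current, rng)
        obj = tour_length(candidate, dist)
        if obj < best_obj:
            best, best_obj = candidate, obj          # new global best
            current = candidate
            stalled = 0
        else:
            stalled += 1
            worse = obj > best_obj
            if worse and math.exp(-((obj - best_obj) / obj) / temp) > rng.random():
                current = candidate                  # accept inferior solution
        if stalled > max_cons_obj:
            current = best                           # restart from the best tour
            stalled = 0
    return best, best_obj

# Hypothetical instance: eight points listed in a deliberately poor order.
points = [(0, 0), (3, 4), (6, 0), (3, -4), (1, 3), (5, 3), (5, -3), (1, -3)]
dist = [[math.dist(a, b) for b in points] for a in points]
initial_obj = tour_length(list(range(len(points))), dist)
best_tour, best_obj = simulated_annealing(dist)
print(f"initial: {initial_obj:.2f}, best found: {best_obj:.2f}")
```

Because the best tour is tracked separately from the current one, the returned objective can never be worse than the starting tour, whatever the random draws.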

Input data

To illustrate the SA algorithm and its usefulness, we consider an example from the Traveling Salesman Tour of US Capital Cities. This scenario involves traveling the fewest miles to visit all the capital cities in the US states (and the District of Columbia), excluding Alaska and Hawaii.

PROC OPTMODEL code

The model is coded and solved by using the OPTMODEL procedure in SAS Optimization. We begin by defining the index sets and parameters for the TSP problem and by using a READ DATA statement to read data into the index sets and parameters.

proc optmodel;
   /* Declare parameters and read data */
   set <str,str> NODEPAIRS;
   set NODES = union {<i,j> in NODEPAIRS} {i,j};
   num distance{NODEPAIRS};
   num node_id{NODES};
   num id init 1;
   read data CitiesDist (where=(upcase(city1) ne upcase(city2))) 
      into NODEPAIRS=[city1 city2] distance; 
   set  NUM_NODES = 1..CARD(NODES);
   for {n in NODES} do;
      node_id[n] = id;
      id = id + 1;
   end;

The next line calls a macro named sa_tsp_dec, which defines index sets and variables needed for the SA algorithm:

%sa_tsp_dec();

%macro sa_tsp_dec(
);
 
   /* Simulated Annealing - index sets and variables */
   set MAX_STEPS = 1..20000; 
   set <str> NODES_TMP;
   set UPDATE_SET;
   str sa_tsp_tour{MAX_STEPS,NUM_NODES};
   num infinity = constant('BIG');
   num fraction_sa init 0;
   num temp init 1;
   num obj_step{MAX_STEPS};
   num best_obj_step{MAX_STEPS};
   num sa_best_obj_step{MAX_STEPS};
   num best_obj init infinity;
   num best_iter init 1;
   num sa_best_obj init infinity;
   num sa_best_iter init 1;
   num sa_starttime;
   num sa_endtime;
   num sa_runtime init 0;
   num sa_cons_obj_count init 0;
   num max_cons_obj = 10;
   num node1; 
   num node2; 
   str pair1; 
   str pair2; 
   num min_dist;
   num this_node_dist;
   num next_i{i in NUM_NODES} = if i < card(NUM_NODES) then i+1 else 1;
   str next_node;
   num max_node;
   num size_update_set;
   num dec_roundoff = 0.1;
   num inv_mutation_threhold = 0.1;
   num min_temp = 0.01;
   call streaminit(12345678);
 
%mend sa_tsp_dec;

The following code executes the first iteration of the SA algorithm. The initial solution string is generated by using the nearest neighbor method (defined in the sa_nneighbor macro). Then the objective value of this solution string is evaluated by using the sa_compute_obj macro. This value is recorded as the best objective, with iteration 1 as the best iteration, at both the global and SA algorithm levels. Iteration 2 then starts from this solution string as its best solution.

sa_starttime = time();
   /* Initializing Iteration 1 */
   for {st_sa in {1}} do; 
      /* Generating an initial solution using nearest neighborhood method */
      %sa_nneighbor(); 
 
      /* Evaluating objective function value */
      %sa_compute_obj();
 
      /* Updating best objective and best iteration variables 
         - both at the global level and the SA algorithm level */
      best_obj = obj_step[st_sa];
      best_iter = st_sa;
      sa_best_obj = obj_step[st_sa];
      sa_best_iter = st_sa;
 
      /* Reporting variables */
      best_obj_step[st_sa] = best_obj;
      sa_best_obj_step[st_sa] = sa_best_obj;
 
   end;

%macro sa_nneighbor(
);
 
   NODES_TMP = NODES;
   for {n in NODES: node_id[n] = 1} do;
      sa_tsp_tour[st_sa,node_id[n]] = n;
      NODES_TMP = NODES diff {n};
      leave;
   end;
   id = 1;
   do while (CARD(NODES_TMP)>=1);
      min_dist = infinity;
      for {n1 in NODES_TMP} do;
         this_node_dist = distance[sa_tsp_tour[st_sa,id],n1];
         if min_dist >= this_node_dist then do;
            min_dist = this_node_dist;
            next_node = n1;
         end;
      end;
      id = id + 1;
      sa_tsp_tour[st_sa,id] = next_node;
      NODES_TMP = NODES_TMP diff {next_node};
   end;
 
%mend sa_nneighbor;

%macro sa_compute_obj(
);
 
   obj_step[st_sa] = sum{i in NUM_NODES} distance[sa_tsp_tour[st_sa,i],sa_tsp_tour[st_sa,next_i[i]]];
 
%mend sa_compute_obj;
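The logic of the two macros above can be sketched in Python as follows. This is an illustration rather than a translation: nodes are integer indices into a small, made-up distance matrix, and ties are broken by the lowest index, whereas the macro's `>=` comparison keeps the last tied node it visits.

```python
def nearest_neighbor_tour(dist, start=0):
    """Greedy tour: always move to the closest unvisited node."""
    unvisited = set(range(len(dist))) - {start}
    tour = [start]
    while unvisited:
        last = tour[-1]
        nxt = min(unvisited, key=lambda n: (dist[last][n], n))  # ties: lowest index
        tour.append(nxt)
        unvisited.remove(nxt)
    return tour

def tour_length(tour, dist):
    """Total length of the closed tour, including the edge back to the start."""
    n = len(tour)
    return sum(dist[tour[i]][tour[(i + 1) % n]] for i in range(n))

# Hypothetical symmetric distance matrix for four nodes.
dist = [
    [0, 2, 9, 10],
    [2, 0, 6, 4],
    [9, 6, 0, 3],
    [10, 4, 3, 0],
]
tour = nearest_neighbor_tour(dist)
length = tour_length(tour, dist)
```

Here the greedy tour is 0, 1, 3, 2 with length 18. The closing edge from node 2 back to node 0 costs 9, which shows why a nearest neighbor tour is only a starting point for the SA iterations.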

The next set of statements is a loop for running the SA algorithm iterations. In each iteration, we reduce the annealing (cooling) temperature, generate a neighborhood solution by perturbing the best solution string at the SA algorithm level, evaluate its objective function value, update the best objective and best iteration variables at the global level, and update the best objective and best iteration variables at the SA algorithm level based on the probabilistic acceptance criteria. The algorithm then computes the number of best consecutive solutions, compares it with the maximum allowable consecutive solutions, and resets the best solution string at the SA algorithm level or updates the variable for the number of best consecutive solutions.

for {st_sa in MAX_STEPS: st_sa > 1} do;
 
      /* Computing the cooling temperature step */
      fraction_sa = st_sa / CARD(MAX_STEPS); 
 
      /* Computing temperature */
      temp = max(min_temp,1 - fraction_sa);
 
      /* Generating a neighborhood solution */
      if temp >= inv_mutation_threhold then do; 
         %sa_inverse();
      end;
      else do; 
         %sa_swap(); 
      end;
 
      /* Evaluating objective function value */
      %sa_compute_obj();
 
      /* Updating best objective and best iteration variables 
         - both at the global level and the SA algorithm level */
      if (obj_step[st_sa] < best_obj) then do;
         best_obj = obj_step[st_sa];
         best_iter = st_sa;
         sa_best_obj = obj_step[st_sa];
         sa_best_iter = st_sa;
      end;
      else if (obj_step[st_sa] > best_obj and 
         exp(-((obj_step[st_sa]-best_obj) / obj_step[st_sa]) / temp ) > rand("Uniform")) then do; 
         sa_best_obj = obj_step[st_sa];
         sa_best_iter = st_sa;
      end;
      else do;
         best_obj = best_obj;
         best_iter = best_iter;
         sa_best_obj = sa_best_obj;
         sa_best_iter = sa_best_iter;
      end; 
 
      /* Computing number of consecutive same best solution */
      if (round(obj_step[st_sa-1],dec_roundoff) 
        = round(obj_step[st_sa],dec_roundoff) or obj_step[st_sa] > best_obj) then do;
         sa_cons_obj_count = sa_cons_obj_count + 1;
      end;
      else sa_cons_obj_count = 0; 
      if sa_cons_obj_count > max_cons_obj then do;
         sa_best_iter = best_iter;
         sa_best_obj = best_obj;
      end;
 
      /* Reporting variables */
      best_obj_step[st_sa] = best_obj;
      sa_best_obj_step[st_sa] = sa_best_obj;
 
   end;

The following two macros (sa_inverse and sa_swap) illustrate the inverse mutation scheme and the swap mutation scheme for generating a neighborhood solution.

%macro sa_inverse(
);
   /* Entering inverse mutation */
   do until (node1 ne node2);
      node1 = rand("integer",1,CARD(NUM_NODES)); 
      node2 = rand("integer",1,CARD(NUM_NODES)); 
   end;
 
   /* Initializing the string to SA best iteration */
   for {i in NUM_NODES} sa_tsp_tour[st_sa,i] = sa_tsp_tour[sa_best_iter,i];
 
   /* Performing inverse mutation */
   max_node = max(node1,node2); 
   UPDATE_SET = if node1 < node2 then node1..node2 else node2..node1;
   size_update_set = CARD(UPDATE_SET);
   id = 1; 
   do while (id <= size_update_set);
      for {i in UPDATE_SET} do;
         sa_tsp_tour[st_sa,i] = sa_tsp_tour[sa_best_iter,max_node-id+1];
         id = id + 1;
      end;
   end;
 
%mend sa_inverse;

%macro sa_swap(
);
 
   /* Entering pairwise mutation */
   do until (node1 ne node2);
      node1 = rand("integer",1,CARD(NUM_NODES)); 
      node2 = rand("integer",1,CARD(NUM_NODES)); 
   end;
 
   pair1 = sa_tsp_tour[sa_best_iter,node2]; 
   pair2 = sa_tsp_tour[sa_best_iter,node1]; 
 
   for {i in NUM_NODES} do;
      if i = node1 then sa_tsp_tour[st_sa,i] = pair1;
      else if i = node2 then sa_tsp_tour[st_sa,i] = pair2;
      else sa_tsp_tour[st_sa,i] = sa_tsp_tour[sa_best_iter,i];
   end;
 
%mend sa_swap;
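
To see what the two mutation schemes do to a tour, here is a small Python sketch that works on a plain list. It is an illustration under assumptions, not a translation of the macros: the function names are hypothetical, positions are 1-based to match the SAS indexing, and the tour is a simple list of city labels.

```python
import random


def pick_two_positions(n, rng=random):
    """Draw two distinct 1-based positions, like the do-until loop in the macros."""
    while True:
        pos1 = rng.randint(1, n)
        pos2 = rng.randint(1, n)
        if pos1 != pos2:
            return pos1, pos2


def inverse_mutation(tour, pos1, pos2):
    """Reverse the segment between two 1-based positions (inclusive)."""
    lo, hi = min(pos1, pos2), max(pos1, pos2)
    new_tour = list(tour)
    new_tour[lo - 1:hi] = tour[lo - 1:hi][::-1]
    return new_tour


def swap_mutation(tour, pos1, pos2):
    """Exchange the cities at two 1-based positions; leave the rest unchanged."""
    new_tour = list(tour)
    new_tour[pos1 - 1], new_tour[pos2 - 1] = new_tour[pos2 - 1], new_tour[pos1 - 1]
    return new_tour


if __name__ == "__main__":
    tour = [1, 2, 3, 4, 5, 6]
    print(inverse_mutation(tour, 2, 5))  # prints [1, 5, 4, 3, 2, 6]
    print(swap_mutation(tour, 2, 5))     # prints [1, 5, 3, 4, 2, 6]
```

The inverse mutation changes the order of every city in the selected segment, while the swap mutation touches only the two selected positions.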

The following statements record the run time of the SA algorithm, extract the best solution, and write the resulting output tables:

 /* Run time of the SA algorithm */
   sa_endtime = time();
   sa_runtime = intck('second',sa_starttime,sa_endtime);
   put sa_runtime=;
 
   /* Creating output table */
   set <num,num> TSPEDGES;
   TSPEDGES = union{i in NUM_NODES} {<sa_tsp_tour[sa_best_iter,i],sa_tsp_tour[sa_best_iter,next_i[i]]>};
   create data TSPTourLinks from [city1 city2] = TSPEDGES distance;
   create data TSPSAIterations from [Iteration_no] = MAX_STEPS best_obj_step sa_best_obj_step;
 
quit;

We execute the SA algorithm, coded within PROC OPTMODEL, on the input data. The optimal objective value of the problem is 10,635.09, as reported in the Traveling Salesman Tour of US Capital Cities example. The SA algorithm started with a solution that had an objective value of 13,402.74 and terminated with a solution that had an objective value of 10,918.47 (a gap of 2.66% from the optimal objective value). The algorithm took 4 seconds to run the 20,000 iterations. Table 1 shows a few iterations from the start of the run, and Table 2 shows a few iterations that contain the final solution. Both tables are taken from the TSPSAIterations table.
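
The 2.66% gap can be checked with a few lines of arithmetic. The Python sketch below uses only the objective values quoted above; the function name `optimality_gap_pct` is hypothetical, and the gap is assumed to be measured relative to the optimal objective value.

```python
def optimality_gap_pct(heuristic_obj, optimal_obj):
    """Gap of a heuristic solution from the optimum, as a percentage of the optimum."""
    return 100.0 * (heuristic_obj - optimal_obj) / optimal_obj


if __name__ == "__main__":
    optimal = 10635.09    # optimal tour length quoted in the text
    sa_final = 10918.47   # tour length at which the SA algorithm terminated
    print(f"{optimality_gap_pct(sa_final, optimal):.2f}%")  # prints 2.66%
```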

Table 1: Starting solution

Table 2: Final solution

The quality of the solutions that the SA algorithm generates depends directly on the choice of neighborhood mutation scheme (here, inverse and swap mutations) and on the runtime parameters, including the number of iterations, the cooling scheme, and the mutation threshold. Fine-tuning these aspects can lead to a more robust search process, improving both solution diversity and overall effectiveness on complex optimization problems. However, the right settings are problem-specific and must be determined by experiment.
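
One way to run such experiments is a small grid over mutation schemes and cooling rates, averaged over several random seeds. The Python sketch below is a generic SA loop on a random instance and is not the PROC OPTMODEL implementation from this post: the geometric cooling rule, the starting temperature, the instance size, and all function names are assumptions chosen for illustration, and the candidate is compared with the currently accepted tour.

```python
import math
import random


def tour_length(tour, pts):
    """Length of the closed tour that visits pts in the given order."""
    n = len(tour)
    return sum(math.dist(pts[tour[i]], pts[tour[(i + 1) % n]]) for i in range(n))


def mutate(tour, scheme, rng):
    """Return a neighbor tour using the 'inverse' or 'swap' scheme."""
    i, j = sorted(rng.sample(range(len(tour)), 2))
    new_tour = list(tour)
    if scheme == "inverse":
        new_tour[i:j + 1] = new_tour[i:j + 1][::-1]
    else:  # "swap"
        new_tour[i], new_tour[j] = new_tour[j], new_tour[i]
    return new_tour


def run_sa(pts, scheme, iterations, temp0, cooling, seed):
    """Run one SA pass and return the best tour length seen."""
    rng = random.Random(seed)
    current = list(range(len(pts)))
    current_len = tour_length(current, pts)
    best_len = current_len
    temp = temp0
    for _ in range(iterations):
        cand = mutate(current, scheme, rng)
        cand_len = tour_length(cand, pts)
        worsening = (cand_len - current_len) / cand_len
        # Always accept improvements; accept inferior tours with a
        # probability that shrinks as the temperature falls.
        if cand_len < current_len or math.exp(-worsening / max(temp, 1e-12)) > rng.random():
            current, current_len = cand, cand_len
        best_len = min(best_len, current_len)
        temp *= cooling  # geometric cooling (an assumption of this sketch)
    return best_len


def compare_settings(n_cities=30, iterations=2000, seeds=(0, 1, 2)):
    """Average best tour length for each (scheme, cooling rate) pair."""
    rng = random.Random(42)
    pts = [(rng.random(), rng.random()) for _ in range(n_cities)]
    results = {}
    for scheme in ("inverse", "swap"):
        for cooling in (0.99, 0.999):
            lengths = [run_sa(pts, scheme, iterations, 0.05, cooling, s) for s in seeds]
            results[(scheme, cooling)] = sum(lengths) / len(lengths)
    return results


if __name__ == "__main__":
    for setting, avg_len in sorted(compare_settings().items(), key=lambda kv: kv[1]):
        print(setting, round(avg_len, 3))
```

Averaging over seeds matters because a single SA run is random; a setting that wins on one seed can lose on another.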

Plots of solutions

For the statements that produce a graphical display of the solution, refer to the Traveling Salesman Tour of US Capital Cities example. Figures 1 and 2 show the optimal tour and the best tour of the capital cities generated by the SA algorithm, respectively. As the figures show, the two tours differ in some of their routes.

Figure 1: Optimal solution

Figure 2: Best solution from SA

The following two PROC SGPLOT steps plot the progress of the algorithm over the iterations. Figure 3 shows the best objective value found so far, and Figure 4 shows the objective value of the solution currently accepted by the SA algorithm. Figure 4 reveals the probabilistic behavior of the algorithm: the objective value occasionally increases when an inferior solution is accepted, and then improvement resumes. The overall downward trend captures the algorithm’s effectiveness in navigating the solution landscape toward near-optimality.

/* Progress of best objective */
title 'Iterations vs best objective value';
%let optimal = 10635;
proc sgplot data = TSPSAIterations noautolegend;
   step x = Iteration_no y = best_obj_step / markers;
   refline &optimal.;
   xaxis label = 'Iteration';
   yaxis label = 'Objective Value' min = 10500;
run;
 
/* Progress of sa best objective */
title 'Iterations vs best objective value of the solution accepted by SA';
proc sgplot data = TSPSAIterations noautolegend;
   step x = Iteration_no y = sa_best_obj_step / markers;
   refline &optimal.;
   xaxis label = 'Iteration';
   yaxis label = 'Objective Value' min = 10500;
run;

Figure 3: Iterations versus best objective value

Figure 4: Iterations versus best objective value of the solution accepted by SA

Conclusion

This example illustrates solving a TSP by using an SA algorithm coded and executed in PROC OPTMODEL. Note that SA is a heuristic, so it provides only an approximation to the global optimum. However, it can be useful for obtaining a good, feasible solution for a wide variety of complex, real-world problems. Other applications of SA include vehicle routing problems and their extensions, supply chain network design, and scheduling problems.

Another extension uses this SA framework in conjunction with the state-of-the-art solvers in SAS Optimization. For example, SA can fix some of the complicating variables, and a solver can then solve the optimization model to determine the remaining unfixed variables.

SAS Optimization has specialized solvers for various graph theory, combinatorial optimization, and network analysis algorithms, including TSP.

The post Simulated Annealing (SA) Metaheuristic in SAS Optimization appeared first on The SAS Data Science Blog.
