<?xml version="1.0" encoding="UTF-8"?><rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Ai Junzi</title>
	<atom:link href="https://blog.forth-media.com/feed/" rel="self" type="application/rss+xml" />
	<link>https://blog.forth-media.com</link>
	<description>blog.forthmedia</description>
	<lastBuildDate>Mon, 13 Oct 2025 19:08:42 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>
	hourly	</sy:updatePeriod>
	<sy:updateFrequency>
	1	</sy:updateFrequency>
	

<image>
	<url>https://blog.forth-media.com/wp-content/uploads/2022/12/cropped-aijunzi_icon-32x32.png</url>
	<title>Ai Junzi</title>
	<link>https://blog.forth-media.com</link>
	<width>32</width>
	<height>32</height>
</image> 
	<item>
		<title>Navigating AI Systems Integration</title>
		<link>https://blog.forth-media.com/2025/01/12/navigating-ai-systems-integration/</link>
					<comments>https://blog.forth-media.com/2025/01/12/navigating-ai-systems-integration/#respond</comments>
		
		<dc:creator><![CDATA[aijunzi]]></dc:creator>
		<pubDate>Sun, 12 Jan 2025 20:00:00 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[Cloud]]></category>
		<category><![CDATA[Data]]></category>
		<category><![CDATA[Foundation Models]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[Hardware]]></category>
		<category><![CDATA[aiTechStack]]></category>
		<guid isPermaLink="false">https://blog.forth-media.com/?p=783</guid>

					<description><![CDATA[An introduction to the AI Technology Stack Artificial Intelligence (AI) is integrated into our core business processes and daily workflows. You&#8217;ve heard the buzz, but how does it work? It&#8217;s actually a combination of different technologies. That&#8217;s what we&#8217;ll break down in this post. I have divided AI into functional layers and created a tower ... <a title="Navigating AI Systems Integration" class="read-more" href="https://blog.forth-media.com/2025/01/12/navigating-ai-systems-integration/" aria-label="Read more about Navigating AI Systems Integration">Read more</a>]]></description>
										<content:encoded><![CDATA[		<div class="wpulike wpulike-default " ><div class="wp_ulike_general_class wp_ulike_is_not_liked"><button type="button"
					aria-label="Like Button"
					data-ulike-id="783"
					data-ulike-nonce="61b0a59406"
					data-ulike-type="post"
					data-ulike-template="wpulike-default"
					data-ulike-display-likers=""
					data-ulike-likers-style="popover"
					class="wp_ulike_btn wp_ulike_put_image wp_post_btn_783"></button><span class="count-box wp_ulike_counter_up" data-ulike-counter-value="0"></span>			</div></div>
	
<h5 class="wp-block-heading">An introduction to the AI Technology Stack</h5>



<figure class="wp-block-image size-full"><a href="https://blog.forth-media.com/wp-content/uploads/2025/01/AI-Tech-Stack-800.png"><img fetchpriority="high" decoding="async" width="800" height="400" src="https://blog.forth-media.com/wp-content/uploads/2025/01/AI-Tech-Stack-Wide-400.png" alt="AI technology stack" class="wp-image-782" srcset="https://blog.forth-media.com/wp-content/uploads/2025/01/AI-Tech-Stack-Wide-400.png 800w, https://blog.forth-media.com/wp-content/uploads/2025/01/AI-Tech-Stack-Wide-400-300x150.png 300w, https://blog.forth-media.com/wp-content/uploads/2025/01/AI-Tech-Stack-Wide-400-768x384.png 768w" sizes="(max-width: 800px) 100vw, 800px" /></a><figcaption class="wp-element-caption">AI technology stack (click to zoom)</figcaption></figure>



<p class="has-medium-font-size wp-block-paragraph">Artificial Intelligence (AI) is integrated into our core business processes and daily workflows. You&#8217;ve heard the buzz, but how does it work? It&#8217;s actually a combination of different technologies. That&#8217;s what we&#8217;ll break down in this post.</p>



<p class="wp-block-paragraph">I have divided AI into functional layers and created a tower diagram to guide our discussion.</p>



<span id="more-783"></span>



<h4 class="wp-block-heading">The Foundation: Hardware and Chips</h4>



<p class="wp-block-paragraph">Computer hardware and chips are the foundation layer for AI. They are the components that perform the programmed calculations. We’re talking about powerful CPU-based servers and workstations integrated with specialized processors like GPUs.</p>



<p class="wp-block-paragraph">AI is not only about algorithms and models, but also about the underlying physical infrastructure that makes higher-level abstractions possible.</p>



<h4 class="wp-block-heading">The Fundamentals: Data</h4>



<p class="wp-block-paragraph">AI systems fundamentally require data for input. Imagine data as patterns that energize fluctuations in a complex network. Each data point—whether a customer record, a sales figure, a product image, or text—contributes to the overall flow and output of the system.</p>



<p class="wp-block-paragraph">This diverse data landscape is used by AI to evaluate and generate results. The quality and quantity of data directly impact AI performance.</p>



<h4 class="wp-block-heading">The Network Infrastructure: Cloud</h4>



<p class="wp-block-paragraph">How is AI data exchanged and saved? The cloud, or network, layer is the infrastructure that interacts with and backs up the devices you use. AI processes run on a shared cloud of computers that automatically scale up or down, and restart, as needed.</p>



<p class="wp-block-paragraph">Commercial cloud is not the only option. You can also maintain a private network and keep everything on-premises, meaning that all the hardware and software is located at your place of business.</p>



<h4 class="wp-block-heading">The Tools: Frameworks and Toolkits</h4>



<p class="wp-block-paragraph">AI developers don’t build from scratch; they use frameworks and toolkits. These are software resources that serve as building blocks for creating AI systems. They allow engineers to design the flow of data processing.</p>



<p class="wp-block-paragraph">This level provides access to foundation models, and supports the creation of powerful AI applications.</p>



<h4 class="wp-block-heading">Pre-Built Components: Foundation Models</h4>



<p class="wp-block-paragraph">Only a small group of large high-tech companies create what are known as foundation models. These are pre-trained AI models that have already learned from massive amounts of data, providing the base for a wide range of applications. They avoid the need to train models from scratch and accelerate innovation.</p>



<p class="wp-block-paragraph">This level is a powerful shortcut, that can be adapted for various tasks, such as generating text or images, and recognizing patterns that are otherwise undetectable.</p>



<h4 class="wp-block-heading">The Final Product: AI Applications</h4>



<p class="wp-block-paragraph">At the top of the AI tech stack are user-facing AI applications. These are designed to be intuitive, easy to use, and keep humans in the loop (HITL). Examples include a chatbot that answers your questions, a recommendation system that suggests your next query, or an image app that creates the pictures you describe.</p>



<p class="wp-block-paragraph">The application layer we interact with every day is where AI becomes genuinely useful. It is the culmination of all the layers below, working together to put the power of AI beneath our fingertips.</p>



<h4 class="wp-block-heading">Putting it all together </h4>



<p class="wp-block-paragraph">With the layers of the AI Tech Stack explored, we can see how it all comes together. For me, working with this framework helps make sense of the fast-paced and sometimes bewildering changes in the tech world. I hope this framework helps you frame your own thoughts about the evolution of artificial intelligence.</p>
]]></content:encoded>
					
					<wfw:commentRss>https://blog.forth-media.com/2025/01/12/navigating-ai-systems-integration/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>AI: Asking the Right Questions 2</title>
		<link>https://blog.forth-media.com/2024/11/08/ai-asking-the-right-questions-2/</link>
					<comments>https://blog.forth-media.com/2024/11/08/ai-asking-the-right-questions-2/#respond</comments>
		
		<dc:creator><![CDATA[aijunzi]]></dc:creator>
		<pubDate>Fri, 08 Nov 2024 19:31:08 +0000</pubDate>
				<category><![CDATA[Anthropic]]></category>
		<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Claude]]></category>
		<category><![CDATA[Gemini]]></category>
		<category><![CDATA[Generative AI]]></category>
		<category><![CDATA[Google]]></category>
		<category><![CDATA[Grok]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[Meta]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Phi]]></category>
		<category><![CDATA[X]]></category>
		<category><![CDATA[CoT]]></category>
		<category><![CDATA[genAI]]></category>
		<guid isPermaLink="false">https://blog.forth-media.com/?p=719</guid>

					<description><![CDATA[Chain of thought geometric reasoning The technique of Chain-of-Thought (CoT) prompting evaluates the ability of an AI model to break complex problems into a series of logical steps. To validate the integrity and reproducibility of the expected results, two volunteers successfully completed the exercise embedded in this paper, prior to querying several large language models ... <a title="AI: Asking the Right Questions 2" class="read-more" href="https://blog.forth-media.com/2024/11/08/ai-asking-the-right-questions-2/" aria-label="Read more about AI: Asking the Right Questions 2">Read more</a>]]></description>
										<content:encoded><![CDATA[		<div class="wpulike wpulike-default " ><div class="wp_ulike_general_class wp_ulike_is_not_liked"><button type="button"
					aria-label="Like Button"
					data-ulike-id="719"
					data-ulike-nonce="3b39642dea"
					data-ulike-type="post"
					data-ulike-template="wpulike-default"
					data-ulike-display-likers=""
					data-ulike-likers-style="popover"
					class="wp_ulike_btn wp_ulike_put_image wp_post_btn_719"></button><span class="count-box wp_ulike_counter_up" data-ulike-counter-value="0"></span>			</div></div>
	
<h5 class="wp-block-heading">Chain of thought geometric reasoning</h5>



<figure class="wp-block-image size-full"><a href="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compas-crop.png"><img decoding="async" width="800" height="400" src="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compas-400.png" alt="" class="wp-image-720" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compas-400.png 800w, https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compas-400-300x150.png 300w, https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compas-400-768x384.png 768w" sizes="(max-width: 800px) 100vw, 800px" /></a><figcaption class="wp-element-caption">Geometric exercise (click to zoom)</figcaption></figure>



<p class="has-medium-font-size wp-block-paragraph">The technique of Chain-of-Thought (CoT) prompting evaluates the ability of an AI model to break complex problems into a series of logical steps.  To validate the integrity and reproducibility of the expected results, two volunteers successfully completed the exercise embedded in this paper, prior to querying several large language models (LLMs).</p>



<span id="more-719"></span>



<p class="wp-block-paragraph">The LLMs were prompted to evaluate a straightedge and compass construction illustrating Definitions 3 &amp; 4 from Euclid&#8217;s &#8220;Elements&#8221;, Book III.  Two open-source models ran in a laboratory environment on a single GPU isolated from the Internet.  The remaining models were tested online.</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="has-accent-color has-text-color has-link-color has-medium-font-size wp-elements-d14dec835f65db97146c00c3ae22a7f8 wp-block-paragraph">No, none of the straight lines 1, 2, 3, or 4 are at a greater distance from the center of the circle. Due to the symmetric nature of the construction.</p>



<p class="has-small-font-size wp-block-paragraph">— Claude 3.5 (The construction is actually rectangular)</p>
</blockquote>



<p class="wp-block-paragraph">Many of the LLMs correctly parsed the logic but were unable to reach a conclusion absent exact measurement.  Note that Euclid&#8217;s approach in &#8220;Elements&#8221; was to emphasize general principles and logical reasoning rather than numerical calculation.</p>



<p class="wp-block-paragraph">Some of the LLMs gave contradictory answers within the same response.  One model produced an SVG diagram of a squared-off construction, whereas the instructions specifically stated rectangular.</p>



<p class="wp-block-paragraph">The steps are easy to complete.  The answer to both questions is yes.  None of the LLMs successfully answered the prompt.</p>



<h2 class="wp-block-heading">Understanding Euclidean Geometric Construction</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img decoding="async" width="600" height="480" src="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-600.png" alt="Figure 1" class="wp-image-735" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-600.png 600w, https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-600-300x240.png 300w" sizes="(max-width: 600px) 100vw, 600px" /><figcaption class="wp-element-caption">Figure 1</figcaption></figure>
</div>


<p class="has-medium-font-size wp-block-paragraph"><em>Euclid&nbsp;is one of the most influential figures in the history of mathematics.  He is known for a comprehensive description of geometry, &#8220;Elements&#8221;.  Its proofs are known for their logical rigor and their systematic approach.  In this essay I illustrate simple straightedge and compass construction.</em></p>



<h4 class="wp-block-heading">Clarifying definitions by illustration</h4>



<p class="wp-block-paragraph">The essay in Part 1 of this series required both mathematical understanding and a high level of reading comprehension.  This helped us to evaluate Artificial Intelligence (AI) Large Language Models (LLMs) on both of those challenges.</p>



<p class="wp-block-paragraph">In this part we want to make geometric concepts as easy to understand as possible.</p>



<p class="wp-block-paragraph">&#8220;Elements&#8221; Book III begins with a number of Definitions.  However, reading through them we find there&#8217;s a lot to unpack here.  Let&#8217;s try to make these easier to understand with a drawing <strong>(Figure1)</strong>.</p>



<h4 class="wp-block-heading">The definitions</h4>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="has-accent-color has-text-color has-link-color has-medium-font-size wp-elements-5530853a1ef69343dd320cf40a1dabe3 wp-block-paragraph">4.  In a circle straight lines are said to be equally distant from the center when the perpendiculars drawn to them from the center are equal.</p>



<p class="has-accent-color has-text-color has-link-color has-medium-font-size wp-elements-ff1a46d867435a6611aef58457a2d732 wp-block-paragraph">5.  And that straight line is said to be at a greater distance on which the greater perpendicular falls.</p>



<p class="has-small-font-size wp-block-paragraph">— Elements, Book 3 Definitions</p>
</blockquote>



<h4 class="wp-block-heading">Geometry exercise</h4>



<p class="wp-block-paragraph">Tools:</p>



<ul class="wp-block-list">
<li>Blank sheet of paper</li>



<li>Straightedge</li>



<li>Compass</li>



<li>Red marker</li>



<li>Orange marker</li>
</ul>



<h4 class="wp-block-heading">Using a straightedge and compass complete the following steps</h4>



<ol class="wp-block-list">
<li>On a blank sheet of rectangular paper draw a line from the top-left corner to the bottom right corner of the page.</li>



<li>Draw a line from the bottom left corner to the top right corner. The lines create an intersection at the center of the page.</li>



<li>Set the compass to a width that allows a complete circle small enough to fit on the page. From the center of the page draw a circle.</li>



<li>Label the point at the top-left where the line crosses the circle, point A.<br>Label the point at the top-right where the line crosses the circle, point B.<br>Label the point at the bottom-right where the line crosses the circle, point C.<br>Label the point at the bottom-left where the line crosses the circle, point D.</li>



<li>Without changing the compass, use it to mark two points on the circle, starting from point B.<br>Label the new point between A and B as point E.<br>Label the new point between B and C as point F.</li>



<li>Without changing the compass, use it to mark two points on the circle starting from point D.<br>Label the new point between C and D as point G.<br>Label the new point between D and A as point H.</li>



<li>Draw a straight line from point E to point F and label it line 1.<br>Draw a straight line from point G to point H and label it line 2.<br>Draw a straight line from point B to point G and label it line 3.<br>Draw a straight line from point E to point H and label it line 4.</li>



<li>Using the compass, bisect lines 1, 2, 3 and 4 to find the center of each line.</li>



<li>From the center point of lines 1, 2, 3, and 4 draw perpendicular lines to the center point of the circle.</li>



<li>For this exercise make the perpendiculars from lines 1, 2, and 3 red.<br>Make the perpendicular from line 4 orange.</li>
</ol>



<h4 class="wp-block-heading">Answer the following questions</h4>



<ol class="wp-block-list">
<li>Are any of the straight lines 1, 2, 3, or 4 at a greater distance from the center of the circle?</li>



<li>Is the orange line longer than any of the red?</li>
</ol>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="726" height="1024" src="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-full-726x1024.png" alt="Figure 2" class="wp-image-744" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-full-726x1024.png 726w, https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-full-213x300.png 213w, https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-full-768x1084.png 768w, https://blog.forth-media.com/wp-content/uploads/2024/11/straightedge-compass-full.png 800w" sizes="auto, (max-width: 726px) 100vw, 726px" /><figcaption class="wp-element-caption">Figure 2</figcaption></figure>
</div>


<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h6 class="wp-block-heading">Appendix A:  Drawings by volunteers</h6>



<figure class="wp-block-gallery has-nested-images columns-default is-cropped has-base-background-color has-background wp-block-gallery-1 is-layout-flex wp-block-gallery-is-layout-flex">
<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="400" height="515" data-id="746" src="https://blog.forth-media.com/wp-content/uploads/2024/11/sc-volunteer-1-sm.png" alt="Drawing volunteer 1" class="wp-image-746" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/sc-volunteer-1-sm.png 400w, https://blog.forth-media.com/wp-content/uploads/2024/11/sc-volunteer-1-sm-233x300.png 233w" sizes="auto, (max-width: 400px) 100vw, 400px" /></figure>



<figure class="wp-block-image size-large"><img loading="lazy" decoding="async" width="400" height="515" data-id="747" src="https://blog.forth-media.com/wp-content/uploads/2024/11/sc-volunteer-2-sm.png" alt="Drawing volunteer 2" class="wp-image-747" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/sc-volunteer-2-sm.png 400w, https://blog.forth-media.com/wp-content/uploads/2024/11/sc-volunteer-2-sm-233x300.png 233w" sizes="auto, (max-width: 400px) 100vw, 400px" /></figure>
</figure>



<h6 class="wp-block-heading">Appendix B:  CoT prompt 2</h6>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="794" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-prompt-2-640.png" alt="CoT prompt 2" class="wp-image-749" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-prompt-2-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-prompt-2-640-242x300.png 242w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix C:  Anthropic Claude 3</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;No, the orange line (perpendicular to line 4) is not longer than any of the red lines.&#8221;</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="580" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-claude-prompt-2-640.png" alt="Anthropic Claude 3" class="wp-image-750" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-claude-prompt-2-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-claude-prompt-2-640-300x272.png 300w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<p class="has-small-font-size wp-block-paragraph">The construction is squared-off, producing errors.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="853" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-claude-promt-2-diagram-640.png" alt="Claude 3 SVG" class="wp-image-751" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-claude-promt-2-diagram-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-claude-promt-2-diagram-640-225x300.png 225w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix D:  X Grok</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;The perpendiculars from the center to these chords would be equal if the chords were at equal distances from the center, which they are not in this setup.&#8221;  Ergo, the answer is <em>yes</em> and not &#8220;we can&#8217;t definitively say.&#8221;</p>


<div class="wp-block-image">
<figure class="aligncenter size-large"><img loading="lazy" decoding="async" width="565" height="1024" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-grok-prompt-2-640-565x1024.png" alt="X Grok" class="wp-image-753" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-grok-prompt-2-640-565x1024.png 565w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-grok-prompt-2-640-166x300.png 166w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-grok-prompt-2-640.png 640w" sizes="auto, (max-width: 565px) 100vw, 565px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix E:  OpenAI ChatGPT</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;The answer here would depend on the relative lengths of these perpendiculars, which could vary depending on the exact configuration of points on your paper.&#8221;  Dodges the question.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="822" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-openai-prompt-2-640.png" alt="OpenAI ChatGPT" class="wp-image-754" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-openai-prompt-2-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-openai-prompt-2-640-234x300.png 234w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix F:  Google Gemini</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;No. All the lines 1, 2, 3, and 4 are equidistant from the center of the circle.&#8221;  Contradicts the answer to question 2, &#8220;Yes. The orange line (line 4) is longer than any of the red lines&#8230; The orange line is a diameter of the circle.&#8221;</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="869" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-gemini-prompt-2-640.png" alt="Google Gemini" class="wp-image-757" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-gemini-prompt-2-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-gemini-prompt-2-640-221x300.png 221w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix G:  Meta Llama 3.2 (open-source. lab test)</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;No. based on the instructions, the orange line is not longer than any of the red lines..&#8221;</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="478" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-llama-prompt-2-640.png" alt="" class="wp-image-758" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-llama-prompt-2-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-llama-prompt-2-640-300x224.png 300w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix H:  Microsoft Phi-3 (open source. lab test)</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;The orange line (which is the perpendicular from line 4) is not necessarily longer than any of the red lines.&#8221;</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="448" src="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-phi-prompt-2-640.png" alt="" class="wp-image-759" srcset="https://blog.forth-media.com/wp-content/uploads/2024/11/cot-phi-prompt-2-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/11/cot-phi-prompt-2-640-300x210.png 300w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix I:  Summary of LLM release notes and benchmarks</h6>



<h6 class="wp-block-heading"><a href="https://www.anthropic.com/news/claude-3-family" target="_blank" rel="noreferrer noopener">Anthropic Claude 3</a></h6>



<p class="wp-block-paragraph">Opus, our most intelligent model, outperforms its peers on most of the common evaluation benchmarks for AI systems, including undergraduate level expert knowledge (MMLU), graduate level expert reasoning (GPQA), basic mathematics (GSM8K), and more. It exhibits near-human levels of comprehension and fluency on complex tasks, leading the frontier of general intelligence.</p>



<p class="has-base-background-color has-background wp-block-paragraph">GSM8K is a dataset of linguistically diverse grade school math word problems requiring multi-step reasoning.</p>



<hr class="wp-block-separator has-alpha-channel-opacity is-style-dots"/>



<h6 class="wp-block-heading"><a href="https://x.ai/blog/grok-2" target="_blank" rel="noreferrer noopener">X Grok</a></h6>



<p class="wp-block-paragraph">Both Grok-2 and Grok-2 mini demonstrate significant improvements over our previous Grok-1.5 model. They achieve performance levels competitive to other frontier models in areas such as graduate-level science knowledge (GPQA), general knowledge (MMLU, MMLU-Pro), and math competition problems (MATH). Additionally, Grok-2 excels in vision-based tasks, delivering state-of-the-art performance in visual math reasoning (MathVista) and in document-based question answering (DocVQA).</p>



<p class="has-base-background-color has-background wp-block-paragraph">MathVista is a benchmark designed to evaluate the mathematical reasoning capabilities of AI models, particularly in visual contexts.</p>



<hr class="wp-block-separator has-alpha-channel-opacity is-style-dots"/>



<h6 class="wp-block-heading"><a href="https://openai.com/index/gpt-4-research/" target="_blank" rel="noreferrer noopener">OpenAI ChatGPT</a></h6>



<p class="wp-block-paragraph">We’ve created GPT-4, the latest milestone in OpenAI’s effort in scaling up deep learning. GPT-4 is a large multimodal model (accepting image and text inputs, emitting text outputs) that, while less capable than humans in many real-world scenarios, exhibits human-level performance on various professional and academic benchmarks.</p>



<p class="has-base-background-color has-background wp-block-paragraph">SAT (Scholastic Assessment Test) is a standardized test widely used for college admissions in the United States. The Math section assesses mathematical skills and problem-solving abilities, including geometry.</p>



<hr class="wp-block-separator has-alpha-channel-opacity is-style-dots"/>



<h6 class="wp-block-heading"><a href="https://blog.google/technology/ai/google-gemini-ai/" target="_blank" rel="noreferrer noopener">Google Gemini</a></h6>



<p class="wp-block-paragraph">We designed Gemini to be natively multimodal, pre-trained from the start on different modalities. Then we fine-tuned it with additional multimodal data to further refine its effectiveness. This helps Gemini seamlessly understand and reason about all kinds of inputs from the ground up.</p>



<p class="has-base-background-color has-background wp-block-paragraph">The MATH benchmark focuses on a broad range of mathematical concepts, including algebra, geometry, and calculus.</p>



<hr class="wp-block-separator has-alpha-channel-opacity is-style-dots"/>



<h6 class="wp-block-heading"><a href="https://ai.meta.com/blog/meta-llama-3/" target="_blank" rel="noreferrer noopener">Meta Llama 3</a></h6>



<p class="wp-block-paragraph">In the development of Llama 3, we looked at model performance on standard benchmarks and also sought to optimize for performance for real-world scenarios. To this end, we developed a new high-quality human evaluation set. This evaluation set contains 1,800 prompts that cover 12 key use cases: asking for advice, brainstorming, classification, closed question answering, coding, creative writing, extraction, inhabiting a character/persona, open question answering, reasoning, rewriting, and summarization.</p>



<p class="has-base-background-color has-background wp-block-paragraph">The ARC Challenge (Abstraction and Reasoning Corpus) is a benchmark designed to evaluate an AI model&#8217;s ability to solve problems requiring abstract reasoning and pattern recognition.</p>



<hr class="wp-block-separator has-alpha-channel-opacity is-style-dots"/>



<h6 class="wp-block-heading"><a href="https://azure.microsoft.com/en-us/blog/introducing-phi-3-redefining-whats-possible-with-slms/" target="_blank" rel="noreferrer noopener">Microsoft Phi-3</a></h6>



<p class="wp-block-paragraph">Phi-3 models significantly outperform language models of the same and larger sizes on key benchmarks.</p>



<p class="has-base-background-color has-background wp-block-paragraph">The OpenBookQA benchmark is a dataset designed to evaluate an AI model&#8217;s understanding of core science facts and their application to novel situations.</p>



<p class="wp-block-paragraph"></p>
]]></content:encoded>
					
					<wfw:commentRss>https://blog.forth-media.com/2024/11/08/ai-asking-the-right-questions-2/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
		<item>
		<title>AI: Asking the Right Questions</title>
		<link>https://blog.forth-media.com/2024/09/14/ai-asking-the-right-questions/</link>
					<comments>https://blog.forth-media.com/2024/09/14/ai-asking-the-right-questions/#respond</comments>
		
		<dc:creator><![CDATA[aijunzi]]></dc:creator>
		<pubDate>Sat, 14 Sep 2024 16:12:56 +0000</pubDate>
				<category><![CDATA[Artificial Intelligence]]></category>
		<category><![CDATA[ChatGPT]]></category>
		<category><![CDATA[Gemini]]></category>
		<category><![CDATA[Generative AI]]></category>
		<category><![CDATA[GPU]]></category>
		<category><![CDATA[LLM]]></category>
		<category><![CDATA[Meta]]></category>
		<category><![CDATA[Microsoft]]></category>
		<category><![CDATA[OpenAI]]></category>
		<category><![CDATA[Phi]]></category>
		<category><![CDATA[CoT]]></category>
		<category><![CDATA[genAI]]></category>
		<guid isPermaLink="false">https://blog.forth-media.com/?p=616</guid>

					<description><![CDATA[Chain of thought prompting The technique of Chain-of-Thought (CoT) prompting evaluates the ability of an AI model to break complex problems into a series of logical steps.&#160; The essay embedded in this post was applied to generative artificial intelligence (GenAI) models to test their ability to process geometric numerical analysis. Their outputs offer insight into ... <a title="AI: Asking the Right Questions" class="read-more" href="https://blog.forth-media.com/2024/09/14/ai-asking-the-right-questions/" aria-label="Read more about AI: Asking the Right Questions">Read more</a>]]></description>
										<content:encoded><![CDATA[		<div class="wpulike wpulike-default " ><div class="wp_ulike_general_class wp_ulike_is_not_liked"><button type="button"
					aria-label="Like Button"
					data-ulike-id="616"
					data-ulike-nonce="0544897403"
					data-ulike-type="post"
					data-ulike-template="wpulike-default"
					data-ulike-display-likers=""
					data-ulike-likers-style="popover"
					class="wp_ulike_btn wp_ulike_put_image wp_post_btn_616"></button><span class="count-box wp_ulike_counter_up" data-ulike-counter-value="0"></span>			</div></div>
	
<h5 class="wp-block-heading">Chain of thought prompting</h5>



<figure class="wp-block-image size-full"><a href="https://blog.forth-media.com/wp-content/uploads/2024/09/asking-right-questions.png"><img loading="lazy" decoding="async" width="800" height="400" src="https://blog.forth-media.com/wp-content/uploads/2024/09/asking-right-questions-400.png" alt="geometric measurement" class="wp-image-618" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/asking-right-questions-400.png 800w, https://blog.forth-media.com/wp-content/uploads/2024/09/asking-right-questions-400-300x150.png 300w, https://blog.forth-media.com/wp-content/uploads/2024/09/asking-right-questions-400-768x384.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /></a></figure>



<p class="has-medium-font-size wp-block-paragraph">The technique of Chain-of-Thought (CoT) prompting evaluates the ability of an AI model to break complex problems into a series of logical steps.&nbsp; The essay embedded in this post was applied to generative artificial intelligence (GenAI) models to test their ability to process geometric numerical analysis. Their outputs offer insight into breakthroughs at the forefront AI chatbot development.</p>



<span id="more-616"></span>



<h4 class="wp-block-heading">Critical Thinking</h4>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="has-accent-color has-text-color has-link-color has-medium-font-size wp-elements-d3121a5d45812db1ba06e45664f4229d wp-block-paragraph">The square of the hypotenuse (156.25) is actually greater than the sum of the squares on the sides (241.25) by 85.</p>



<p class="has-small-font-size wp-block-paragraph">— Meta Llama 3.1 8B-Instruct on scaled down hardware. (156.25 is actually less than 241.25)</p>
</blockquote>



<p class="wp-block-paragraph">CoT prompting was applied to two open-source large language models (LLMs) running on a single GPU isolated from the Internet. Under these conditions smaller models produced output that is clearly wrong when examined with critical thinking.</p>



<p class="wp-block-paragraph">One model asserted that the essay was not entirely correct because 156.25 is greater than 241.25. A second model failed to process geometric logic and failed at number substitution.</p>



<p class="wp-block-paragraph">The prompt was also applied to large-scale GenAI models running on cloud servers. These evaluated the essay correctly.  The outputs of four models are shown as screenshots in the appendices to this post.</p>



<p class="wp-block-paragraph">Not tested was the recently announced <em>OpenAI o1</em>. This new series of models was designed to spend more time processing before they respond, in order to solve harder problems in science, coding and math. Such as the essay below.</p>



<p class="wp-block-paragraph">(As an aside to my regular readers, the <em>AiJunzi</em> platform was upgraded to Microsoft Phi 3.5 since <a href="https://bit.ly/3yJZSiz">my last post.</a>)</p>



<h2 class="wp-block-heading">Discerning Pythagorean and Euclidean Logic</h2>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="800" height="600" src="https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13.png" alt="Figure 1 acute triangle" class="wp-image-621" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13.png 800w, https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13-300x225.png 300w, https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13-768x576.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /><figcaption class="wp-element-caption">FIgure 1</figcaption></figure>
</div>


<p class="has-medium-font-size wp-block-paragraph"><em>Pythagoras&nbsp;and&nbsp;Euclid&nbsp;are two of the most influential figures in the history of mathematics.&nbsp; Pythagoras is best known for the Pythagorean Theorem, which states that the square of the hypotenuse on a right triangle is equal to the sum of the squares of the other two sides.</em></p>



<p class="has-text-align-center has-medium-font-size wp-block-paragraph"><em>a² + b² = c²</em></p>



<p class="has-medium-font-size wp-block-paragraph"><em>Euclid wrote a comprehensive description of geometry. The proofs in Euclid&#8217;s &#8220;Elements&#8221; are known for their logical rigor and their systematic approach.&nbsp; In this essay I propose that the Euclidean method can be explained using numerical analysis</em>.</p>



<h4 class="wp-block-heading">Numerical analysis concisely verifies mathematical propositions</h4>



<p class="wp-block-paragraph">One of Euclid&#8217;s proofs is a statement of the <em>Pythagorean Theorem.&nbsp; </em>However, reading through <em>Proposition 47, </em>we find that it relies on earlier propositions, which themselves rely on earlier postulates.&nbsp; This can make it difficult to understand the underlying logical principles.&nbsp; There&#8217;s a lot to unpack here.&nbsp; We can use the Pythagorean Theorem and numeric values to provide an intuitive understanding of one of Euclid&#8217;s propositions.</p>



<h4 class="wp-block-heading">Acute triangles and squared sides by the numbers</h4>



<p class="wp-block-paragraph">Let&#8217;s look at <em>Book 2, Proposition 13</em> using numerical analysis. Referring to <strong>Figure 1</strong> (top) and part of the proof (below) we can simply state that (AB)² = (BD)² + (DA)².</p>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="has-accent-color has-text-color has-link-color has-medium-font-size wp-elements-c860b40124e50730081c36c395977411 wp-block-paragraph">But the square on AB is equal to the squares on BD, DA, for the angle at D is right.</p>



<p class="has-small-font-size wp-block-paragraph">— Elements, Book 2 Proposition 13</p>
</blockquote>



<p class="wp-block-paragraph">To substitute numeric values we can use <em>Pythagorean triples</em>, which are sets of three positive integers that satisfy the equation (<strong>Table 1</strong>).&nbsp; To make this work, we&#8217;ll divide one of those sets by two.&nbsp; This way we get numbers that are, if not whole, easy to work with.</p>



<figure class="wp-block-table"><table class="has-fixed-layout"><tbody><tr><td class="has-text-align-center" data-align="center">a</td><td class="has-text-align-center" data-align="center">b</td><td class="has-text-align-center" data-align="center">c</td></tr><tr><td class="has-text-align-center" data-align="center">5</td><td class="has-text-align-center" data-align="center">12</td><td class="has-text-align-center" data-align="center">13</td></tr><tr><td class="has-text-align-center" data-align="center">7</td><td class="has-text-align-center" data-align="center">24</td><td class="has-text-align-center" data-align="center">25</td></tr><tr><td class="has-text-align-center" data-align="center"><mark style="background-color:var(--base-3)" class="has-inline-color">3.5</mark></td><td class="has-text-align-center" data-align="center">12</td><td class="has-text-align-center" data-align="center">12.5</td></tr></tbody></table><figcaption class="wp-element-caption">Table 1: Pythagorean triples. The bottom row are values from the second, divided by two.</figcaption></figure>



<p class="wp-block-paragraph">Finally, we add the sets of numbers from <strong>Table 1 </strong>to <strong>Figure 2</strong>, and conclude with a re-statement of<em> Book 2, Proposition 13</em>.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="800" height="600" src="https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13-B.png" alt="Figure 2 labeled acute triangle" class="wp-image-632" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13-B.png 800w, https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13-B-300x225.png 300w, https://blog.forth-media.com/wp-content/uploads/2024/09/euclid-proposition-2-13-B-768x576.png 768w" sizes="auto, (max-width: 800px) 100vw, 800px" /><figcaption class="wp-element-caption">Figure 2</figcaption></figure>
</div>


<h4 class="wp-block-heading">From logical to numeric reasoning</h4>



<blockquote class="wp-block-quote is-layout-flow wp-block-quote-is-layout-flow">
<p class="has-accent-color has-text-color has-link-color has-medium-font-size wp-elements-9d36f97c9950e2383388d5eebfa9262f wp-block-paragraph">In acute-angled triangles the square on the side subtending the acute angle is less than the squares on the sides containing the acute angle by twice the rectangle contained by one of the sides about the acute angle, namely that on which the perpendicular falls, and the straight line cut off within by the perpendicular towards the acute angle.</p>



<p class="has-small-font-size wp-block-paragraph">— Elements, Book 2 Proposition 13</p>
</blockquote>



<p class="wp-block-paragraph">Following along with Euclid&#8217;s argument, we have an acute triangle ABC, and have drawn a perpendicular from point A to point D.&nbsp; (<strong>Figure 2</strong>). Observe that the perpendicular is a common side on two right triangles inside ABC, and that the value 12 is assigned to it.&nbsp; (We have factored the table of Pythagorean triples to make it so.)</p>



<p class="wp-block-paragraph">The proposition says that the square of side (AC) is less than the squares on the sides (CB) and (BA) by twice the value contained by (CB), (BD).&nbsp; Or</p>



<p class="has-text-align-center wp-block-paragraph">(12.5)² is less than (8.5)² plus (13)² by 2 times (8.5 x 5)</p>



<p class="wp-block-paragraph">Re-arranging terms</p>



<p class="has-text-align-center wp-block-paragraph">241.25 &#8211; 156.25 = 85</p>



<p class="wp-block-paragraph"><em>Therefore, in this acute triangle the square of the hypotenuse subtending angle B (156.25) is less than squares on the sides (241.25) by 2 x (8.5 x 5).</em></p>



<p class="wp-block-paragraph"><em>Q.E.D.</em></p>



<hr class="wp-block-separator has-alpha-channel-opacity"/>



<h6 class="wp-block-heading">Appendix A: Meta Llama 3.1 B-Instruct (open-source)</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;The square of the hypotenuse (156.25) is actually greater than the sum of the squares on the sides (241.25) by 85&#8221;. False.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="574" src="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-llama-response-640.png" alt="CoT prompt llama" class="wp-image-636" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-llama-response-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/09/cot-llama-response-640-300x269.png 300w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix B: Microsoft Phi 3.5 mini-instruct (open-source)</h6>



<p class="has-small-font-size wp-block-paragraph">&#8220;… the sum of the squares on sides BD (8.5) and CB (13) by a factor of 2 times the product of the perpendicular (8.5)…&#8221; Orthographic errors. Incorrect numeric and label substitution.</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="654" src="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-phi-response-640.png" alt="CoT prompt phi" class="wp-image-637" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-phi-response-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/09/cot-phi-response-640-294x300.png 294w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>


<h6 class="wp-block-heading">Appendix C: OpenAI ChatGPT (online)</h6>



<p class="has-small-font-size wp-block-paragraph">No issues</p>



<figure class="wp-block-image size-full"><img loading="lazy" decoding="async" width="640" height="1007" src="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-chatgpt-response-640.png" alt="CoT prompt chatgpt4" class="wp-image-638" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-chatgpt-response-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/09/cot-chatgpt-response-640-191x300.png 191w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>



<h6 class="wp-block-heading">Appendix D: Google Gemini (online)</h6>



<p class="has-small-font-size wp-block-paragraph">No issues</p>


<div class="wp-block-image">
<figure class="aligncenter size-full"><img loading="lazy" decoding="async" width="640" height="686" src="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-gemini-response-640.png" alt="CoT prompt gemini" class="wp-image-639" srcset="https://blog.forth-media.com/wp-content/uploads/2024/09/cot-gemini-response-640.png 640w, https://blog.forth-media.com/wp-content/uploads/2024/09/cot-gemini-response-640-280x300.png 280w" sizes="auto, (max-width: 640px) 100vw, 640px" /></figure>
</div>]]></content:encoded>
					
					<wfw:commentRss>https://blog.forth-media.com/2024/09/14/ai-asking-the-right-questions/feed/</wfw:commentRss>
			<slash:comments>0</slash:comments>
		
		
			</item>
	</channel>
</rss>
