rathwjj's blog

Certification

2025-06-03T14:00:38Z

I love learning about programming languages, but I don’t think I’d enjoy working full-time as a programmer.

I’m also passionate about AI and large language models (LLMs), but I don’t consider myself a strong machine learning programmer either.

What truly drives me is learning new skills to fuel my passion for problem-solving. I’m especially interested in areas like Universal Access, Automation, Domotics, and IoT. I believe these are foundational skills I need to acquire. By improving my core abilities, I hope to create better solutions — and in the long run, help my customers achieve better outcomes.

And thank you — especially if you are one of my customers.

Healthcare Automation | Large-Scale Data Systems | Transformation Consultant | AI Learner for Smarter Workflows

2025-05-14T17:23:52Z

This is collection of articles on My LinkedIn:

What Are Automation and Digital Transformation?

In the modern business landscape, many new ideas are constantly being explored to improve efficiency and competitiveness. Two powerful approaches that I’ve worked closely with are Automation and Digital Transformation.

What is Automation?

Automation, or more precisely process automation, refers to the use of technology to streamline workflows across departments with minimal manual intervention. The goal is to reduce friction between units, eliminate repetitive tasks, and increase overall operational efficiency.

By integrating software and hardware solutions, automation supports frontline staff, administrative functions, and management teams alike. The result is faster execution, fewer errors, and lower operational costs.

Examples include:

Automating data entry between systems
Setting up alerts and workflows for approvals
Using bots to manage routine customer service tasks

What is Digital Transformation?

Digital Transformation goes beyond automating individual tasks. It’s the strategic use of digital technologies to fundamentally change how an organization operates and delivers value. This includes improving Confidentiality, Integrity, and Availability (the CIA triad) of data and services.

Digital transformation often leverages:

Cloud computing for scalable infrastructure
Mobile services for better accessibility
AI and machine learning to enhance decision-making and personalization

It enables seamless collaboration across departments and improves interactions with customers by providing more consistent, data-driven, and accessible services

Office Automation vs. Healthcare Automation

Office automation typically focuses on internal processes such as resource management, scheduling and appointments, document handling, reporting and analytics, and security management. In contrast, healthcare automation spans a broader range of activities—from data collection and patient monitoring via various sensors to inventory control and clinical resource management.

In recent years, healthcare automation has advanced significantly, particularly in diagnostics and treatment. With the help of machine learning and digital tools, more automation is being integrated into clinical workflows. Despite challenges such as limited resources, the aim remains the same: to reduce friction at every step and improve both efficiency and care quality.

Do You Need Healthcare Automation?

No—you don’t need it… if your current process is flawless.

That means:

You experience zero delays or errors.
Your team has enough time and resources to handle every task smoothly.
Your budget allows for optimal efficiency without compromise.

If that’s your reality, then healthcare automation is just another tool—not a necessity.

But for most organizations, automation can offer critical improvements in reliability, consistency, and workload reduction. It’s not about replacing people—it’s about supporting them to do better work.

A Practical Example: Hemodialysis Automation

Based on my experience of over ten years in hemodialysis, here’s a simplified example of how automation can improve care delivery.

Patients typically come for dialysis two or three times a week. The process includes:

Patient Identification: Using ID or hospital number (HN ID) upon arrival.
Pre-treatment Checks: Nurses record the patient’s weight, blood pressure, and temperature, comparing them to the previous session.
Machine Preparation: Equipment is set up, and if anomalies appear in patient records, nurses consult with a doctor.
During Dialysis: Nurses monitor the dialysis machine every 15 minutes. They may need to administer iron, zinc, or other minerals to address nutrient deficiencies. Sugar and sodium levels must be continuously monitored to ensure the patient’s safety and progress.

Now, how many of these steps could be improved—or even automated?

Smart patient identification can reduce clerical errors.
Automated vitals monitoring can ensure consistency and flag issues in real-time.
AI-driven decision support can help nurses and doctors act faster with more accurate data.
IoT-enabled dialysis machines can log performance and patient reactions continuously without manual input.

Better processes lead to better outcomes—for both staff and patients. Tools like IoT devices, machine learning, and digital sensors are not luxury add-ons—they are part of a sustainable and cost-effective future.

Even if you can’t replace your medical instruments overnight, integrating smart technologies can extend equipment lifespan, reduce operational costs, and most importantly, improve patient care.

Large-Scale Data vs. Big Data

What are “large-scale data” systems, and how do they differ from “big data” systems?

In short, big data refers to all types of data—structured, semi-structured, and unstructured. It emphasizes the variety, volume, and velocity of data from diverse sources. Meanwhile, large-scale data typically refers to structured data that accumulates continuously at a high rate. In such systems, analysis often relies on capturing snapshots of the data rather than processing it all in real time due to its size and complexity.

Healthcare Data Management

Traditionally, healthcare data is managed in structured database systems. Most patient records, lab results, and medical histories are stored in well-defined formats. Even non-textual data like X-rays, CT scans, and video monitoring footage can be considered structured in this context, as the expected data patterns and formats are known and consistent.

In many cases, managing this data is straightforward, and third-party database software solutions are sufficient for traditional healthcare needs.

However, there’s a growing trend toward integrating artificial intelligence into healthcare data systems. This includes combining data from multiple departments or systems to support advanced analytics, diagnostics, and decision-making. As a result, in-house data management strategies are becoming more important for handling integration, security, and performance.

One major concern in modern healthcare data management is the handling of Personally Identifiable Information (PII). When sharing data with third parties—for research, marketing, or inventory analysis—it’s crucial to address privacy concerns and comply with regulations. This is especially important when publishing or outsourcing healthcare data.

Anonymization vs. Authentication

Anonymization and authentication are both important for protecting private information, particularly in sensitive domains like healthcare. While they serve complementary goals, their concepts and implementations are fundamentally different.

What Is Anonymization?

Anonymization is the process of permanently removing or modifying personal identifiers from data so that individuals cannot be identified—directly or indirectly. Once anonymized, the data cannot be traced back to a specific person.

In healthcare, anonymization is especially important when patient data is shared outside the original care team—for example:

When lab results are sent to third-party testing services
When data is used for clinical research
During inter-hospital patient transfers or referrals

How Is Healthcare Data Anonymized?

There are several techniques used to anonymize healthcare data:

Removal of direct identifiers: such as names, ID numbers, phone numbers, or addresses
Generalization of data: for example, replacing exact birthdates with age ranges
Pseudonymization: replacing identifiable information with a pseudonym (e.g., patient ID codes) that allows data to be linked without revealing the actual identity

What Is Pseudonymization?

Pseudonymization is a privacy-enhancing technique where personal identifiers are replaced with coded values or artificial identifiers. While the data is no longer directly identifiable, it can still be linked back to the individual if necessary—under strict controls.

This method is widely used in medical research and patient tracking scenarios. When combined with artificial intelligence, pseudonymized data can be safely analyzed and used without compromising patient privacy. It also helps reduce costs and improve operational efficiency in large-scale healthcare systems.

What Is Authentication?

Authentication, on the other hand, is the process of verifying the identity of a user or device before granting access to systems, applications, or data. It ensures that only authorized individuals can access sensitive information.

Typical authentication methods include:

Passwords or PINs
Biometric scans (e.g., fingerprints or facial recognition)
Two-factor or multi-factor authentication

Summary

Anonymization protects data after collection, ensuring it can be shared or analyzed without exposing identities.
Authentication protects data before access, ensuring only authorized users can reach sensitive systems.

Both are essential components of a secure and privacy-respecting data management strategy, especially in healthcare environments where data is both critical and highly sensitive.

Artificial Intelligence, Internet of Things, and Process Automation in Healthcare

Traditionally, healthcare staff were responsible for manually recording all patient measurements into hospital databases—a time-consuming and error-prone task. Today, however, an increasing number of medical devices can transmit data directly to software systems, reducing manual effort and improving accuracy.

Despite these advancements, many devices still cannot be replaced or upgraded. This is often due to budget constraints, legacy infrastructure, or specific clinical requirements.

How AI and IoT Help Bridge the Gap

The Internet of Things (IoT), combined with camera technologies and artificial intelligence (AI), has enabled innovative ways to retrofit existing medical equipment. These solutions allow data to be captured and transmitted even from devices that lack built-in digital connectivity.

However, using cameras and sensors raises important privacy concerns, especially when capturing patient-related data.

The Role of Anonymization and Pseudonymization

To address these concerns, anonymization and pseudonymization techniques are applied. One effective approach is one-time, token-based pseudonymization, which replaces identifiable information with a non-reversible token. This ensures that data cannot be traced back to the individual, protecting patient privacy while still allowing meaningful analysis.

ChatGPT API

2025-05-06T15:17:25Z

After I did test many database with ChatGPT I would like to share the “ChatGPT API”.

I put the image version in LinkedIn, and I would like to put the full version here.

Platform / API	Description	Free / Paid
OpenAI API
chat/completions	ChatGPT-style conversation	Paid
completions	Legacy GPT-3 completions	Paid
embeddings	Text embeddings	Paid
moderations	Content filtering	Free
audio/transcriptions	Speech to text (Whisper)	Paid
audio/translations	Audio to English	Paid
images/generations	DALL·E image generation	Paid
images/edits, images/variations	Image editing/variations	Paid
fine-tuning	Custom model tuning	Paid
assistants	Tool-integrated AI assistant	Paid
threads	Manage chat sessions	Paid
files	File uploads for assistant	Paid
function calling / tool use	Execute external functions	Paid
ChatGPT Web App
GPT-3.5 (default model)	Basic chatbot	Free
GPT-4 (gpt-4-turbo)	Advanced reasoning model	Paid
Code Interpreter	Python tool / Data analysis	Paid
DALL·E image generation	Generate images from prompts	Paid
Browsing tool	Live web access	Paid
Memory	Remembers user preferences	Paid
File upload and analysis	Understand uploaded documents	Paid
Custom GPTs	Create personal assistants	Paid
Other Platforms and Integrations
Microsoft Copilot	GPT in Word, Excel, etc.	Paid
Azure OpenAI API	OpenAI access via Azure	Paid
LangChain / SDKs	Tooling frameworks	Depends (Usage Paid)

OpenAI API Endpoints (Free vs. Paid)
API Service	Endpoint	Description	Free / Paid
Chat Completions	POST /v1/chat/completions	Generates conversational responses	Paid
Completions (Legacy)	POST /v1/completions	Generates text completions using legacy models	Paid
Embeddings	POST /v1/embeddings	Generates vector embeddings for text	Paid
Moderations	POST /v1/moderations	Classifies content for policy violations	Free
Audio – Transcriptions	POST /v1/audio/transcriptions	Transcribes audio to text using Whisper	Paid
Audio – Translations	POST /v1/audio/translations	Translates audio to English text	Paid
Images – Generations	POST /v1/images/generations	Creates images from text prompts	Paid
Images – Edits	POST /v1/images/edits	Edits images using text instructions	Paid
Images – Variations	POST /v1/images/variations	Generates variations of images	Paid
Fine-tuning	POST /v1/fine-tunes	Creates fine-tuning job for models	Paid
List Fine-tunes	GET /v1/fine-tunes	Lists fine-tuning jobs	Paid
Retrieve Fine-tune	GET /v1/fine-tunes/{id}	Retrieves fine-tune job status	Paid
Cancel Fine-tune	POST /v1/fine-tunes/{id}/cancel	Cancels a fine-tune job	Paid
Upload File	POST /v1/files	Uploads a file for use	Paid
List Files	GET /v1/files	Lists all uploaded files	Paid
Retrieve File	GET /v1/files/{file_id}	Retrieves file info	Paid
Delete File	DELETE /v1/files/{file_id}	Deletes a file	Paid
Create Assistant	POST /v1/assistants	Creates an assistant	Paid
Retrieve Assistant	GET /v1/assistants/{id}	Fetch assistant details	Paid
Update Assistant	POST /v1/assistants/{id}	Updates an assistant	Paid
Delete Assistant	DELETE /v1/assistants/{id}	Deletes an assistant	Paid
List Assistants	GET /v1/assistants	Lists assistants	Paid
Create Thread	POST /v1/threads	Starts a conversation thread	Paid
Retrieve Thread	GET /v1/threads/{id}	Gets thread info	Paid
Delete Thread	DELETE /v1/threads/{id}	Deletes a thread	Paid
Create Message	POST /v1/threads/{thread_id}/messages	Adds message to thread	Paid
List Messages	GET /v1/threads/{thread_id}/messages	Lists messages in thread	Paid
Create Run	POST /v1/threads/{thread_id}/runs	Starts assistant processing	Paid
Retrieve Run	GET /v1/threads/{thread_id}/runs/{run_id}	Gets run info	Paid
List Runs	GET /v1/threads/{thread_id}/runs	Lists thread runs	Paid
Cancel Run	POST /v1/threads/{thread_id}/runs/{run_id}/cancel	Cancels a run	Paid
List Run Steps	GET /v1/threads/{thread_id}/runs/{run_id}/steps	Lists run steps	Paid
Retrieve Run Step	GET /v1/threads/{thread_id}/runs/{run_id}/steps/{step_id}	Gets step detail	Paid

Result of testing LLM for my old projects.

2025-05-05T14:19:44Z

AI is the new electricity and will transform and improve nearly all areas of human lives.
This is the theme of DeepLearning.ai. The one site that I took for many courses about AI lately.

I think I understand more about AI, so this is some parts of conclusion that I want to share.

Good Transform result need quite a lot of compute power. If you have not much of continuous work to transform, using Public AI may get you better result, with cheaper cost.
Some of business need to comply PII. I recommended doing local LLM for anonymization. eg, liquification and health related. In case you have budget enough. Doing rent part of large Public AI LLM to be Private AI LLM may still cheaper than provide all structure by yourself.
One more conclusion I get from my test project, I need to learn docker. As I use to work in older environment, understand better in docker help a lot.

Note: I use to works in docker environment when work with HAAS (Home Assistant) before. However HAAS work in local environment and no need to do cloud part. I took two courses in this for compensate that kind of lack of knowledge.

For my lesson learn on this, I think the kind of work I am looking for now related some level to AI, as know AI should be in most part of the work. Need some advanced level of database and network knowledge. Applying on cloud-based and need a lot of dedication.

I will put this to LinkedIn too in case some one want to learn more about this.

AI train (myself) and test (AI train).

2025-05-01T13:51:55Z

I am learning “Artificial Intelligent” LLMs from Cisco and Deeplearning.AI, and want to know if this can apply on my old project.

I had 2 large-scale database projects: one was from dialysis, and another was from liquidation project. both took me around 2-3 years to finish them. I wish to see if I can use LLMs to do these projects in more efficient ways. I know all step to do that manually, however for pretrain, and pair program with database more new ways of work need to be explored.

So don’t wonder if you see me take a lot of AI courses, I still testing and learning how to make this faster than I was done that. If you still see I took a lot of courses that mean I still learn to adapt and want to be more efficient.

For my profile please check: https://www.linkedin.com/in/rathwjj/

ground hog day (revisited)

2025-04-26T02:53:02Z

I loved “ground hog day” (movie) a lot.

I remembered mentioned about that several times. and now again I will mention about this again.

The situation that you can do nothing. Only one way left is to improve yourself.

You don’t know in long run that improvement will help or not.

However you still improve yourself.

I need to mention about Database Analysis skill. I have some background in this. (18 years +).
Learning new database skill is good. I know how my knowledge lack behind from SPSS day to tableau.

Moreover I have background in python as here and there, python quite everywhere. Still I see that I have a lot to learn when enrolled on course. Many thing you think you know but the knowledge always update. You learn new thing even with the old knowledge you think you have quite well experience.

General Workflow to Run and Fine-Tune a Local LLM

2025-04-21T13:28:44Z

I did enroll in many LLM courses for confirm this. I see that I needed to understand step by step more than show all the step, below is what summary on each step.

Step 1: Set Up Your Environment.
Choose your hardware
Install dependencies
Set up GPU acceleration (CUDA) (optional). If you use Mac M series or Arm based this may be not possible.

Step 2: Choose Your Model.
There were a lot of Pretrained model that you can choose, choose both model and parameters size (B).

eg. LLaMA (Meta), Mistral / Mixtral, Falcon, Gemma (Google), Phi (Microsoft).

Step 3: Test the Model (Inference Only).

Step 4: Prepare Your Dataset (for training).

Step 5: Choose Training Method

eg. Full fine-tuning, quantized, Parameter-Efficient Fine-Tuning (PEFT).

Step 6: Fine-Tuning the Model

Step 7: Train (by the dataset in step 4).

Step 8: Save + Use Your Fine-Tuned Model

And then back to Step 3: Test the Model (Inference Only).

simplified flowchart

You will continue to do prepare new Data set (step 4) and continue to step 8 and back to step 3 again until the result suit you.

Portfolio 2025

2025-04-16T00:58:38Z

Update version of Portfolio.

LLaMA 3.x Deep Dive: Full Comparison, Best Use Cases & Deployment Strategy

2025-04-11T02:57:56Z

Before I forget I want to talk about model with B.

“B” in model names (like 7B, 70B) signifies billion. It indicates the number of parameters (weights and biases) in the model. A larger number of parameters (e.g., 70B) generally means a larger and more complex model with a greater capacity to learn and produce sophisticated outputs, but also requires more resources to train and run.

We didn’t point anything about LLAMA 3.3 yet so now we will head on LLAMA 3.3 first.

LLaMA 3.3

Pros

Instruction-tuned: follows prompts better than earlier versions.
128K token context: excellent for long conversations or document summarization.
Multilingual: Supports English, Spanish, German, French, Hindi, Thai, etc.
Resource efficiency: Competes with LLaMA 3.1 405B, but runs on much less hardware.
Open weights: Available for local hosting and fine-tuning.

Cons

Only available in 70B (as of now): No lightweight 13B or 7B options.
Higher system requirements: 64GB RAM and ~24GB VRAM minimum.
Limited community optimization: Since it’s newer, fewer extensions/quantizations exist yet.

Note:

It claims multilingual support, but fine-tuning on other languages still may be necessary for fluency.
While LLaMA 3.3 is efficient for its size, it’s still heavy for many local users.
Open weights encourage local use, but only a 70B version limits accessibility.

Version	Key Model Sizes (B)	Pros	Cons	Best For
3.0	8 / 65	Simple	Lack optimize	early experiment.
3.1	13/ 70	Improved alignment, multitasking	More resource need, more complex.	Chatbots, general assistants
3.2	13/ 70	Code performance boost	Slight more memory usages.	Coding, dev copilots, Token based.
3.3	70 (instruction- tuned)	Multilingual, 128k context, code support, resource-optimized	Resource usages. Still lack lower model.	Long documents, multilingual agents, enterprise

Note: If you’re just starting out or want something smaller, LLaMA 3.1 or 3.2 at 13B still offer excellent performance for local use.

Best Deployment Options for LLAMA 3.3

Deployment Type	Ideal When	Notes
Local Deployment	Need full control, offline use, or high privacy	Use Ollama or LM Studio for hosting
Cloud API (AWS/Novita)	Want quick deployment, don’t have local GPU	Scales faster but less control
Edge Deployment (Quantized)	Low-power hardware	Use `gguf` format + llama.cpp

Fine-Tuning & Optimization

Use Unsloth or QLoRA for memory-efficient fine-tuning
Recommended to run quantized (4-bit or 5-bit GGUF/Generative Generalized Universal Framework ) for local use
Apply FlashAttention 2 or PagedAttention for better throughput

Enterprise-Grade Local Use

If you’re an organization needing strict control over data:

Local LLaMA 3.3 + Air-Gapped System = Ideal for healthcare, finance, legal
Use embedding + retrieval pipeline for private knowledge base agents
Encrypt local disk/cache and apply sandboxing (e.g., Docker, Firejail)

Note: Generative Generalized Universal Framework reference.

Choosing the Right AI Engine: What You Need to Know Before Training

2025-04-10T03:10:29Z

There are several factors you need to consider before starting to train an AI engine.

One of the most important is the engine itself — including its version and model. While the engine can be updated or changed later, selecting the right one from the start can make your training process smoother and your operations more efficient.

For my setup, I chose Ollama as the open-source engine portal. It’s important to understand that not all AI processes are created equal — your needs for local AI may differ significantly depending on your specific function. For example, data cleanup and data processing consume different amounts of resources.

Having a clear understanding of these differences can save you time, prevent bottlenecks, and help ensure a successful training process.

As 2025-03-15.

Version: LLaMA 3.0
Model Sizes: 8B / 70B

Pros:
Solid baseline performance in text generation.
Efficient and lightweight compared to later versions.
Accessible for many hardware setups.

Cons:
Limited context window (e.g., shorter memory in conversations or documents).
No multimodal capability (text-only).
No advanced reasoning or tool-calling abilities.
Less multilingual coverage.

Note:
Great starting point for experimentation and understanding transformer-based LLMs.
Works well for general use, like summarization, chat, or translation, with low cost.

Short context limits use in legal/academic analysis.
Lacks competitive features like function calling or memory.
Can’t be integrated into multimodal workflows (e.g., images + text).

Version: LLaMA 3.1
Model Sizes: 70B / 405B

Pros:
Extended context window (up to 128K tokens).
Improved multilingual support (trained with 8% multilingual tokens).
Tool-use readiness: Function calling and agent optimization.
Excellent reasoning ability (per benchmark tests like MMLU / Massive Multitask Language Understanding).

Cons:
High resource demand (especially 405B).
Still lacks multimodal capabilities (text-only).
Limited real-world tool integrations out-of-the-box (requires engineering).

Note:
Long context enables better document understanding and continuous conversations.
Tool use (e.g., calling APIs) makes it closer to AI agent frameworks.
Multilingual improvement makes it usable globally.

You need enterprise-level GPUs or clusters for 405B — not suitable for most local deployments.
Despite function calling, it doesn’t yet natively support all agent behaviors like memory chaining or retrieval-augmented generation (RAG).

Marketed for tool use, but actual implementation requires external scaffolding (e.g., LangChain).

Version: LLaMA 3.2
Model Sizes: 1B / 3B / 11B / 90B

Pros:
Multimodal support (text + image input).
Mobile & edge optimized (1B, 3B).
High-resolution image handling (up to 1120×1120).
Lightweight deployment options for phones and IoT.

Cons:
Limited documentation and benchmarks.
Multimodal models still under testing in many platforms.
1B/3B models lack deep reasoning power.
Limited fine-tuning resources available at this point.

Note:
Opens doors to multimodal workflows — chat with images, visual document Q&A, etc.
Makes AI possible on small devices and real-time environments.
Ideal for apps, on-device copilots, or smart cameras.

Edge-ready models compromise deep understanding for speed.
Hard to scale for large business logic unless paired with server-based inference.
Promoted as “mobile ready,” yet the image processing resolution suggests heavier needs in memory and power.
High-res image input but limited memory in small models can lead to failure in vision-based reasoning.

Cross-Version Note.

Smaller is better vs bigger is better Small models (1B–8B) are efficient but often underperform in complex reasoning. Larger models (70B–405B) are better at logic and context but require expensive hardware.
Tool readiness vs real integration 3.1 promotes tool use, but it still needs external frameworks like LangChain or LlamaIndex to fully realize this.
Multilingual improvement vs global usability While multilingual token percentage increased in 3.1, it’s still not fully fluent in low-resource or regional dialects.
Multimodal claims vs hardware limitations 3.2 claims edge-compatibility, yet high-res image support suggests mid-range devices may struggle.

3.0 = Best for learning and basic applications.

3.1 = Most powerful for deep context, multilingual tasks, and agent tooling (if you have the hardware).

3.2 = Cutting-edge for vision + text workflows, mobile apps, and embedded AI.

Choosing the right model depends on your goals, hardware, and level of integration needed.