SemiKong: Pioneering the First Open-Source LLM for Semiconductor Industry
  In January 2025, we had the opportunity to evaluate SemiKong, the first open-source Large Language Model (LLM) specifically designed…
  Jan 5
Optimizing Prompt Formats for Large Language Models: A Comparative Study of JSON, Plain Text, and…
  As Large Language Models (LLMs) become increasingly central to how we process and analyze information, understanding how to communicate…
  Dec 1, 2024
Integrating PandasAI with LM Studio Local Models for Stock Data Analysis: Evaluating AI-Assisted…
  Introduction
  Jul 14, 2024
Pushing the Boundaries: From Vanilla Autoencoders to Meta-Learned Hypernetwork Architectures
  Introduction:
  Mar 27, 2024
Deep Dive into Deep Learning: Layers, RMSNorm, and Batch Normalization
  Introduction:
  Mar 14, 2024
Manipulating and Visualizing Images with Python: A Practical Guide
  Introduction:
  Mar 10, 2024
Decoding the Key-Query-Value Mechanism in Transformer Models thru a deep discussion with Claude AI
  Before starting this article, I want to introduce a presentation that Akshay Pachaar shared in a LinkedIn post:
  Sep 29, 2023
Exploring Different Methods for Calculating Kullback-Leibler Divergence (KL_divergence) in…
  Introduction
  Sep 23, 2023
“Unlocking the Full Potential of LLM Chatbots: Elevating User Interaction Through Smart Engagement”
  Part 2 (second article): ChatGPT3.5 plays roles
  Sep 17, 2023
“Unlocking the Full Potential of LLM Chatbots: Elevating User Interaction Through Smart Engagement”
  Part 1: Human-chatbot interaction through smart prompting
  Sep 17, 2023