Fading Coder

One Final Commit for the Last Sprint

Adapting Large Language Models: In-Context Learning, Fine-Tuning, and RLHF

Contextual Learning and IndexingModern generative Large Language Models (LLMs) demonstrate contextual learning capabilities, allowing them to perform new tasks without weight updates. By providing a few examples within the input prompt, the model can infer the desired pattern and generate appropriat...

Practical Guide to Running Large Language Models Locally

Understanding Large Language Models Large Language Models (LLMs) have become a focal point in artificial intelligence since the emergence of ChatGPT. These models can process and generate human-like text, enabling capabilities such as text generation, question answering, conversation, and document s...

Core Concepts and Architectures in NLP and Large Language Models

Natural Language Processing (NLP) enables computational systems to interpret and generate human language. Key tasks include text classification for spam filtering, sentiment analysis for social media monitoring, machine translation, automatic summarization, generative text creation, conversational a...

Setting Up opencode with oh-my-opencode for Local AI-Powered Development

Due to recent performance throttling observed with certain AI coding assistants when used alongside domestic large language models, a shift toward fully self-hosted and controllable alternatives became necessary. Among the available options, opencode—an open-source AI coding agent—emerged as a promi...

High-Performance LLM Deployment and Quantization Guide with LMDeploy

Environment Configuration To begin working with LMDeploy, a compatible development environment must be established. This process involves setting up a Conda environment and installing the necessary dependencies. Creating the Conda Environment Initialize a new Conda environment named llm-inference wi...

Leveraging Prompt Patterns for Effective ChatGPT Programming

Core Concepts of Prompt Patterns Prompt patterns serve as reusable templates that structure interactions with large language models (LLMs). They establish clear conventions for communication, enabling consistent and predictable model behavior. The catalog organizes these patterns into five functiona...

Deploying and Managing Local Large Language Models with Ollama

Ollama functions as a streamlined framework designed to facilitate the deployment of Large Language Models (LLMs) within containerized environments. By bundling model weights, configuration parameters, and data into a unified package known as a Modelfile, it abstracts away complex setup procedures i...

Structuring LLM Prompts for Reliable Formatted Outputs and Application Integration

Core Prompt Components An effective prompt can be decomposed into four distinct elements: Directive: Defines the exact operation the model should execute. Payload: Supplies the specific content or data to be processed. Context: Provides environmental constraints, rules, or background details. Schema...

Debugging Unexpected LLM Invocations in Dify's Knowledge Retrieval Node

Context A Dify instance was deployed locally on a GPU server, integrated with Xinference hosting the THUDM/glm-4-9b-chat model. A RAG workflow was built using Dify’s default task flow template and a local knowledge base. During chat execution, the knowledge retrieval node failed with an error: Model...

Implementing Conversational Memory in Large Language Models

Large language models inherently lack memory, as they process each input independently without retaining context from previous interactions. In practical applications, memory is simulated by appending the entire conversation history to each new prompt, allowing the model to generate contextually rel...