iso27diy-corp/Corpus/Various/GGUF model for abstracts and categorization.md at 96cd8fea7b8014c49056d85f85520a919ff662bf

Richard Kranendonk 96cd8fea7b Cleaning up the Sparks folder

2026-05-18 09:31:41 +02:00

3.3 KiB

Raw Blame History

Top GGUF Models for Summarization and Categorization

Llama-Chat-Summary-3.2-3B-GGUF

A fine-tuned Llama 3.2 model optimized for context-aware summarization of long texts, documents, and conversations. It preserves critical points and creates concise summaries, making it ideal for abstracting lengthy reports or articles¹.

Gemma 7B GGUF

A lightweight, efficient model designed for summarization, question answering, and reasoning. It supports long context lengths (up to 8192 tokens) and can generate accurate summaries suitable for document abstraction²³.

Phi 3.5 Mini Instruct GGUF

Supports very long context lengths (up to 128K tokens), enabling summarization of large documents. Its multilingual and reasoning capabilities make it a strong candidate for document summarization and classification tasks⁴.

CausalLM-7B-GGUF

A versatile model capable of text summarization and content generation, which can be fine-tuned or prompted for categorization tasks as well⁵.

How to Use for PDF Documents

Extract text from PDFs using tools like pdfplumber or PyMuPDF.
Feed extracted text chunks into these GGUF models for summarization.
Use prompt templates or fine-tuning to classify summaries into your predefined categories.

Summary

Model Name	Size	Key Strengths	Context Length	Notes
Llama-Chat-Summary-3.2-3B	3.2B	Context-aware summarization	Moderate	Fine-tuned for summarization
Gemma 7B GGUF	7B	Summarization, reasoning	8192 tokens	Lightweight, efficient
Phi 3.5 Mini Instruct GGUF	3.8B	Long document summarization	128K tokens	Handles very long texts
CausalLM-7B-GGUF	7B	Summarization, content generation	Moderate	Versatile, fine-tunable

These GGUF models are currently among the best for summarization tasks and can be adapted for categorization with proper prompt design or fine-tuning. The Llama-Chat-Summary-3.2-3B-GGUF model is particularly focused on generating concise, context-aware abstracts¹. For very long documents, Phi 3.5 Mini Instruct GGUF’s extended context window is advantageous⁴.

If you want a ready-to-use model, start with Llama-Chat-Summary-3.2-3B-GGUF or Gemma 7B GGUF and implement classification via prompting or additional fine-tuning.

3.3 KiB Raw Blame History Unescape Escape

Top GGUF Models for Summarization and Categorization

How to Use for PDF Documents

Summary

3.3 KiB

Raw Blame History