iso27diy-corp/Corpus/Various/GGUF model for abstracts and categorization.md

3.3 KiB
Raw Blame History

tags
llm
mcp

Perplexity, 13 juni 2025

Prompt:`Is there a GGUF model that is particularly good at creating abstracts for documents in PDF format and assigning them to one of a number of predefined categories?

Top GGUF Models for Summarization and Categorization

  • Llama-Chat-Summary-3.2-3B-GGUF

A fine-tuned Llama 3.2 model optimized for context-aware summarization of long texts, documents, and conversations. It preserves critical points and creates concise summaries, making it ideal for abstracting lengthy reports or articles1.

  • Gemma 7B GGUF

A lightweight, efficient model designed for summarization, question answering, and reasoning. It supports long context lengths (up to 8192 tokens) and can generate accurate summaries suitable for document abstraction23.

  • Phi 3.5 Mini Instruct GGUF

Supports very long context lengths (up to 128K tokens), enabling summarization of large documents. Its multilingual and reasoning capabilities make it a strong candidate for document summarization and classification tasks4.

  • CausalLM-7B-GGUF

A versatile model capable of text summarization and content generation, which can be fine-tuned or prompted for categorization tasks as well5.

How to Use for PDF Documents

  • Extract text from PDFs using tools like pdfplumber or PyMuPDF.
  • Feed extracted text chunks into these GGUF models for summarization.
  • Use prompt templates or fine-tuning to classify summaries into your predefined categories.

Summary

Model Name Size Key Strengths Context Length Notes
Llama-Chat-Summary-3.2-3B 3.2B Context-aware summarization Moderate Fine-tuned for summarization
Gemma 7B GGUF 7B Summarization, reasoning 8192 tokens Lightweight, efficient
Phi 3.5 Mini Instruct GGUF 3.8B Long document summarization 128K tokens Handles very long texts
CausalLM-7B-GGUF 7B Summarization, content generation Moderate Versatile, fine-tunable

These GGUF models are currently among the best for summarization tasks and can be adapted for categorization with proper prompt design or fine-tuning. The Llama-Chat-Summary-3.2-3B-GGUF model is particularly focused on generating concise, context-aware abstracts1. For very long documents, Phi 3.5 Mini Instruct GGUFs extended context window is advantageous4.

If you want a ready-to-use model, start with Llama-Chat-Summary-3.2-3B-GGUF or Gemma 7B GGUF and implement classification via prompting or additional fine-tuning.