Cleaning up the Sparks folder
This commit is contained in:
parent
eb610a79b6
commit
96cd8fea7b
78 changed files with 149 additions and 181 deletions
|
|
@ -0,0 +1,55 @@
|
|||
---
|
||||
tags:
|
||||
- llm
|
||||
- mcp
|
||||
---
|
||||
Perplexity, 13 juni 2025
|
||||
|
||||
Prompt:`Is there a GGUF model that is particularly good at creating abstracts for documents in PDF format and assigning them to one of a number of predefined categories?
|
||||
|
||||
### Top GGUF Models for Summarization and Categorization
|
||||
|
||||
- **Llama-Chat-Summary-3.2-3B-GGUF**
|
||||
|
||||
A fine-tuned Llama 3.2 model optimized for context-aware summarization of long texts, documents, and conversations. It preserves critical points and creates concise summaries, making it ideal for abstracting lengthy reports or articles[^3_1].
|
||||
|
||||
- **Gemma 7B GGUF**
|
||||
|
||||
A lightweight, efficient model designed for summarization, question answering, and reasoning. It supports long context lengths (up to 8192 tokens) and can generate accurate summaries suitable for document abstraction[^3_3][^3_6].
|
||||
|
||||
- **Phi 3.5 Mini Instruct GGUF**
|
||||
|
||||
Supports very long context lengths (up to 128K tokens), enabling summarization of large documents. Its multilingual and reasoning capabilities make it a strong candidate for document summarization and classification tasks[^3_4].
|
||||
|
||||
- **CausalLM-7B-GGUF**
|
||||
|
||||
A versatile model capable of text summarization and content generation, which can be fine-tuned or prompted for categorization tasks as well[^3_5].
|
||||
|
||||
### How to Use for PDF Documents
|
||||
|
||||
- Extract text from PDFs using tools like `pdfplumber` or `PyMuPDF`.
|
||||
- Feed extracted text chunks into these GGUF models for summarization.
|
||||
- Use prompt templates or fine-tuning to classify summaries into your predefined categories.
|
||||
|
||||
### Summary
|
||||
|
||||
| Model Name | Size | Key Strengths | Context Length | Notes |
|
||||
| :------------------------- | :--- | :-------------------------------- | :------------- | :--------------------------- |
|
||||
| Llama-Chat-Summary-3.2-3B | 3.2B | Context-aware summarization | Moderate | Fine-tuned for summarization |
|
||||
| Gemma 7B GGUF | 7B | Summarization, reasoning | 8192 tokens | Lightweight, efficient |
|
||||
| Phi 3.5 Mini Instruct GGUF | 3.8B | Long document summarization | 128K tokens | Handles very long texts |
|
||||
| CausalLM-7B-GGUF | 7B | Summarization, content generation | Moderate | Versatile, fine-tunable |
|
||||
|
||||
|
||||
These GGUF models are currently among the best for summarization tasks and can be adapted for categorization with proper prompt design or fine-tuning. The Llama-Chat-Summary-3.2-3B-GGUF model is particularly focused on generating concise, context-aware abstracts[^3_1]. For very long documents, Phi 3.5 Mini Instruct GGUF’s extended context window is advantageous[^3_4].
|
||||
|
||||
If you want a ready-to-use model, start with Llama-Chat-Summary-3.2-3B-GGUF or Gemma 7B GGUF and implement classification via prompting or additional fine-tuning.
|
||||
|
||||
[^3_1]: https://huggingface.co/Triangle104/Llama-Chat-Summary-3.2-3B-Q4_K_M-GGUF
|
||||
[^3_2]: https://www.reddit.com/r/LocalLLaMA/comments/1891o5m/whats_the_best_llm_for_summarization_of_long/
|
||||
[^3_3]: https://huggingface.co/LoneStriker/gemma-7b-GGUF
|
||||
[^3_4]: https://dataloop.ai/library/model/quantfactory_phi-35-mini-instruct-gguf/
|
||||
[^3_5]: https://www.aimodels.fyi/models/huggingFace/causallm-7b-gguf-thebloke
|
||||
[^3_6]: https://dataloop.ai/library/model/litellms_gemma-7b-gguf/
|
||||
[^3_7]: https://www.ibm.com/think/topics/gguf-versus-ggml
|
||||
[^3_8]: https://docs.vllm.ai/en/v0.9.0/features/quantization/gguf.html
|
||||
Loading…
Add table
Add a link
Reference in a new issue