by davila7
Converts PDF, DOCX, PPTX, images, and other documents to clean Markdown. Supports image OCR, audio transcription, and optional AI-enhanced image descriptions for better LLM processing.
1. Open the Claude chat interface
2. Click the "📋 Copy" button below
3. Paste into the Claude chat box and send
4. Type "Use the markitdown skill" to get started
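=== markitdown skill ===
Author: davila7
Description: Converts PDF, DOCX, PPTX, images, and other documents to clean Markdown. Supports image OCR, audio transcription, and optional AI-enhanced image descriptions for better LLM processing.
Usage:
1. Invoke the skill: "Use the markitdown skill"
2. Provide the relevant information: supply the parameters the skill requires
3. Review the result: the skill returns the processed output
Example: "Use the markitdown skill to help me analyze this code"
This method works for all Claude users and requires no additional tools.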
documentation
low
This skill converts a range of file formats to Markdown using Microsoft's MarkItDown tool.
MarkItDown is a Python tool that converts files and office documents to Markdown format.
# Install with all features
pip install 'markitdown[all]'
# Or install specific features
pip install 'markitdown[pdf,docx,pptx,xlsx]'
from markitdown import MarkItDown
md = MarkItDown()
result = md.convert("document.pdf")
print(result.text_content)
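To keep the converted text rather than print it, the result can be written to a `.md` file. The following is a minimal sketch and not part of the skill itself: the helper name `convert_to_file` and its `convert` parameter are assumptions introduced here so the function can be exercised with a stub; when `convert` is omitted it falls back to `MarkItDown().convert` as used above.

```python
from pathlib import Path


def convert_to_file(source, output_dir, convert=None):
    """Convert `source` and write the Markdown into `output_dir`.

    `convert` is any callable that returns an object with a
    `text_content` attribute (a hypothetical hook for testing);
    it defaults to MarkItDown().convert.
    """
    if convert is None:
        # Imported lazily so the helper also works with a stub converter.
        from markitdown import MarkItDown
        convert = MarkItDown().convert

    out_dir = Path(output_dir)
    out_dir.mkdir(parents=True, exist_ok=True)

    result = convert(str(source))
    # Name the output after the source file: document.pdf -> document.md
    target = out_dir / (Path(source).stem + ".md")
    target.write_text(result.text_content, encoding="utf-8")
    return target
```

Under these assumptions, `convert_to_file("document.pdf", "output/")` would write `output/document.md` and return its path.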
Use AI models via OpenRouter to generate detailed image descriptions:
from markitdown import MarkItDown
from openai import OpenAI

# OpenRouter provides access to 100+ AI models
client = OpenAI(
    api_key="your-openrouter-api-key",
    base_url="https://openrouter.ai/api/v1",
)
# Pass the client and a vision-capable model so images can be described
md = MarkItDown(
    llm_client=client,
    llm_model="anthropic/claude-sonnet-4.5",  # recommended for vision
)
result = md.convert("presentation.pptx")
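Hard-coding the API key is convenient for a demo but easy to leak. One possible alternative, sketched here, reads the key from the `OPENROUTER_API_KEY` environment variable used by the shell examples in this document. The helper name `openrouter_client_kwargs` is an assumption made for illustration, not something the skill provides.

```python
import os

OPENROUTER_BASE_URL = "https://openrouter.ai/api/v1"


def openrouter_client_kwargs(env=None):
    """Build keyword arguments for OpenAI(...) from the environment.

    `env` is any mapping; it defaults to os.environ.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENROUTER_API_KEY")
    if not api_key:
        # Fail early with a clear message instead of a later auth error.
        raise RuntimeError("OPENROUTER_API_KEY is not set")
    return {"api_key": api_key, "base_url": OPENROUTER_BASE_URL}


# Usage (requires the openai and markitdown packages):
#   from openai import OpenAI
#   from markitdown import MarkItDown
#   client = OpenAI(**openrouter_client_kwargs())
#   md = MarkItDown(llm_client=client, llm_model="anthropic/claude-sonnet-4.5")
```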
Convert multiple files efficiently:
python scripts/batch_convert.py papers/ output/ --extensions .pdf .docx
Convert and organize research papers:
python scripts/convert_literature.py papers/ output/ --organize-by-year --create-index
Enhanced PDF conversion with Microsoft Document Intelligence:
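from markitdown import MarkItDown

# Route conversion through an Azure Document Intelligence endpoint
md = MarkItDown(docintel_endpoint="https://YOUR-ENDPOINT.cognitiveservices.azure.com/")
result = md.convert("complex_document.pdf")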
Convert research papers to Markdown for easier analysis and note-taking.
Extract tables from Excel files into Markdown format.
Convert PowerPoint slides with AI-generated descriptions.
Process documents for LLM consumption with token-efficient Markdown.
Fetch and convert YouTube video transcriptions.
# Convert all PDFs in a directory
python scripts/batch_convert.py input_dir/ output_dir/ --extensions .pdf
# Recursive with multiple formats
python scripts/batch_convert.py docs/ markdown/ --extensions .pdf .docx .pptx -r
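The source of `scripts/batch_convert.py` is not shown here, so the following is only a sketch of how a batch conversion along these lines might work, not the script's actual implementation. The function names `find_inputs` and `batch_convert` and the injected `convert` callable are assumptions; in practice `convert` would be `MarkItDown().convert`.

```python
from pathlib import Path


def find_inputs(input_dir, extensions, recursive=False):
    """Return files under input_dir whose suffix is in extensions, sorted."""
    wanted = {ext.lower() for ext in extensions}
    root = Path(input_dir)
    candidates = root.rglob("*") if recursive else root.glob("*")
    return sorted(
        p for p in candidates if p.is_file() and p.suffix.lower() in wanted
    )


def batch_convert(input_dir, output_dir, extensions, convert, recursive=False):
    """Convert each matching file; return (written, failed) lists.

    `convert` is a callable returning an object with `text_content`
    (assumed to be MarkItDown().convert in real use).
    """
    in_root, out_root = Path(input_dir), Path(output_dir)
    written, failed = [], []
    for src in find_inputs(in_root, extensions, recursive):
        # Mirror the input layout under output_dir, swapping the suffix.
        dest = (out_root / src.relative_to(in_root)).with_suffix(".md")
        try:
            text = convert(str(src)).text_content
        except Exception as exc:  # keep going if one file fails
            failed.append((src, str(exc)))
            continue
        dest.parent.mkdir(parents=True, exist_ok=True)
        dest.write_text(text, encoding="utf-8")
        written.append(dest)
    return written, failed
```

Collecting failures instead of raising lets a long run finish even when one document is corrupt. Note that in this sketch two inputs sharing a stem (for example `a.pdf` and `a.docx`) would map to the same output file.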
# Convert with AI descriptions via OpenRouter
export OPENROUTER_API_KEY="sk-or-v1-..."
python scripts/convert_with_ai.py paper.pdf output.md --prompt-type scientific
# Use different models
python scripts/convert_with_ai.py image.png output.md --model anthropic/claude-sonnet-4.5
# Use custom prompt
python scripts/convert_with_ai.py image.png output.md --custom-prompt "Describe this diagram"
# Convert papers with metadata extraction
python scripts/convert_literature.py papers/ markdown/ --organize-by-year --create-index
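How `convert_literature.py` extracts metadata is not documented here. As a rough illustration of what organizing by year and creating an index could involve, the sketch below guesses a year from each filename and renders a Markdown index. The filename-based heuristic and the names `guess_year` and `build_index` are assumptions for this example only; the real script may use a different method.

```python
import re
from collections import defaultdict

# A standalone 4-digit year between 1900 and 2099 (hypothetical heuristic).
YEAR_RE = re.compile(r"(?<!\d)(19|20)\d{2}(?!\d)")


def guess_year(filename):
    """Return the first 4-digit year found in filename, or None."""
    match = YEAR_RE.search(filename)
    return match.group(0) if match else None


def build_index(filenames):
    """Group Markdown filenames by guessed year and render an index."""
    by_year = defaultdict(list)
    for name in filenames:
        by_year[guess_year(name) or "unknown"].append(name)

    lines = ["# Index", ""]
    for year in sorted(by_year):  # "unknown" sorts after the digit years
        lines.append(f"## {year}")
        lines.extend(f"- [{name}]({year}/{name})" for name in sorted(by_year[year]))
        lines.append("")
    return "\n".join(lines)
```

Files without a recognizable year are grouped under `unknown` rather than dropped, so nothing silently disappears from the index.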
This skill integrates with the Scientific Writer CLI.
See assets/example_usage.md for comprehensive examples.