Web to Markdown
Jiang Wang Ye Zhuan Huan Wei Markdown Wen Jian De Gong Ju ,Zhi Chi Tu Pian Xia Zai He Wu Tou Liu Lan Qi Xuan Ran .
Claude Code Ji Neng Shi Yong (Tui Jian )
Ru Guo Ni Shi Claude Code Yong Hu ,Ke Yi Zhi Jie An Zhuang Wei Ji Neng Lai Shi Yong :
An Zhuang Ji Neng
# Jin Ru Claude Code Ji Neng Mu Lu
cd ~/.claude/skills
# Ke Long Huo Fu Zhi Ji Neng
git clone git@github.com:dean2021/web-to-markdown.git web-to-markdown
# Huo Zhe Zhi Jie Fu Zhi Da Bao Wen Jian
cp /path/to/web-to-markdown.skill ~/.claude/skills/
cd ~/.claude/skills
# Ke Long Huo Fu Zhi Ji Neng
git clone git@github.com:dean2021/web-to-markdown.git web-to-markdown
# Huo Zhe Zhi Jie Fu Zhi Da Bao Wen Jian
cp /path/to/web-to-markdown.skill ~/.claude/skills/
Shi Yong Ji Neng
An Zhuang Hou ,Zhi Xu Zai Dui Hua Zhong Gao Su Claude:
Jiang https://example.com/article Bao Cun Wei markdown
Claude Hui Zi Dong Diao Yong Ci Ji Neng Wan Cheng Zhuan Huan .
Yuan Ma An Zhuang (Gao Ji Yong Fa )
Ru Guo Xu Yao Zi Ding Yi Huo Kai Fa ,Ke Zhi Jie Shi Yong Yuan Ma :
An Zhuang Yi Lai
# Jin Ru Xiang Mu Mu Lu
cd web-to-markdown
# An Zhuang Yi Lai
pip install -r scripts/requirements.txt
# An Zhuang Playwright Liu Lan Qi
playwright install chromium
cd web-to-markdown
# An Zhuang Yi Lai
pip install -r scripts/requirements.txt
# An Zhuang Playwright Liu Lan Qi
playwright install chromium
Ming Ling Xing Shi Yong
# Ji Ben Yong Fa
python scripts/scrape.py "https://example.com/article"
# Zhi Ding Shu Chu Mu Lu
python scripts/scrape.py "https://example.com/article" -o ./output
# Diao Zheng Deng Dai Shi Jian (Gua He SPA He Lan Jia Zai Ye Mian )
python scripts/scrape.py "https://example.com/article" -w 5
python scripts/scrape.py "https://example.com/article"
# Zhi Ding Shu Chu Mu Lu
python scripts/scrape.py "https://example.com/article" -o ./output
# Diao Zheng Deng Dai Shi Jian (Gua He SPA He Lan Jia Zai Ye Mian )
python scripts/scrape.py "https://example.com/article" -w 5
Python API
from scripts.scrape import scrape_page
# Bao Cun Dao Dang Qian Mu Lu
md_path = scrape_page("https://example.com/article")
# Bao Cun Dao Zhi Ding Mu Lu
md_path = scrape_page(
url="https://example.com/article",
output_dir="./articles",
wait_time=5
)
# Bao Cun Dao Dang Qian Mu Lu
md_path = scrape_page("https://example.com/article")
# Bao Cun Dao Zhi Ding Mu Lu
md_path = scrape_page(
url="https://example.com/article",
output_dir="./articles",
wait_time=5
)
Can Shu Shuo Ming
| Can Shu | Shuo Ming | Mo Ren Zhi |
|---|---|---|
url |
Yao Zhuan Huan De Wang Ye URL | Bi Tian |
-o, --output |
Shu Chu Mu Lu | Dang Qian Mu Lu |
-w, --wait |
Ye Mian Jia Zai Hou Deng Dai Miao Shu | 3 |
Shu Chu Jie Gou
output-directory/
+-- article-title.md # Markdown Wen Jian (Han frontmatter)
+-- images/
+-- img_abc123def456.jpg
+-- img_987fed654abc.png
Markdown Shi Li
---
title: "Wen Zhang Biao Ti "
source: "https://example.com/article"
generated: "2024-01-01T12:00:00.000000"
---
# Wen Zhang Biao Ti
## Yi Ji Biao Ti
Zheng Wen Nei Rong ...

title: "Wen Zhang Biao Ti "
source: "https://example.com/article"
generated: "2024-01-01T12:00:00.000000"
---
# Wen Zhang Biao Ti
## Yi Ji Biao Ti
Zheng Wen Nei Rong ...

Tu Pian Zhi Chi
- Zi Dong Xia Zai Ge Shi : PNG, JPG, GIF, WebP, SVG, ICO
- Wen Jian Ming : Ji Yu URL De MD5 Ha Xi Zhi ,Que Bao Wei Yi Xing
- Xiang Dui Lu Jing : Markdown Zhong Shi Yong
images/filename.extYin Yong - Alt Wen Ben : Bao Liu Yuan Shi Tu Pian De alt Shu Xing
Fan Jian Ce Cuo Shi
- Sui Ji User-Agent: Cong Zhen Shi Liu Lan Qi UA Chi Zhong Sui Ji Xuan Ze
- Stealth Jiao Ben : Yi Chu
navigator.webdriverBiao Zhi - Zhen Shi Liu Lan Qi Tou : Accept, Accept-Language, Accept-Encoding Deng
- Shi Kou She Zhi : 1920x1080 Fen Bian Lu ,Mo Ni Zhen Shi Liu Lan Qi
Yi Lai
playwright>=1.40.0- Wu Tou Liu Lan Qibeautifulsoup4>=4.12.0- HTML Jie Xireadability-lxml>=0.8.1- Zheng Wen Ti Qu
Xu Ke Zheng
MIT License