dean2021/web-to-markdown

Web to Markdown

A tool that converts web pages to Markdown files, with image downloading and headless-browser rendering.


Use as a Claude Code Skill (Recommended)

If you are a Claude Code user, you can install the tool directly as a skill:

Install the Skill

# Enter the Claude Code skills directory
cd ~/.claude/skills

# Clone or copy the skill
git clone git@github.com:dean2021/web-to-markdown.git web-to-markdown

# Or copy the packaged skill file directly
cp /path/to/web-to-markdown.skill ~/.claude/skills/

Use the Skill

After installation, just tell Claude in a conversation:

Save https://example.com/article as markdown

Claude will invoke the skill automatically to perform the conversion.


Install from Source (Advanced Usage)

If you need to customize or develop the tool, you can use the source code directly:

Install Dependencies

# Enter the project directory
cd web-to-markdown

# Install the Python dependencies
pip install -r scripts/requirements.txt

# Install the Playwright browser
playwright install chromium

Command-Line Usage

# Basic usage
python scripts/scrape.py "https://example.com/article"

# Specify the output directory
python scripts/scrape.py "https://example.com/article" -o ./output

# Adjust the wait time (useful for SPAs and lazy-loaded pages)
python scripts/scrape.py "https://example.com/article" -w 5

Python API

from scripts.scrape import scrape_page

# Save to the current directory
md_path = scrape_page("https://example.com/article")

# Save to a specific directory
md_path = scrape_page(
    url="https://example.com/article",
    output_dir="./articles",
    wait_time=5,
)

Parameters

| Parameter | Description | Default |
| --- | --- | --- |
| url | URL of the web page to convert | Required |
| -o, --output | Output directory | Current directory |
| -w, --wait | Seconds to wait after the page loads | 3 |
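
The parameters above map naturally onto a small `argparse` parser. The following is only a sketch of how such a command line could be declared, not the actual contents of `scripts/scrape.py`; the function name `build_parser`, the integer type for `--wait`, and `"."` as the encoding of "current directory" are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Hypothetical parser mirroring the documented parameters."""
    parser = argparse.ArgumentParser(description="Convert a web page to Markdown.")
    # Required positional argument: the page to convert.
    parser.add_argument("url", help="URL of the web page to convert")
    # Assumption: the current directory is represented as ".".
    parser.add_argument("-o", "--output", default=".", help="output directory")
    # Assumption: the wait is a whole number of seconds.
    parser.add_argument("-w", "--wait", type=int, default=3,
                        help="seconds to wait after the page loads")
    return parser


# Parse an explicit argument list instead of sys.argv so the example is repeatable.
args = build_parser().parse_args(["https://example.com/article", "-w", "5"])
print(args.url, args.output, args.wait)
```

Omitting `-o` and `-w` falls back to the documented defaults (current directory and 3 seconds).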

Output Structure

output-directory/
+-- article-title.md          # Markdown file (with frontmatter)
+-- images/
    +-- img_abc123def456.jpg
    +-- img_987fed654abc.png

Markdown Example

---
title: "Article Title"
source: "https://example.com/article"
generated: "2024-01-01T12:00:00.000000"
---

# Article Title

## First-Level Heading

Body text...

![Description](images/img_hash.png)
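
As an illustration of how a frontmatter block like the one in the example could be produced, here is a minimal sketch. The helper name `build_frontmatter` is hypothetical, and quoting values with `json.dumps` is an assumption of this sketch (JSON strings are valid YAML double-quoted scalars); the project may build its header differently.

```python
import json
from datetime import datetime


def build_frontmatter(title: str, source: str) -> str:
    """Hypothetical helper: render a YAML frontmatter block for a scraped page."""
    # ISO 8601 timestamp with microseconds, matching the shape shown in the example.
    generated = datetime.now().isoformat(timespec="microseconds")
    lines = [
        "---",
        # json.dumps escapes embedded quotes and backslashes in the title.
        f"title: {json.dumps(title, ensure_ascii=False)}",
        f"source: {json.dumps(source)}",
        f'generated: "{generated}"',
        "---",
    ]
    return "\n".join(lines) + "\n"


print(build_frontmatter("Article Title", "https://example.com/article"))
```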

Image Support

  • Automatically downloaded formats: PNG, JPG, GIF, WebP, SVG, ICO
  • File names: derived from the MD5 hash of the image URL, which keeps them unique
  • Relative paths: the Markdown references images as images/filename.ext
  • Alt text: the alt attribute of the original image is preserved
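
The naming scheme in the list above can be sketched with the standard library alone. The helper names `image_filename` and `markdown_image` are hypothetical, and truncating the digest to 12 hex characters is an assumption inferred from the sample file names in the output structure, not something confirmed by the project source.

```python
import hashlib
import os
from urllib.parse import urlparse

# Extensions from the supported-format list (.jpeg added as a common alias).
KNOWN_EXTENSIONS = {".png", ".jpg", ".jpeg", ".gif", ".webp", ".svg", ".ico"}


def image_filename(url: str, default_ext: str = ".jpg") -> str:
    """Hypothetical helper: derive a stable local file name from an image URL."""
    # Hash the full URL so the same image always maps to the same file name.
    digest = hashlib.md5(url.encode("utf-8")).hexdigest()[:12]
    # Read the extension from the URL path only, ignoring any query string.
    ext = os.path.splitext(urlparse(url).path)[1].lower()
    if ext not in KNOWN_EXTENSIONS:
        ext = default_ext  # assumption: fall back when no usable extension is present
    return f"img_{digest}{ext}"


def markdown_image(url: str, alt: str = "") -> str:
    """Build a Markdown image reference relative to the .md file, keeping the alt text."""
    return f"![{alt}](images/{image_filename(url)})"


print(markdown_image("https://example.com/static/logo.png?v=2", "Logo"))
```

Hashing the URL rather than reusing the remote file name avoids collisions between images that share a name on different hosts.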

Anti-Detection Measures

  1. Random User-Agent: chosen at random from a pool of real browser UA strings
  2. Stealth script: removes the navigator.webdriver flag
  3. Realistic browser headers: Accept, Accept-Language, Accept-Encoding, and others
  4. Viewport: 1920x1080 resolution, mimicking a real browser
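
The four measures above could be wired into Playwright roughly as follows. This is a sketch under stated assumptions rather than the project's implementation: the User-Agent pool, the header values, and the names `build_context_options` and `fetch_html` are illustrative, and Accept-Encoding is left to the browser here.

```python
import random

# Assumption: a tiny illustrative pool; the project's real pool is not shown in this README.
USER_AGENTS = [
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
    "Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/537.36 "
    "(KHTML, like Gecko) Chrome/120.0.0.0 Safari/537.36",
]

# Runs before any page script and hides the automation flag.
STEALTH_JS = "Object.defineProperty(navigator, 'webdriver', { get: () => undefined });"


def build_context_options() -> dict:
    """Keyword arguments for Playwright's browser.new_context()."""
    return {
        "user_agent": random.choice(USER_AGENTS),
        "viewport": {"width": 1920, "height": 1080},
        "extra_http_headers": {
            "Accept": "text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8",
            "Accept-Language": "en-US,en;q=0.9",
        },
    }


def fetch_html(url: str, wait_seconds: int = 3) -> str:
    """Render a page in headless Chromium and return its HTML."""
    # Imported lazily so the options above can be inspected without Playwright installed.
    from playwright.sync_api import sync_playwright

    with sync_playwright() as p:
        browser = p.chromium.launch(headless=True)
        context = browser.new_context(**build_context_options())
        context.add_init_script(STEALTH_JS)
        page = context.new_page()
        page.goto(url)
        # Give SPAs and lazy-loaded content time to settle.
        page.wait_for_timeout(wait_seconds * 1000)
        html = page.content()
        browser.close()
        return html
```

Calling `fetch_html("https://example.com/article", wait_seconds=5)` requires the Playwright package and the Chromium browser installed as described in the installation steps.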

Dependencies

  • playwright>=1.40.0 - headless browser
  • beautifulsoup4>=4.12.0 - HTML parsing
  • readability-lxml>=0.8.1 - main-content extraction

License

MIT License
