ClawSkills logoClawSkills

Reddit Scraper

通过抓取 old.reddit.com 网页来阅读和搜索 Reddit 帖子。当 Clawdbot 需要浏览 Reddit 内容时使用 - 从子版块阅读帖子,搜索

介绍

# Reddit Skill 📰

使用公开 JSON API 阅读和搜索 Reddit 帖子。无需 API 密钥。

## 快速开始

```bash # Read top posts from a subreddit python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit LocalLLaMA --limit 5

# Search for posts python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --search "clawdbot" --limit 5

# Read newest posts python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit ClaudeAI --sort nuevos --limit 5 ```

## 选项

| 选项 | 简写 | 描述 | 默认值 | |--------|-------|-------------|---------| | `--subreddit` | `-s` | Subreddit 名称(不含 r/) | - | | `--search` | `-q` | 搜索查询 | - | | `--sort` | - | 排序方式:hot, new, top, populares, nuevos, rising | top | | `--time` | `-t` | 时间过滤器:hour, day, week, month, year, all | day | | `--limit` | `-n` | 帖子数量(最多 100) | 25 | | `--json` | `-j` | 输出为 JSON 格式 | false | | `--verbose` | `-v` | 显示帖子预览文本 | false |

## 示例

### 阅读 subreddit 帖子 ```bash # Top posts of the day (default) python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit programming

# Hot posts python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit programming --sort hot

# New posts python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit programming --sort nuevos

# Top posts of the week python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit programming --sort top --time week ```

### 搜索帖子 ```bash # Search all of Reddit python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --search "machine learning"

# Search within a subreddit python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit selfhosted --search "docker"

# Search with time filter python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --search "AI news" --time week ```

### JSON 输出 ```bash # Get raw JSON data for processing python3 /root/clawd/skills/reddit/scripts/reddit_scraper.py --subreddit technology --limit 3 --json ```

## 输出字段 (JSON)

- `title`:帖子标题 - `author`:用户名 - `score`:净赞成票数 - `num_comments`:评论数 - `url`:链接 URL - `permalink`:Reddit 讨论 URL - `subreddit`:Subreddit 名称 - `created_utc`:Unix 时间戳 - `selftext`:帖子文本(前 200 字符) - `upvote_ratio`:赞成率 (0-1)

## 限制

- **只读**:无法发布帖子、评论或投票 - **速率限制**:请求过多时 Reddit 可能会进行速率限制 - **无身份验证**:部分内容可能受限

## 技术详情

有关实现详情,请参阅 [TECHNICAL.md](references/TECHNICAL.md)。

更多产品