You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: CHANGELOG.md
+13Lines changed: 13 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -6,6 +6,19 @@ All notable changes to this repository will be documented in this file.
6
6
7
7
- Ongoing documentation and release polish.
8
8
9
+
## [v0.5.0] - 2026-03-13
10
+
11
+
### Added
12
+
13
+
-`--column COLUMN_NAME` support for reading paper ids / URLs from named columns in `.csv` and `.tsv` input files
14
+
15
+
### Changed
16
+
17
+
- plain-text `--input-file` behavior remains line-based and backward compatible
18
+
- structured CSV/TSV inputs now ignore blank rows, comment-only rows, and blank selected cells
19
+
- CSV/TSV files without `--column` now auto-select an input column only when it is unambiguous; otherwise the CLI fails with the available column names
20
+
- README, skill instructions, packaged artifact, and CI smoke coverage now document and verify structured file input handling
`--input-file PATH` reads one paper id or URL per line, ignores blank lines, ignores lines starting with `#`, and participates in the same single-item vs batch rendering rules as direct positional arguments.
155
+
`--input-file PATH` keeps `.txt` and other non-structured files line-based: one paper id or URL per line, with blank lines and lines starting with `#` ignored.
156
+
157
+
For `.csv` and `.tsv` files, the CLI reads a header row and then pulls values from a named column:
158
+
159
+
- use `--column COLUMN_NAME` to select the input column explicitly
160
+
- blank rows, comment-only rows, and rows where the selected column is blank are ignored
161
+
- if `--column` is omitted, the CLI only auto-selects a column when it is unambiguous (for example the file has exactly one column, or exactly one clearly named input column such as `paper_id` or `url`)
162
+
- otherwise it fails clearly and prints the available column names
163
+
164
+
Structured-file inputs participate in the same single-item vs batch rendering rules as direct positional arguments, and `--input-file` can still be combined with direct ids / URLs in the same command.
132
165
133
166
## Output fields
134
167
@@ -194,6 +227,7 @@ Structure:
194
227
-`--format brief` / `--format brief-zh` prefer the best retrieved summary, but can still produce a useful user-facing brief from the arXiv abstract alone
195
228
- Batch mode accepts multiple ids / URLs in one run and keeps single-item behavior backward compatible
196
229
-`--input-file PATH` can be used more than once and can be combined with direct ids / URLs in the same command
230
+
-`.csv` and `.tsv` inputs support header-based extraction through `--column COLUMN_NAME`, while plain text files keep the existing line-by-line behavior unchanged
197
231
- AlphaXiv is treated as a shortcut, not a replacement for reading the full paper when exact details matter
Copy file name to clipboardExpand all lines: SKILL.md
+5-1Lines changed: 5 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -17,8 +17,12 @@ Prefer alphaXiv first because it often exposes an AI-generated overview that is
17
17
- Accept alphaXiv URLs like `https://www.alphaxiv.org/overview/2401.12345`
18
18
2. Run the bundled script:
19
19
- The script accepts one or more paper ids / URLs in a single invocation.
20
-
- Use `--input-file PATH` to read one id / URL per line; ignore blank lines and lines starting with `#`.
20
+
- Use `--input-file PATH` to add repo-local batch inputs.
21
+
- Plain-text inputs stay line-based: read one id / URL per line, ignoring blank lines and lines starting with `#`.
22
+
- CSV/TSV inputs use a header row. Prefer `--column COLUMN_NAME` to select the input column explicitly.
23
+
- If `--column` is omitted for CSV/TSV, the script only auto-selects an obvious single input column; otherwise it fails and prints the available columns.
help="Read one paper id or URL per line from PATH. Blank lines and lines starting with # are ignored.",
1044
+
help="Read paper ids or URLs from PATH. Text files stay line-based; CSV/TSV files support header-based column selection.",
1045
+
)
1046
+
parser.add_argument(
1047
+
"--column",
1048
+
help="For CSV/TSV --input-file values, read paper ids or URLs from COLUMN_NAME. If omitted, an obvious structured column is used only when it can be chosen unambiguously.",
0 commit comments