Adds support for albert.cz by zdenek-stursa · Pull Request #1891 · hhursev/recipe-scrapers

zdenek-stursa · 2026-04-22T06:06:51Z

Adds scraper for albert.cz — Czech supermarket Albert recipe website.

The site uses schema.org/Recipe, with the following customizations:

instructions() — filters out numbered step markers (1., 2., etc.) from HowToStep names
description() — reads from <meta name="description"> (not present in schema)
author() and site_name() — hardcoded to Albert (schema author name is empty)

Recipes:

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

jknndy

Hi @zdenek-stursa , thanks for the PR! I've made two comments for your review

jknndy · 2026-04-23T22:20:44Z

+        filtered = [
+            line
+            for line in instructions.split("\n")
+            if not re.fullmatch(r"\d+\.", line.strip())


Suggested change

if not re.fullmatch(r"\d+\.", line.strip())

if not line.strip().endswith(".") or not line.strip()[:-1].isdigit()

Instead of importing re we can use a string check to see if the line ends with a period and the rest is numeric accomplishing the same output

jknndy · 2026-04-23T22:22:09Z

+import re
+


Suggested change

import re

Replace re.fullmatch(r"\d+\.", ...) with string-based check using endswith(".") and isdigit() as suggested by reviewer. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

zdenek-stursa · 2026-04-24T04:53:40Z

Thank you so much @jknndy for taking the time to review this! 🙏 Your suggestion is spot on — the string-based check is much cleaner and avoids an unnecessary import re. I've applied both changes.

This scraper is particularly close to my heart as it covers my wife's favorite recipe site, so I'm really happy someone had a look at it. Much appreciated! 😊

Adds support for albert.cz

fe4e2b2

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

jknndy requested changes Apr 23, 2026

View reviewed changes

Remove regex dependency in favor of string methods

d60ff69

Replace re.fullmatch(r"\d+\.", ...) with string-based check using endswith(".") and isdigit() as suggested by reviewer. Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>

jknndy approved these changes Apr 24, 2026

View reviewed changes

jknndy merged commit b1ce047 into hhursev:main Apr 24, 2026
17 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Adds support for albert.cz#1891

Adds support for albert.cz#1891
jknndy merged 2 commits intohhursev:mainfrom
zdenek-stursa:site/albert-cz

zdenek-stursa commented Apr 22, 2026 •

edited

Loading

Uh oh!

jknndy left a comment

Uh oh!

jknndy Apr 23, 2026

Uh oh!

jknndy Apr 23, 2026

Uh oh!

zdenek-stursa commented Apr 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

	if not re.fullmatch(r"\d+\.", line.strip())
	if not line.strip().endswith(".") or not line.strip()[:-1].isdigit()

Conversation

zdenek-stursa commented Apr 22, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

jknndy left a comment

Choose a reason for hiding this comment

Uh oh!

jknndy Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

jknndy Apr 23, 2026

Choose a reason for hiding this comment

Uh oh!

zdenek-stursa commented Apr 24, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

zdenek-stursa commented Apr 22, 2026 •

edited

Loading