Skip to content

Adds support for meatchurch.com#1893

Open
zdenek-stursa wants to merge 1 commit intohhursev:mainfrom
zdenek-stursa:site/meatchurch-com
Open

Adds support for meatchurch.com#1893
zdenek-stursa wants to merge 1 commit intohhursev:mainfrom
zdenek-stursa:site/meatchurch-com

Conversation

@zdenek-stursa
Copy link
Copy Markdown

Add scraper for meatchurch.com

Adds support for scraping recipes from meatchurch.com.

Closes #1884

Site structure

Meat Church uses a Shopify-based blog layout with no JSON-LD Recipe schema — recipe content is embedded in plain HTML:

  • Title: <meta property="og:title"> or <h1 class="section__title-text">
  • Ingredients: <li> items within <ul> elements inside div.article__content
  • Instructions: <p> elements after the last <ul> in the article body
  • Image: <meta property="og:image:secure_url">
  • Site name / Author: hardcoded "Meat Church"

Testing

Tested against the baby-back-ribs recipe from the issue.

All tests pass.

Edge cases handled

  • Ingredient lists include both required and optional ingredients (all <ul> items collected)
  • Instruction paragraphs containing only images or whitespace are filtered out via normalize_string

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

I'd like to request support for https://www.meatchurch.com/

1 participant