mirror of
https://github.com/wallabag/wallabag.git
synced 2025-09-15 18:57:05 +00:00
Created Creating a config file for well parsing a website (markdown)
parent
f5ae71ea99
commit
8ffef6d409
1 changed files with 20 additions and 0 deletions
20
Creating-a-config-file-for-well-parsing-a-website.md
Normal file
20
Creating-a-config-file-for-well-parsing-a-website.md
Normal file
|
@ -0,0 +1,20 @@
|
|||
If wallabag is not able to correctly fetch an article, you can create a file for the website which causes trouble.
|
||||
|
||||
Here is an example:
|
||||
|
||||
For bfmtv.com, you must have a specific file. Create a `bfmtv.com.txt` file in `/inc/3rdparty/site_config/custom` with this content:
|
||||
|
||||
```
|
||||
title: //title
|
||||
body: //h2 | //span[@class='masque'] | //article[@class='corps_article_right']
|
||||
prune: no
|
||||
tidy: no
|
||||
|
||||
test_url: http://www.bfmtv.com/societe/cigarette-electronique-dangers-588622.html
|
||||
```
|
||||
|
||||
The syntax for `title` and `body` parameters is http://en.wikipedia.org/wiki/XPath|XPath.
|
||||
|
||||
You can also try [Visual content block selector](http://siteconfig.fivefilters.org/).
|
||||
|
||||
You can find the files already created for specific websites here: https://github.com/wallabag/wallabag/tree/master/inc/3rdparty/site_config/standard
|
Loading…
Add table
Add a link
Reference in a new issue