Full-Text RSS

Image thumbnails

Why is the image thumbnail not being extracted from the content? I'm using the auto extraction method, and with it a blank thumbnail image is extracted.

1 Agent Answer

Differences between the real article's address and the RSS address

First of all, the characters below are Korean characters, so there could be errors in rendering the web pages and the RSS address...

1 Agent Answer

How do I strip a href tags out of the feed?

I am trying to strip just the links out of the feed. I have set up a custom site pattern with "strip: //a" inside it, but it doesn't work. Am I doing something wrong? Thanks

1 Agent Answer

Lost Conversions

I converted several documents this morning, but I had forgotten to add your email address to my Amazon Kindle acceptance list. I went in and did that, but the documents still haven't come through. Have I l...

1 Agent Answer

Char encoding problem

We have a problem with char encoding (self-hosted). For example, I used the URL from lemonde.fr that is used as an example in your site_config directory (lemonde.fr.txt): www.lemonde.fr/economie/article/20...

3 Agent Answers

[BUG REPORT] Character missing in link

Hello, I noticed that this character: ¿ is removed from the RSS, thus giving invalid links. For example: Instead of extracting: [correct url]: http://www.noticias365.com.ve/temas/sexologia-temas/infidel...

1 Agent Answer


Hi there. When I convert Arabic pages, they don't come out right. Could you please help me with this issue? Thank you for your great efforts.

2 Agent Answers

How to remove content text?

Hello! I am trying to remove in-content text. Here is the URL: http://salud.aollatino.com/2012/02/28/supersticiones-mitos-sexo/ The part I am trying to remove is at the end, just before a video: También...

2 Agent Answers

Extracting articles from non-RSS pages

When extracting articles from a category page on a site that doesn't have RSS, I am using single_page_link: [XPath]. However, it is extracting only the first article. Can I use some sort of loop so it kee...

1 Agent Answer

FeedBurner messing up the dc:identifier

Have you noticed that when using a feed from FeedBurner, there's some Google Analytics code added to the dc:identifier? I don't know about other users, but for us it's a major issue as we use this U...

2 Agent Answers
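
For readers hitting the FeedBurner issue above, here is a minimal sketch of one possible client-side workaround. It assumes the added Google Analytics code takes the form of utm_* query parameters appended to the URL; the excerpt does not confirm this. The function name strip_tracking_params and the example URL are hypothetical and are not part of Full-Text RSS.

```python
from urllib.parse import urlsplit, urlunsplit, parse_qsl, urlencode


def strip_tracking_params(url, prefix="utm_"):
    """Return url without query parameters whose names start with prefix.

    Assumption: the tracking code is carried in utm_* query parameters.
    """
    parts = urlsplit(url)
    # Keep every query pair except the ones that look like tracking parameters.
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if not key.startswith(prefix)
    ]
    # Rebuild the URL with the filtered query string; other parts are unchanged.
    return urlunsplit(parts._replace(query=urlencode(kept)))


# Hypothetical identifier as it might appear after passing through FeedBurner.
cleaned = strip_tracking_params(
    "http://example.com/post?id=7&utm_source=feedburner&utm_medium=feed"
)
print(cleaned)
```

Applying this to each identifier after fetching the feed would leave non-tracking parameters such as id in place; whether that matches the original, pre-FeedBurner URL depends on the feed.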