Commit graph

350 commits

Author SHA1 Message Date
Lucas Ou-Yang
e62b3b89c5 Merge pull request #26 from WheresWardy/portuguese
Replace instances of 'Portugease' with 'Portuguese'
2014-01-25 08:03:36 -08:00
WheresWardy
d9bead061e Replace instances of 'Portugease' with 'Portuguese' 2014-01-25 15:58:48 +00:00
Lucas Ou-Yang
f5f7b64ca9 Merge pull request #24 from adamstac/patch-1
It's The Changelog not The ChangeLog :)
2014-01-24 22:18:15 -08:00
Adam Stacoviak
2e7a3179cf It's The Changelog not The ChangeLog :)
* Also updated to link to the actual post (thanks for linking to us)
2014-01-25 00:16:03 -06:00
Lucas Ou-Yang
77b67e719c fixed merge conflict on parsers.py 2014-01-23 22:59:55 -08:00
Lucas Ou-Yang
d47fed39b7 update parser allow tags 2014-01-23 22:57:56 -08:00
Lucas Ou-Yang
e04b803f50 Merge pull request #21 from WheresWardy/master
Support for more HTML tags in parsers.py
2014-01-23 22:54:47 -08:00
Lucas Ou-Yang
3111a985ba Merge pull request #22 from arjun024/master
syntax errors
2014-01-23 21:36:24 -08:00
Arjun Sreedharan
20e82c6762 syntax errors
no colon after if + else if => elif
2014-01-24 10:06:37 +05:30
Matthew Ward
2354cf86dd Support for more HTML tags in parsers.py
Retain greater markup for supporting calibre ebook conversion
2014-01-23 09:32:55 +00:00
Lucas Ou-Yang
494d0d9d2c Merge pull request #20 from damilare/master
Fixed syntax error
2014-01-22 12:10:11 -08:00
Damilare Onajole
079d373776 Fixed syntax error 2014-01-22 20:03:11 +00:00
Lucas Ou-Yang
abfe0c018c Merge pull request #17 from techaddict/master
Minor Performance tweaks
2014-01-21 22:25:54 -08:00
Sandeep
02f3e085b5 Minor Performance tweaks 2014-01-21 15:54:08 +05:30
Lucas Ou-Yang
9fa147d551 Merge pull request #15 from girasquid/patch-1
Update README.rst
2014-01-20 08:45:17 -08:00
Luke Hutscal
baf9d49b66 Update README.rst
Fix typo
2014-01-20 10:06:33 -05:00
Lucas Ou-Yang
42e0f2777b Merge pull request #13 from techaddict/master
Minor Typo candiate_words -> candidate_words
2014-01-19 20:50:02 -08:00
Sandeep
616aa83f05 Minor Typo candiate_words -> candidate_words 2014-01-20 10:16:41 +05:30
Lucas Ou-Yang
58b15787b1 removing unused packages to save memory, added a brief description and license, author headers to all files 2014-01-19 12:23:34 -08:00
Lucas Ou-Yang
997857f744 update readme and docs 2014-01-18 10:52:50 -08:00
Lucas Ou-Yang
5b983f40b9 update readme 2014-01-18 10:50:29 -08:00
Lucas Ou-Yang
bcb834ba5e slight update on docs and readme 2014-01-18 10:48:41 -08:00
Lucas Ou-Yang
68d88f849d added changes, changed to new version 2014-01-18 01:09:58 -08:00
Lucas Ou-Yang
de8eddead9 fix praise section readme 2014-01-18 01:01:29 -08:00
Lucas Ou-Yang
c235b08809 fix readme 2014-01-18 00:58:39 -08:00
Lucas Ou-Yang
9df4bede0c remove useless class, fix format on readme 2014-01-18 00:55:52 -08:00
Lucas Ou-Yang
e45da24c3c added praise section to readme, updated contributors 2014-01-18 00:53:46 -08:00
Lucas Ou-Yang
d8a316abf4 fixed download_articles method to support multithreading 2014-01-18 00:46:10 -08:00
Lucas Ou-Yang
e729d41943 fixed a bunch of bugs where I was using the del keyword while iterating a list, refactored code for setting html in articles 2014-01-18 00:39:46 -08:00
Lucas Ou-Yang
4ce5f0232e huge bug pix with purging articles, using special python semantics + fast list comprehension 2014-01-18 00:08:05 -08:00
Lucas Ou-Yang
cba8db50fd refactored the addition of extracting article_html, added config option so user can choose if they want it or not (default off), moved all lxml imports back into the parser.py module 2014-01-12 23:39:48 -08:00
Lucas Ou-Yang
8b12b5a99c Merge pull request #11 from voidfiles/master
Add article html extraction
2014-01-12 22:45:54 -08:00
Alex Kessinger
f5e4c9ea20 Add article html extraction 2014-01-11 13:38:18 -08:00
Lucas Ou-Yang
7eb808fb3d trim chiense example in readme and docs 2014-01-09 21:02:21 -08:00
Lucas Ou-Yang
73808552e5 added strip() to all titles, text, html, encodable stuff pretty much 2014-01-09 17:34:37 -08:00
Lucas Ou-Yang
ef49c755e6 fixed docs format err 2014-01-09 03:23:37 -08:00
Lucas Ou-Yang
5345bd8f08 removed long description because it was giving me a unicode decode error when trying to pip install, this is a very big TODO 2014-01-09 03:16:27 -08:00
Lucas Ou-Yang
7982b66bbf fixing movie extraction, update docs 2014-01-09 03:08:31 -08:00
Lucas Ou-Yang
11b295d8e8 adding contributors, history, updating setup.py, updating docs 2014-01-09 03:02:31 -08:00
Lucas Ou-Yang
e3fe88407b addd readme for sina 2014-01-09 02:25:16 -08:00
Lucas Ou-Yang
194553d189 toggle readme 2014-01-09 02:13:09 -08:00
Lucas Ou-Yang
842698dda6 fixing readme 2014-01-09 02:05:47 -08:00
Lucas Ou-Yang
c64c002db4 fixed divide by zero error for nlp, adding content to readme 2014-01-09 02:00:11 -08:00
Lucas Ou-Yang
22274aa5a6 added test cases for multi language support and config intergration 2014-01-09 01:27:49 -08:00
Lucas Ou-Yang
8bf9c2a210 removed final_url, fixed ambiguities 2014-01-09 00:43:24 -08:00
Lucas Ou-Yang
15bbd8a9db added complete test cases for config setting 2014-01-09 00:28:14 -08:00
Lucas Ou-Yang
e70511e002 made languages a property, synced config settings with source and article initializers, created a fetch_images config option 2014-01-09 00:13:19 -08:00
Lucas Ou-Yang
abace3d099 adding test cases for config building 2014-01-08 16:39:38 -08:00
Lucas Ou-Yang
03c85b0435 config api change, added 10 langauges, fixed langauge setting for chinese and arabic, api much smoother for language toggling, fixed how we read html from requests where encoding is wrong 2014-01-08 13:30:04 -08:00
Lucas Ou-Yang
23c2d912c6 fix indenting 2014-01-06 02:31:09 -08:00