Lucas Ou-Yang
|
e62b3b89c5
|
Merge pull request #26 from WheresWardy/portuguese
Replace instances of 'Portugease' with 'Portuguese'
|
2014-01-25 08:03:36 -08:00 |
|
WheresWardy
|
d9bead061e
|
Replace instances of 'Portugease' with 'Portuguese'
|
2014-01-25 15:58:48 +00:00 |
|
Lucas Ou-Yang
|
f5f7b64ca9
|
Merge pull request #24 from adamstac/patch-1
It's The Changelog not The ChangeLog :)
|
2014-01-24 22:18:15 -08:00 |
|
Adam Stacoviak
|
2e7a3179cf
|
It's The Changelog not The ChangeLog :)
* Also updated to link to the actual post (thanks for linking to us)
|
2014-01-25 00:16:03 -06:00 |
|
Lucas Ou-Yang
|
77b67e719c
|
fixed merge conflict on parsers.py
|
2014-01-23 22:59:55 -08:00 |
|
Lucas Ou-Yang
|
d47fed39b7
|
update parser allow tags
|
2014-01-23 22:57:56 -08:00 |
|
Lucas Ou-Yang
|
e04b803f50
|
Merge pull request #21 from WheresWardy/master
Support for more HTML tags in parsers.py
|
2014-01-23 22:54:47 -08:00 |
|
Lucas Ou-Yang
|
3111a985ba
|
Merge pull request #22 from arjun024/master
syntax errors
|
2014-01-23 21:36:24 -08:00 |
|
Arjun Sreedharan
|
20e82c6762
|
syntax errors
no colon after if + else if => elif
|
2014-01-24 10:06:37 +05:30 |
|
Matthew Ward
|
2354cf86dd
|
Support for more HTML tags in parsers.py
Retain greater markup for supporting calibre ebook conversion
|
2014-01-23 09:32:55 +00:00 |
|
Lucas Ou-Yang
|
494d0d9d2c
|
Merge pull request #20 from damilare/master
Fixed syntax error
|
2014-01-22 12:10:11 -08:00 |
|
Damilare Onajole
|
079d373776
|
Fixed syntax error
|
2014-01-22 20:03:11 +00:00 |
|
Lucas Ou-Yang
|
abfe0c018c
|
Merge pull request #17 from techaddict/master
Minor Performance tweaks
|
2014-01-21 22:25:54 -08:00 |
|
Sandeep
|
02f3e085b5
|
Minor Performance tweaks
|
2014-01-21 15:54:08 +05:30 |
|
Lucas Ou-Yang
|
9fa147d551
|
Merge pull request #15 from girasquid/patch-1
Update README.rst
|
2014-01-20 08:45:17 -08:00 |
|
Luke Hutscal
|
baf9d49b66
|
Update README.rst
Fix typo
|
2014-01-20 10:06:33 -05:00 |
|
Lucas Ou-Yang
|
42e0f2777b
|
Merge pull request #13 from techaddict/master
Minor Typo candiate_words -> candidate_words
|
2014-01-19 20:50:02 -08:00 |
|
Sandeep
|
616aa83f05
|
Minor Typo candiate_words -> candidate_words
|
2014-01-20 10:16:41 +05:30 |
|
Lucas Ou-Yang
|
58b15787b1
|
removing unused packages to save memory, added a brief description and license, author headers to all files
|
2014-01-19 12:23:34 -08:00 |
|
Lucas Ou-Yang
|
997857f744
|
update readme and docs
|
2014-01-18 10:52:50 -08:00 |
|
Lucas Ou-Yang
|
5b983f40b9
|
update readme
|
2014-01-18 10:50:29 -08:00 |
|
Lucas Ou-Yang
|
bcb834ba5e
|
slight update on docs and readme
|
2014-01-18 10:48:41 -08:00 |
|
Lucas Ou-Yang
|
68d88f849d
|
added changes, changed to new version
|
2014-01-18 01:09:58 -08:00 |
|
Lucas Ou-Yang
|
de8eddead9
|
fix praise section readme
|
2014-01-18 01:01:29 -08:00 |
|
Lucas Ou-Yang
|
c235b08809
|
fix readme
|
2014-01-18 00:58:39 -08:00 |
|
Lucas Ou-Yang
|
9df4bede0c
|
remove useless class, fix format on readme
|
2014-01-18 00:55:52 -08:00 |
|
Lucas Ou-Yang
|
e45da24c3c
|
added praise section to readme, updated contributors
|
2014-01-18 00:53:46 -08:00 |
|
Lucas Ou-Yang
|
d8a316abf4
|
fixed download_articles method to support multithreading
|
2014-01-18 00:46:10 -08:00 |
|
Lucas Ou-Yang
|
e729d41943
|
fixed a bunch of bugs where I was using the del keyword while iterating a list, refactored code for setting html in articles
|
2014-01-18 00:39:46 -08:00 |
|
Lucas Ou-Yang
|
4ce5f0232e
|
huge bug fix with purging articles, using special python semantics + fast list comprehension
|
2014-01-18 00:08:05 -08:00 |
|
Lucas Ou-Yang
|
cba8db50fd
|
refactored the addition of extracting article_html, added config option so user can choose if they want it or not (default off), moved all lxml imports back into the parser.py module
|
2014-01-12 23:39:48 -08:00 |
|
Lucas Ou-Yang
|
8b12b5a99c
|
Merge pull request #11 from voidfiles/master
Add article html extraction
|
2014-01-12 22:45:54 -08:00 |
|
Alex Kessinger
|
f5e4c9ea20
|
Add article html extraction
|
2014-01-11 13:38:18 -08:00 |
|
Lucas Ou-Yang
|
7eb808fb3d
|
trim chinese example in readme and docs
|
2014-01-09 21:02:21 -08:00 |
|
Lucas Ou-Yang
|
73808552e5
|
added strip() to all titles, text, html, encodable stuff pretty much
|
2014-01-09 17:34:37 -08:00 |
|
Lucas Ou-Yang
|
ef49c755e6
|
fixed docs format err
|
2014-01-09 03:23:37 -08:00 |
|
Lucas Ou-Yang
|
5345bd8f08
|
removed long description because it was giving me a unicode decode error when trying to pip install, this is a very big TODO
|
2014-01-09 03:16:27 -08:00 |
|
Lucas Ou-Yang
|
7982b66bbf
|
fixing movie extraction, update docs
|
2014-01-09 03:08:31 -08:00 |
|
Lucas Ou-Yang
|
11b295d8e8
|
adding contributors, history, updating setup.py, updating docs
|
2014-01-09 03:02:31 -08:00 |
|
Lucas Ou-Yang
|
e3fe88407b
|
added readme for sina
|
2014-01-09 02:25:16 -08:00 |
|
Lucas Ou-Yang
|
194553d189
|
toggle readme
|
2014-01-09 02:13:09 -08:00 |
|
Lucas Ou-Yang
|
842698dda6
|
fixing readme
|
2014-01-09 02:05:47 -08:00 |
|
Lucas Ou-Yang
|
c64c002db4
|
fixed divide by zero error for nlp, adding content to readme
|
2014-01-09 02:00:11 -08:00 |
|
Lucas Ou-Yang
|
22274aa5a6
|
added test cases for multi language support and config integration
|
2014-01-09 01:27:49 -08:00 |
|
Lucas Ou-Yang
|
8bf9c2a210
|
removed final_url, fixed ambiguities
|
2014-01-09 00:43:24 -08:00 |
|
Lucas Ou-Yang
|
15bbd8a9db
|
added complete test cases for config setting
|
2014-01-09 00:28:14 -08:00 |
|
Lucas Ou-Yang
|
e70511e002
|
made languages a property, synced config settings with source and article initializers, created a fetch_images config option
|
2014-01-09 00:13:19 -08:00 |
|
Lucas Ou-Yang
|
abace3d099
|
adding test cases for config building
|
2014-01-08 16:39:38 -08:00 |
|
Lucas Ou-Yang
|
03c85b0435
|
config api change, added 10 languages, fixed language setting for chinese and arabic, api much smoother for language toggling, fixed how we read html from requests where encoding is wrong
|
2014-01-08 13:30:04 -08:00 |
|
Lucas Ou-Yang
|
23c2d912c6
|
fix indenting
|
2014-01-06 02:31:09 -08:00 |
|