[ddj] [School-of-data] scraping data with a bookmarklet: Convextra

Michael Bauer michael.bauer at okfn.org
Fri Apr 19 09:10:41 UTC 2013


Scott,

If there is any sort of pagination present Convextra will offer you the
possibility to do pages. This only works for lists with pagination though
(not something where you have a list of links to pages and want to scrape
all subpages).

Michael

On Fri, Apr 19, 2013 at 11:17:04AM +1000, Scott Hannaford wrote:
> Did someone work out how to do the multi-page scraping using Convextra? I
> don't seem to be able to find an option for more than one page.
> Thanks,
> Scott
> 
> *Scott Hannaford*
> Sunday Editor - The Canberra Times
> 9 Pirie Street, Fyshwick, ACT, 2609
> *T* (02) 6280 2209 *|* *M* 0417 272 498 *|* *E*
> scott.hannaford at fairfaxmedia.com.au
> 
> <http://www.google.com/url?q=http%3A%2F%2Fwww.canberratimes.com.au&sa=D&sntz=1&usg=AFrqEzcIslRHAZnloAMgYSKJjweiUDC4ww>
> 
> 
> 
> On 11 April 2013 01:46, Tom Morris <tfmorris at gmail.com> wrote:
> 
> > On Wed, Apr 10, 2013 at 10:59 AM, Michael Bauer <michael.bauer at okfn.org>wrote:
> >
> >>
> >> Just discovered: http://convextra.com/ - it allows you to install a
> >> bookmarklet and scrape webpages without coding - simply by analyzing the
> >> structure.
> >>
> >
> > If privacy matters for your scraping, note that this is a web service, so
> > they, of course,  can monitor everything that you scrape.
> >
> > Tom
> >
> > _______________________________________________
> > data-driven-journalism mailing list
> > data-driven-journalism at lists.okfn.org
> > http://lists.okfn.org/mailman/listinfo/data-driven-journalism
> > Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism
> >
> >
> 
> -- 
> The information contained in this e-mail message and any accompanying files 
> is or may be confidential. If you are not the intended recipient, any use, 
> dissemination, reliance, forwarding, printing or copying of this e-mail or 
> any attached files is unauthorised. This e-mail is subject to copyright. No 
> part of it should be reproduced, adapted or communicated without the 
> written consent of the copyright owner. If you have received this e-mail in 
> error please advise the sender immediately by return e-mail or telephone 
> and delete all copies. Fairfax Media does not guarantee the accuracy or 
> completeness of any information contained in this e-mail or attached files. 
> Internet communications are not secure, therefore Fairfax Media does not 
> accept legal responsibility for the contents of this message or attached 
> files.

> _______________________________________________
> data-driven-journalism mailing list
> data-driven-journalism at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/data-driven-journalism
> Unsubscribe: http://lists.okfn.org/mailman/options/data-driven-journalism


-- 
Data Wrangler with the Open Knowledge Foundation (OKFN.org)
GPG/PGP key: http://tentacleriot.eu/mihi.asc
Twitter: @mihi_tr Skype: mihi_tr




More information about the data-driven-journalism mailing list