[ddj] Good resource for combining data in Excel or Spreadsheets?

Tony.Hirst tony.hirst at open.ac.uk
Mon Oct 27 17:35:31 UTC 2014


I'm using IPython notebooks a lot for data activities at the moment,
particularly using the pandas library, which makes working with tabular
datasets relatively easy. I posted a draft notebook that covers simple
joins at:

http://nbviewer.ipython.org/github/psychemedia/ou-tm351/blob/master/notebooks-RFC/04.4%20Merging%20and%20Joining%20Data.ipynb


(This is the unexecuted version of the notebook.)
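
To give a flavour of what a simple join looks like in pandas, here is a
minimal sketch. It is not taken from the notebook; the two tables and their
column names (country, population_m, capital) are made up for illustration.

```python
import pandas as pd

# Hypothetical example data: two small tables that share a "country" column.
population = pd.DataFrame({
    "country": ["France", "Germany", "Spain"],
    "population_m": [66, 81, 46],
})
capitals = pd.DataFrame({
    "country": ["France", "Germany", "Italy"],
    "capital": ["Paris", "Berlin", "Rome"],
})

# Inner join: keep only the countries that appear in both tables.
inner = pd.merge(population, capitals, on="country", how="inner")

# Left join: keep every row of `population`; countries with no match
# in `capitals` get a missing value (NaN) in the "capital" column.
left = pd.merge(population, capitals, on="country", how="left")

print(inner)
print(left)
```

The how argument also accepts "right" and "outer", which is usually the
main thing to decide when combining two sources.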

Note that the notebook was produced as part of a set of notebooks intended
for a distance education course, and as such may rely on things taught in
earlier notebooks or in other teaching materials.

The notebook also includes examples of using SQL (via SQLite) for joins.
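
As a rough idea of the SQL route, the sketch below runs a left join in an
in-memory SQLite database using Python's built-in sqlite3 module. Again, this
is not the notebook's own code; the table and column names are invented for
the example.

```python
import sqlite3

# In-memory database, so nothing is written to disk.
conn = sqlite3.connect(":memory:")

# Hypothetical example tables sharing a "country" column.
conn.execute("CREATE TABLE population (country TEXT, population_m INTEGER)")
conn.execute("CREATE TABLE capitals (country TEXT, capital TEXT)")
conn.executemany(
    "INSERT INTO population VALUES (?, ?)",
    [("France", 66), ("Germany", 81), ("Spain", 46)],
)
conn.executemany(
    "INSERT INTO capitals VALUES (?, ?)",
    [("France", "Paris"), ("Germany", "Berlin"), ("Italy", "Rome")],
)

# Left join: every row from population, with the capital where one matches
# and NULL (None in Python) where it does not.
rows = conn.execute(
    """
    SELECT p.country, p.population_m, c.capital
    FROM population AS p
    LEFT JOIN capitals AS c ON p.country = c.country
    ORDER BY p.country
    """
).fetchall()

for row in rows:
    print(row)
```

Swapping LEFT JOIN for INNER JOIN drops the rows that have no match in the
second table.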

At some point I hope to produce a set of School of Data materials on
wrangling data in IPython notebooks using pandas, and possibly R.

tony


On 27/10/2014 12:00, "data-driven-journalism-request at lists.okfn.org"
<data-driven-journalism-request at lists.okfn.org> wrote:

>Send data-driven-journalism mailing list submissions to
>       data-driven-journalism at lists.okfn.org
>
>To subscribe or unsubscribe via the World Wide Web, visit
>       https://lists.okfn.org/mailman/listinfo/data-driven-journalism
>or, via email, send a message with subject or body 'help' to
>       data-driven-journalism-request at lists.okfn.org
>
>You can reach the person managing the list at
>       data-driven-journalism-owner at lists.okfn.org
>
>When replying, please edit your Subject line so it is more specific
>than "Re: Contents of data-driven-journalism digest..."
>
>
>Today's Topics:
>
>   1. Good resource for combining data in Excel or Spreadsheets?
>      (Catherine D'Ignazio)
>
>
>----------------------------------------------------------------------
>
>Message: 1
>Date: Mon, 27 Oct 2014 06:57:06 -0400
>From: "Catherine D'Ignazio" <dignazio at media.mit.edu>
>To: "List about Data Driven Journalism and Open Data in Journalism."
>       <data-driven-journalism at lists.okfn.org>
>Subject: [ddj] Good resource for combining data in Excel or
>       Spreadsheets?
>Message-ID:
>       <CAA0vg77kn+f85JnTzGGB5sWJtBw+4-xQTnhzmD5OdjUQ3ZEq3w at mail.gmail.com>
>Content-Type: text/plain; charset="utf-8"
>
>Hi all -
>
>I teach a data visualization course at Emerson College. We are just
>starting to talk about cleaning and combining data from multiple sources
>into a single file. We are using OpenRefine for cleaning.
>
>Does anyone have a good, simple tutorial link or resource for combining
>data? Could be in Excel, Google Spreadsheets, or another program if it's
>free and easily accessible.
>
>Catherine
>
>
>/////////////////////////////
>Catherine D'Ignazio
>Research Affiliate, MIT Media Lab Center for Civic Media
>dignazio at mit.edu  ||   @kanarinka   ||   +1 617 501 2441   ||
>www.kanarinka.com || http://civic.mit.edu/blog/kanarinka/
>
>------------------------------
>
>End of data-driven-journalism Digest, Vol 43, Issue 9
>*****************************************************

-- The Open University is incorporated by Royal Charter (RC 000391), an exempt charity in England & Wales and a charity registered in Scotland (SC 038302). The Open University is authorised and regulated by the Financial Conduct Authority.


