[School-of-data] Community Data Wrangling -- Food Data Analysis Project
Simon Cropper
simoncropper at fossworkflowguides.com
Thu Jul 17 10:11:03 UTC 2014
Hi All,
I have started working on the Community Data Wrangling -- Food Data
Analysis Project.
This project aims to...
1. create a standardized, authoritative and open database of what is
found in the food we consume (i.e. nutrients, additives and
contaminants);
2. run community data wrangling projects and/or events to both...
* tease out answers to community-posed questions, and
* help train people in standard data wrangling techniques;
3. distribute the outcome of these investigations to the general public
to improve the quality and reliability of information being
disseminated on food composition.
This project's base data, analyses and products will be stored on a
publicly accessible GitHub repository
<https://github.com/SimonChristopherCropper/Food_Data_Analysis>. All
data stored in this repository are released under open licenses
<http://opendefinition.org/od/>, which allow for the creation and
distribution of comparable derivatives.
People interested in participating in various stages of the project can
register on the community forum
<http://www.fossworkflowguides.com/Community/> and subscribe to a
specific activity or forum.
I have created a landing page for visitors that outlines the project, the
data being used and, eventually, each project as it comes on-line. This
is just a static webpage to centralize links, not the main forum for
disseminating information -- each participant will be able to use their
own social media sites to do this.
http://www.fossworkflowguides.com/cdw/
If you are interested, I hope to see you registering on the forum and/or
forking the GitHub repository! Once you are on the forum, please
introduce yourself and your interests, so that if a project on a topic
you care about slips past you, I can give you a poke.
I would like to thank Tam McKenzie for her assistance in creating a logo
for the project. She produced a great-looking design and saved me a
considerable amount of time.
*The initial project* I will be working on is to revisit all the
databases and prepare an article for publication on the variety of
licenses used with this type of data. My initial analysis showed only 3
of the 32 databases checked were released under open licenses. I would
like to document these characteristics more thoroughly and discuss the
implications this has for data accessibility and derivative creation. If
anyone is interested in this project, subscribe to the specific licensing
forum in your *User Control Panel* and/or reply to this thread
<http://www.fossworkflowguides.com/Community/showthread.php?tid=1> to be
notified once a team has assembled. Hope to see you there.
Happy Wrangling,
Simon
On 17/06/14 01:34, E.C Okpo wrote:
> Welcome Simon!
>
> It's great to meet another person interested in food data. I am also
> exploring doing analysis with Pandas.
>
> I know you're primarily interested in nutritional composition, but you
> should check out this article
> http://www.flavourjournal.com/content/2/1/4. The author used Network
> Analysis to explore flavor pairings, and had some interesting findings.
>
>
> Christy
>
> --
> Cheers Simon
>
> Simon Cropper - Open Content Creator
>
> Free and Open Source Software Workflow Guides
> ------------------------------------------------------------
> Introduction http://www.fossworkflowguides.com
> GIS Packages http://www.fossworkflowguides.com/gis
> bash / Python http://www.fossworkflowguides.com/scripting