[ckan-dev] DataHub being overrun with spam

Ross Jones ross at servercode.co.uk
Fri May 31 18:04:59 UTC 2013


Hi,

I've temporarily disabled user registration until we can get it under control. A few weeks ago I had a script that would clean up datasets/groups based on a simple heuristic (you'd think they'd use different mail providers); I'll see if I can dig it out.
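
For anyone who wants to try something similar in the meantime, here is a rough sketch of a mail-provider heuristic. It is not the original script: the function name `flag_suspect_users`, the `min_accounts` threshold and the `(username, email)` input format are all assumptions made for illustration, and it does not call any CKAN API. It only produces a list of accounts for an admin to review by hand.

```python
from collections import Counter


def email_domain(email):
    """Return the lower-cased domain part of an email address."""
    return email.rsplit("@", 1)[-1].strip().lower()


def flag_suspect_users(users, min_accounts=3):
    """Return sorted usernames whose email domain is shared by many accounts.

    `users` is an iterable of (username, email) pairs (hypothetical format).
    A domain counts as suspect when at least `min_accounts` users share it.
    """
    users = list(users)
    # Count how many accounts were registered under each mail domain.
    counts = Counter(email_domain(email) for _, email in users)
    return sorted(
        name for name, email in users
        if counts[email_domain(email)] >= min_accounts
    )


if __name__ == "__main__":
    # Made-up sample data, not real DataHub accounts.
    sample = [
        ("user_a", "a@spam.example"),
        ("user_b", "b@spam.example"),
        ("user_c", "c@SPAM.example"),
        ("user_d", "d@other.example"),
    ]
    for name in flag_suspect_users(sample):
        print(name)
```

A domain count alone will also catch large legitimate providers, so the output should be treated as a review queue rather than a delete list.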

I've been intending to upgrade datahub.io to v2 of CKAN, and to start making some changes to make it more useful as a community hub. I definitely think a workflow where a user's first contribution is moderated might help. I'll add a ticket to https://github.com/okfn/ckanext-datahub/issues (please feel free to add more).

I'm pretty stacked with work this week, but will look at it when I get back from the conference next weekend.

Tom, if you can let me know your username I'll make you an admin.

Ross


On 31 May 2013, at 18:45, Tom Morris <tfmorris at gmail.com> wrote:

> I don't know how big the current admin team is, but I'm happy to help out.  I won't be able to commit significant time to it, but I'm happy to delete the stuff I run across.
> 
> Having said that, a more scalable solution would be to allow all users to flag things for admin review and/or to use an anti-spam tool like Mollom or its ilk.
> 
> Tom
> 
> 
> On Fri, May 31, 2013 at 1:25 PM, Rufus Pollock <rufus.pollock at okfn.org> wrote:
> Hi Tom,
> 
> 2 things:
> 
> - As an immediate but temporary step we could shut off user registrations - @ross: would you be up for this?
> - It is pretty easy to delete spam if you are an admin (just delete the datasets or delete the revision - there is a button on http://datahub.io/revision for this)
> 
> If you are not an admin already I can make you one - anyone else who would like to help with this please say and I can make you an admin too.
> 
> rufus
> 
> 
> On 31 May 2013 18:12, Tom Morris <tfmorris at gmail.com> wrote:
> [The contact link at datahub.io links to ckan.org/contact, but not sure that's right, so copying OKFN Labs as well]
> 
> There is a large amount of spam in the data set listings on the Data Hub and, given the number of accounts created in the last few days, it seems to be increasing.
> 
> Here are some examples:
> 
> http://datahub.io/user/met3own9
> http://datahub.io/user/simaaraiza
> http://datahub.io/user/robert_tao
> http://datahub.io/user/elinordelk
> http://datahub.io/user/edwina
> http://datahub.io/user/marryvillalpando
> http://datahub.io/user/slimdaidaihua
> http://datahub.io/user/danedoyle9
> http://datahub.io/user/admin
> http://datahub.io/user/maryinoue1
> http://datahub.io/user/sureshshan
> 
> http://datahub.io/dataset?q=diet&page=1
> http://datahub.io/dataset?q=drugs&page=3
> 
> or just search for any popular spammers' terms.
> 
> Tom
> 
> 
> 
> 
> -- 
> Rufus Pollock
> Founder and Co-Director  |  skype: rufuspollock  |  @rufuspollock
> The Open Knowledge Foundation
> Empowering through Open Knowledge
> http://okfn.org/  |  @okfn  |  OKF on Facebook  |  Blog  |  Newsletter
> 
> 
> 
