[okfn-labs] [ckan-dev] DataHub being overrun with spam

Anders Pedersen anders.pedersen at okfn.org
Sat Jun 1 13:23:14 UTC 2013


Hi all,

Just flagging that there is a fairly large cross-country hack event in the US
today and tomorrow:
http://hackforchange.org/

I wonder if it would be possible to have registrations open over this
weekend at least?

Anders

On 31 May 2013 19:13, Vitor Baptista <vitor at vitorbaptista.com> wrote:

> Hi all,
>
> I cleaned up a few spam entries in datahub a few months ago, but they keep
> coming back. I'm working on upgrading it to CKAN 2.0, probably finishing
> today/tomorrow. There's already a staging site at
> http://datahub.staging.ckanhosted.com/ for it, running on 2.0 using a
> dump from a few weeks ago.
>
> I'm finishing setting up all the plugins that it uses, then I'll update the
> texts and we can figure out the best way to pull the plug on the old
> server and switch to the new one.
>
> Cheers,
> Vítor Baptista.
>
> Vítor Baptista
>
> Developer  |  http://vitorbaptista.com | LinkedIn<http://www.linkedin.com/in/vitorbaptista>|
> @vitorbaptista <http://twitter.com/vitorbaptista>
>
> The Open Knowledge Foundation <http://okfn.org>
>
> *Empowering through Open Knowledge*
>
> http://okfn.org/  |  @okfn <http://twitter.com/okfn>  |  OKF on Facebook<https://www.facebook.com/OKFNetwork> |
> Blog <http://blog.okfn.org/>  |  Newsletter<http://okfn.org/about/newsletter/>
>
>
>
> 2013/5/31 Ross Jones <ross at servercode.co.uk>
>
>> Hi,
>>
>> I've temporarily disabled user registration until we can get it under
>> control.  I had a script a few weeks ago that would clean up
>> datasets/groups based on a simple heuristic (you'd think they'd use
>> different mail providers); I'll see if I can dig it out.
>>
>> I've been intending to upgrade datahub.io to v2 of CKAN, and to start
>> making some changes to make it more useful as a community hub.  I
>> definitely think a workflow where a user's first contribution is moderated
>> might help. I'll add a ticket to
>> https://github.com/okfn/ckanext-datahub/issues (please feel free to add
>> more).
>>
>> I'm pretty stacked with work this week, but will look at it when I get
>> back from the conference next weekend.
>>
>> Tom, if you can let me know your username I'll make you an admin.
>>
>> Ross
>>
>>
>> On 31 May 2013, at 18:45, Tom Morris <tfmorris at gmail.com> wrote:
>>
>> I don't know how big the current admin team is, but I'm happy to help
>> out.  I won't be able to commit significant time to it, but I'm happy to
>> delete the stuff I run across.
>>
>> Having said that, a more scalable solution would be to allow all users to
>> flag things for admin review and/or to use an anti-spam tool like
>> Mollom or its ilk.
>>
>> Tom
>>
>>
>> On Fri, May 31, 2013 at 1:25 PM, Rufus Pollock <rufus.pollock at okfn.org> wrote:
>>
>>> Hi Tom,
>>>
>>> 2 things:
>>>
>>> - As an immediate but temporary step we could shut off user
>>> registrations - @ross: would you be up for this?
>>> - It is pretty easy to delete spam if you are an admin (just delete the
>>> datasets or delete the revision - there is a button on
>>> http://datahub.io/revision for this)
>>>
>>> If you are not an admin already I can make you one - anyone else who
>>> would like to help with this please say and I can make you an admin too.
>>>
>>> rufus
>>>
>>>
>>> On 31 May 2013 18:12, Tom Morris <tfmorris at gmail.com> wrote:
>>>
>>>> [The contact link at datahub.io links to ckan.org/contact, but not
>>>> sure that's right, so copying OKFN Labs as well]
>>>>
>>>> There is a large amount of spam in the data set listings on the Data
>>>> Hub and, given the number of accounts created in the last few days, it
>>>> seems to be increasing.
>>>>
>>>> Here are some examples:
>>>>
>>>> http://datahub.io/user/met3own9
>>>> http://datahub.io/user/simaaraiza
>>>> http://datahub.io/user/robert_tao
>>>> http://datahub.io/user/met3own9
>>>> http://datahub.io/user/elinordelk
>>>> http://datahub.io/user/edwina
>>>> http://datahub.io/user/marryvillalpando
>>>> http://datahub.io/user/slimdaidaihua
>>>> http://datahub.io/user/danedoyle9
>>>> http://datahub.io/user/admin
>>>> http://datahub.io/user/maryinoue1
>>>> http://datahub.io/user/sureshshan
>>>>
>>>> http://datahub.io/dataset?q=diet&page=1
>>>> http://datahub.io/dataset?q=drugs&page=3
>>>>
>>>> or just search for any popular spammers' terms.
>>>>
>>>> Tom
>>>>
>>>>
>>>
>>>
>>> --
>>> *
>>>  Rufus Pollock
>>> Founder and Co-Director | skype: rufuspollock | @rufuspollock<https://twitter.com/rufuspollock>
>>> The Open Knowledge Foundation <http://okfn.org/>
>>> Empowering through Open Knowledge
>>> http://okfn.org/ | @okfn <http://twitter.com/OKFN> | OKF on Facebook<https://www.facebook.com/OKFNetwork>|
>>> Blog <http://blog.okfn.org/>  |  Newsletter<http://okfn.org/about/newsletter>
>>>
>>>
>>> *
>>>
>>
>>
>>
>> _______________________________________________
>> ckan-dev mailing list
>> ckan-dev at lists.okfn.org
>> http://lists.okfn.org/mailman/listinfo/ckan-dev
>> Unsubscribe: http://lists.okfn.org/mailman/options/ckan-dev
>>
>>
>
> _______________________________________________
> okfn-labs mailing list
> okfn-labs at lists.okfn.org
> http://lists.okfn.org/mailman/listinfo/okfn-labs
> Unsubscribe: http://lists.okfn.org/mailman/options/okfn-labs
>
>


-- 
*

Anders Pedersen

Community Coordinator  |  skype: anpehej  |  @anpe <https://twitter.com/>

The Open Knowledge Foundation <http://okfn.org/>

Empowering through Open Knowledge

http://okfn.org/  |  @okfn <http://twitter.com/OKFN>  |  OKF on
Facebook<https://www.facebook.com/OKFNetwork> |
Blog <http://blog.okfn.org/>  |  Newsletter<http://okfn.org/about/newsletter>

*

OpenSpending | http://openspending.org |
@openspending<http://twitter.com/openspending>

School of Data | http://schoolofdata.org |
@schoolofdata<http://twitter.com/schoolofdata>

