[okfn-labs] Help clean up the UK spending data!

Friedrich Lindenberg friedrich.lindenberg at okfn.org
Wed Aug 1 19:20:35 UTC 2012


On Wed, Aug 1, 2012 at 11:45 AM, Thomas Kluyver <takowl at gmail.com> wrote:
> On 1 August 2012 08:55, Friedrich Lindenberg
> <friedrich.lindenberg at okfn.org> wrote:
>> Hm, I've deleted these clear duplicates, but we're still at 1.1k
>> unlinked entries, unfortunately.
>
> The entries remaining are much more what I'd expect to need human
> input for. However, I've skipped quite a few of the ones I've tried,
> either because it's hard to tell without some sample data, or because
> some things are ambiguous - e.g. some datasets have 'gross amount' and
> 'net amount', or amounts including and excluding VAT, and it's not
> clear which should be mapped to the 'Amount' column.

I've been using Gross amount when both are stated. Some sheets also
state the tax value explicitly without mentioning whether the amount
figure given is inclusive or exclusive of that amount, which is when I
give up.

- Fr.




More information about the okfn-labs mailing list