Hi Max,

(not an expert, but to provide you with some feedback).

We have the same challenges with our Wikibase instance. We did a lot with the Quality Constraints extension (https://www.mediawiki.org/wiki/Extension:Wikibase_Quality_Extensions). This requires a learning curve and setup on your local system.  Your first case might actually still fit into the possibilities of your extensions. 

For some cases, we developed our own extension (done by Professional Wiki: https://www.wikibase.consulting/automating-values-in-wikibase/). Highly recommend looking into that tool of course.

You could also do a lot with SPARQL and regular expressions, although that is another learning experience. Your second question would be suitable for this I assume.

Finally, we are looking into ShapeExpressions, but that does not really help to check whether the values of your item confirm a predetermined structure.

So what I am trying to say, there are lots of possibilities to validate the data within your instance. Clearly define what you want to check provides better way to find which solution might fit your need.

Does this help?

Best,

Maarten


On Mon, Aug 1, 2022 at 8:54 PM Donald Ziff <daziff@berkeley.edu> wrote:
Hello All!

Could anyone one give us some guidance on when to use a bot as opposed to, perhaps, a script fired off from a client box via cron or other?

Context: this is the PhiloBiblon project, and we are using FactGrid.

The use case I have in mind is integrity checking for certain constraints that our legacy data platform enforced but that are not directly enforceable in our current setup.  For examples:
  • Detecting when our external identifier has an illegal value (which has sometimes happened and has been difficult to detect by eye)
  • Detecting when one of our objects (i.e. one that is marked with that external identifier) has an item-valued property that is filled with an object of the wrong type
I think it would give us some comfort to know that such integrity violations are detected. We would probably like to add more constraints as we go along. We might extend that code eventually to try to automatically fix such things, but for now, just detecting them would be great.

Does a bot make sense for that use case? 

Thanks,

Max Ziff 


_______________________________________________
Wikibaseug mailing list -- wikibaseug@lists.wikimedia.org
To unsubscribe send an email to wikibaseug-leave@lists.wikimedia.org


--
IP Squared | +31643053919 | @mzeinstra