The next problem is that apparently mysql can't index a VARCHAR(255). You get:
sqlalchemy.exc.OperationalError:
(pymysql.err.OperationalError) (1709, 'Index column size too large. The maximum column
size is 767 bytes.')
One solution I saw mentioned was to not index the string, but to compute a hash, store
that as a separate column, and index the hash. Which feels like I'm working for the
database instead of the other way around.
On Jan 25, 2023, at 3:52 PM, Daniel Zahn
<dzahn(a)wikimedia.org> wrote:
On Wed, Jan 25, 2023 at 12:33 PM Roy Smith <roy(a)panix.com> wrote:
Thanks. This field is intended to hold a wikipedia page title (including the namespace).
Would 255 be long enough to hold any legal title?
Looks like that should be the actual limit, yea.
"A pagename cannot exceed 255 bytes in length. Be aware that non-ASCII
characters may take up to four bytes in UTF-8 encoding, so the total
number of characters that can fit into a title may be less than 255."
found at
https://en.wikipedia.org/wiki/Wikipedia:Page_name#Technical_restrictions_an…
--
Daniel Zahn <dzahn(a)wikimedia.org>
Site Reliability Engineer
_______________________________________________
Cloud mailing list -- cloud(a)lists.wikimedia.org
List information:
https://lists.wikimedia.org/postorius/lists/cloud.lists.wikimedia.org/