Opened 15 years ago

Closed 13 years ago

#880 closed defect (bug) (fixed)

Unicode group name gets broken

Reported by: dolugen
Owned by:
Milestone: 1.5
Priority: minor
Severity:
Version:
Component: Core
Keywords: needs-patch


When I create a group with a long name in Cyrillic (Unicode), the name very rarely gets broken and ends up like not-a-working-group-name%d (notice the end of the name). As a result, I can't access the group by clicking its link.

You can try to reproduce the bug with the name "Уран бичлэг сонирхогчид" ("Amateur Calligraphers" in Mongolian). Every other name I tried was fine.

Change History (7)

#1 @apeatling
15 years ago

  • Milestone changed from 1.0.4 to 1.1

Milestone 1.0.4 deleted

#2 @apeatling
15 years ago

  • Milestone changed from 1.1 to 1.2

#3 @apeatling
14 years ago

  • Keywords needs-patch added; cyrillic group slug unicode removed

Need i18n help on this one.

#4 @apeatling
14 years ago

  • Milestone changed from 1.2 to 1.2.1

#5 @erosaung
14 years ago

  • Component set to Core

Myanmar Unicode names also trigger this error.

#6 @boonebgorges
13 years ago

The problem seems to happen when the slug gets run through the chain sanitize_title() -> sanitize_title_with_dashes() -> utf8_uri_encode(). URI-encoding the non-Latin characters makes the slug very long: each two-byte UTF-8 character, such as a Cyrillic letter, becomes six characters (%xx%xx). sanitize_title_with_dashes() limits the length of the converted slug to 200 characters, but the 'slug' column in the BP groups table only allowed for 100 characters. As a result, the encoded slug could be cut off in the middle of a URI-encoded character, leaving an unreadable fragment at the end.

Allowing the slug column to accept 200 characters fixes the problem.
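The truncation can be reproduced outside WordPress with a rough sketch. This is Python, not the actual PHP code: `encode_slug` is a hypothetical stand-in for the sanitize_title() chain (lowercase, dashes for spaces, percent-encode the UTF-8 bytes), and `COLUMN_LIMIT` assumes the old 100-character column described above.

```python
from urllib.parse import quote, unquote

COLUMN_LIMIT = 100  # assumed old width of the groups 'slug' column


def encode_slug(name: str) -> str:
    """Hypothetical approximation of the slug encoding, not WordPress code."""
    dashed = name.lower().replace(" ", "-")
    # quote() percent-encodes the UTF-8 bytes with uppercase hex; lowercase it
    return quote(dashed, safe="-").lower()


name = "Уран бичлэг сонирхогчид"
slug = encode_slug(name)
stored = slug[:COLUMN_LIMIT]  # what a 100-character column would keep

print(len(slug))        # 128: 21 letters * 6 characters + 2 dashes
print(stored[-2:])      # %d  (the cut lands inside an encoded character)
print(unquote(stored) == name.lower().replace(" ", "-"))  # False
```

With this name the full encoded slug is 128 characters, which is under the 200-character cap but over 100, so the stored value ends in a dangling "%d", matching the broken link in the report.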

#7 @boonebgorges
13 years ago

  • Resolution set to fixed
  • Status changed from new to closed

(In [3677]) Allow group slugs to be up to 200 characters to account for uri-encoding of non-latin alphabets. Fixes #880
