CLDR Personal Names Proposal
44th Internationalization and Unicode Conference (on-line), October 2020
Mike McKenna mgm.globalization(at)gmail.com
v.0.1
October 2020
CLDR: What’s in a Personal Name?
How are names used in applications
• “Hello! My name is: ….”
• Mail merge
• Human Resources and personnel data
• Onboarding and KYC
Common web standards
Maybe standards can help us?
• LDAP
• hCard
• HTML
. . .
Name: {
honorific-prefix,
given-name,
additional-name,
family-name,
honorific-suffix,
nickname
}
W3C HTML autofill fields
Problem with names
安倍 晋三
Jane Doe
A
AZ
Elizabeth == Liz ?
Call me “Boomer”
• First name last?
• Mr, Mrs, Ms, Mx?
• Which initials?
• How many initials?
• Middle name?
• Legal name?
• Informal name?
• Nickname?
Real-world names - length
Let’s look at some real names
European names:
Mr. Jonas Edward Salk, M.D.
Jonas E. Salk
Jonas Salk
Mr. Salk
Jonas
JS
Length: Which fields to use?
• Full
• Medium
• Short
• Initials
Real-world names – how many names
Let’s look at some real names
Spanish names:
Señor Diego María Rivera Barrientos
Diego Rivera Barrientos
Diego Rivera
Diego | Panzón
DR
Length:
• TWO surnames: (usually) father’s, then mother’s
Real-world names – order of fields
Let’s look at some real names
Japanese names:
安倍晋三様
安倍晋三
晋三
安倍
His Excellency Shinzo ABE
Shinzo ABE
ABE
Shinzo
What order to use?
• Japanese script: family name, given name
• Latin script: given name, family name
Real-world names - formality
Let’s look at some real names
Japanese
松居慶子
松居様
松居先生
松居さん
慶子ちゃん
Keiko Matsui
Matsui-sama
Matsui-sensei
Matsui-san
Keiko-chan
German
Herr Dr. Albert Schweitzer, MD
Herr Dr. Schweitzer
Dr. Schweitzer
Albert
[Figure: axes of length and formality, running from more formal to less formal]
Name usage
Nominative
en-GB: Robert Jones
ru-RU: Александр Исаевич Солженицын
ja-JP: 松居慶子
ro-RO: Petre Țuțea
Sorting
• surname, given
• given, surname
Hello <name>!
[Figure: three context axes: length, formality, usage]
Vocative / case
en-GB: Hi Bob!
ru-RU: Привет Саша!
ja-JP: 松居さん こんにちは!
ro-RO: Salut domnule Țuțea!
Mononyms – only one name
• Indonesia
• Tamil Nadu
• Celebrities
“Full name”
Lineage names – Father, Mother
• Patronymic – ex: ru_RU: <given> <patronymic> <surname>
hisname father-ovich familyname
Владимир Антонович Иванов
• Matronymic names
• Iceland – hisname father-son
Jón Einarsson
hername mother-dóttir
Guðrún Mínervudóttir
Ancestral naming
• Example Saudi Arabia
Farah bint Fadi bin Sajid Al-Tikriti
فرح بنت فادي بن ساجد التكريتي
More standards Industry
• LDAP
• hCard
• HTML
. . .
• OASIS xNL
• UPU S42
• Apple
• Google
• Microsoft
• Facebook
. . .
• Trulioo
• Workday
more complex
Status of current standards
• Give you a “standard” envelope to move data
• No information on:
– What order fields are in for each region and language
– What to choose for different lengths
– Formality presentation forms
– Usage, sorting information
CLDR – Proposal
• Fields
– Concise set of standard fields
– Flexible to/from W3C
• Three axes for context
– Length
– Formality
– Usage
• Syntax & Structure
• Skeletons for most uses
• Fall-back mechanism
• Use locales to dictate script-specific behavior
• Does NOT specify
– Field validation
– Linguistic inflections
CLDR – Proposal: Fields
• prefix: Mr., Ms., Mx., Dr., Prof., …
• given: first name, given name
• middle: middle name(s), patronymic
• surname: last name, family name
• surname-extended: second last name, ancestral name
• suffix: Jr, III, MD, PhD, …
• nickname: Bob, Jack, Venki, Misha, …
• monogram-override: JR, LBJ, MGM, …
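As a rough illustration, the field list above could be carried in application code as a simple record. This is only a sketch: the class name `PersonName` and the `field` helper are hypothetical and not part of the proposal; only the field names come from the slide.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class PersonName:
    """Hypothetical container for the proposed CLDR name fields."""
    prefix: Optional[str] = None             # Mr., Ms., Mx., Dr., Prof.
    given: Optional[str] = None              # first name, given name
    middle: Optional[str] = None             # middle name(s), patronymic
    surname: Optional[str] = None            # last name, family name
    surname_extended: Optional[str] = None   # second last name, ancestral name
    suffix: Optional[str] = None             # Jr, III, MD, PhD
    nickname: Optional[str] = None
    monogram_override: Optional[str] = None  # e.g. LBJ

    def field(self, name: str) -> Optional[str]:
        """Look up a field by its proposal name, e.g. 'surname-extended'."""
        return getattr(self, name.replace("-", "_"), None)


# Names taken from the earlier slides.
salk = PersonName(prefix="Mr.", given="Jonas", middle="Edward",
                  surname="Salk", suffix="M.D.")
rivera = PersonName(given="Diego", middle="María", surname="Rivera",
                    surname_extended="Barrientos", nickname="Panzón")
```

Every field is optional here so that mononyms and names without a prefix or suffix fit the same record.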
CLDR – Proposal: Name Attributes
Context-Axis
• Length
– full
– long
– medium
– short
– monogram
– monogram-narrow
• Formality
– informal
– formal
• Usage
– nominative
– vocative
– sorting
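The three axes above can be modelled as enumerations whose combination selects a layout. The sketch below is illustrative only: the class and function names are hypothetical, and the ordering of the key as (length, usage, style) simply mirrors the attribute names used in the LDML skeletons of this proposal, where formality is written as `style`.

```python
from enum import Enum
from itertools import product


class Length(Enum):
    FULL = "full"
    LONG = "long"
    MEDIUM = "medium"
    SHORT = "short"
    MONOGRAM = "monogram"
    MONOGRAM_NARROW = "monogram-narrow"


class Formality(Enum):
    INFORMAL = "informal"
    FORMAL = "formal"


class Usage(Enum):
    NOMINATIVE = "nominative"
    VOCATIVE = "vocative"
    SORTING = "sorting"


def context_key(length, formality, usage):
    """Return the (length, usage, style) triple that identifies one layout."""
    return (length.value, usage.value, formality.value)


# Every combination of the three axes: 6 lengths x 2 formalities x 3 usages.
ALL_CONTEXTS = [context_key(l, f, u)
                for l, f, u in product(Length, Formality, Usage)]
```

A locale would not necessarily define a layout for each of the 36 combinations, which is why the proposal also includes a fall-back mechanism.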
CLDR – Proposal: LDML Structure
• LDML = Locale Data Markup Language (UTS #35)
• Structure
– Default layouts by context
• Skeletons
<personNames>
<personName length="full" usage="nominative" style="formal">
{prefix} {given} {middle} {surname} {suffix}
</personName>
…
CLDR – Proposal: LDML Structure
<personNames>
<personName length="full" usage="nominative" style="formal">{prefix} {given} {middle} {surname} {suffix}</personName>
<personName length="long" usage="nominative" style="formal">{prefix} {given} {middle-initial} {surname} {suffix}</personName>
<personName length="medium" usage="nominative" style="formal">{prefix} {given} {surname} {suffix}</personName>
<personName length="short" usage="nominative" style="formal">{prefix} {surname} {suffix}</personName>
<personName length="monogram" usage="nominative" style="formal">{given-initial}{middle-initial}{surname-initial}</personName>
<personName length="monogram-narrow" usage="nominative" style="formal">{surname-initial}</personName>
<personName length="long" usage="nominative" style="informal">{given} {middle-initial} {surname}</personName>
<personName length="medium" usage="nominative" style="informal">{given} ({nickname}) {surname}</personName>
<personName length="short" usage="nominative" style="informal">{given}</personName>
<personName length="monogram" usage="nominative" style="informal">{given-initial}{surname-initial}</personName>
<personName length="monogram-narrow" usage="nominative" style="informal">{given-initial}</personName>
<personName length="long" usage="sorting" style="informal">{surname}, {given} {middle}</personName>
...
<personName length="short" usage="vocative" style="informal">{nickname}</personName>
</personNames>
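To make the skeleton idea concrete, here is a minimal sketch that loads a few of the skeletons above and fills them from a dictionary of name fields. The helper names (`load_skeletons`, `fill`) are hypothetical. Two behaviors are assumptions of this sketch rather than anything the slides specify: a `-initial` placeholder yields the first character with no period, and missing fields are dropped with leftover spaces collapsed.

```python
import re
import xml.etree.ElementTree as ET

# A subset of the skeletons shown on the slide.
LDML = """
<personNames>
  <personName length="full" usage="nominative" style="formal">{prefix} {given} {middle} {surname} {suffix}</personName>
  <personName length="long" usage="nominative" style="formal">{prefix} {given} {middle-initial} {surname} {suffix}</personName>
  <personName length="monogram" usage="nominative" style="formal">{given-initial}{middle-initial}{surname-initial}</personName>
  <personName length="long" usage="sorting" style="informal">{surname}, {given} {middle}</personName>
</personNames>
"""

PLACEHOLDER = re.compile(r"\{([a-z-]+)\}")


def load_skeletons(xml_text):
    """Index each skeleton by its (length, usage, style) attributes."""
    root = ET.fromstring(xml_text.strip())
    return {
        (el.get("length"), el.get("usage"), el.get("style")): el.text
        for el in root.iter("personName")
    }


def fill(skeleton, name):
    """Substitute name fields into a skeleton string."""
    def substitute(match):
        key = match.group(1)
        if key.endswith("-initial"):
            # Assumption: an initial is the first character, with no period.
            return name.get(key[: -len("-initial")], "")[:1]
        return name.get(key, "")

    # Assumption: missing fields become empty; leftover spaces are collapsed.
    return " ".join(PLACEHOLDER.sub(substitute, skeleton).split())


skeletons = load_skeletons(LDML)
salk = {"prefix": "Mr.", "given": "Jonas", "middle": "Edward",
        "surname": "Salk", "suffix": "M.D."}

print(fill(skeletons[("full", "nominative", "formal")], salk))      # Mr. Jonas Edward Salk M.D.
print(fill(skeletons[("monogram", "nominative", "formal")], salk))  # JES
```

Punctuation that sits next to a missing field (for example the comma in the sorting skeleton when there is no surname) is not cleaned up here; a real implementation would need rules for that.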
CLDR – Proposal: Fall back
What if a skeleton can’t be found?
• length falls back to medium
full → long → medium ← short ← monogram
• usage falls back to nominative
vocative → nominative ← sorting
• style falls back to informal
<personName length="medium" usage="nominative" style="informal">
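The fall-back rules above can be sketched as a small lookup. The function names are hypothetical, and two points are assumptions of this sketch because the slide does not state them: `monogram-narrow` is taken to fall back to `monogram`, and length is relaxed first, then usage, then style.

```python
from itertools import product

# Fall-back steps from the slide; "monogram-narrow" is an assumption.
LENGTH_FALLBACK = {
    "full": "long",
    "long": "medium",
    "monogram-narrow": "monogram",
    "monogram": "short",
    "short": "medium",
}
USAGE_FALLBACK = {"vocative": "nominative", "sorting": "nominative"}
STYLE_FALLBACK = {"formal": "informal"}


def chain(value, fallback):
    """Return the requested value followed by its successive fall-backs."""
    seen = [value]
    while seen[-1] in fallback:
        seen.append(fallback[seen[-1]])
    return seen


def resolve(skeletons, length, usage, style):
    """Find the closest available skeleton, or None if nothing matches.

    Assumed order: relax length first, then usage, then style.
    """
    candidates = product(chain(style, STYLE_FALLBACK),
                         chain(usage, USAGE_FALLBACK),
                         chain(length, LENGTH_FALLBACK))
    for s, u, l in candidates:  # the last axis (length) varies fastest
        if (l, u, s) in skeletons:
            return skeletons[(l, u, s)]
    return None


available = {
    ("medium", "nominative", "informal"): "{given} {surname}",
    ("short", "nominative", "formal"): "{prefix} {surname}",
}
print(resolve(available, "full", "vocative", "formal"))  # {given} {surname}
```

With this ordering, a request always ends at the medium, nominative, informal skeleton if nothing closer is defined, which matches the final fall-back shown on the slide.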
CLDR – Proposal: phonetics and name types
Japanese expects names in several forms:
• Japanese Kanji and kana
• Phonetic kana
• Romaji (Latin) “international”
…
Styles by language and script
CLDR – use case Names in Person object
personNames
Kanji
Kana (phonetic)
Romaji (Latin)
CLDR – personName: Current status
• Preparing for submission to Unicode Consortium
• Hope to submit PRI for review during CLDR v39 cycle
• Need to finalize use cases and edge cases
More to consider…
• Name types: birth, legal, preferred, alias, aka, cc, phonetic, …
• Casing: e.g. “ABE Shinzo”
• Generation vs honorific: e.g. “J. Jones Jr.” vs “J. Jones Jr. PhD”
• Name prefixes: e.g. “van de”, “de la”, “bin”, “bint”
van der Poel, de la Salle, bin Abdallah
• Joint names: e.g. “Sue and John Jones JTWROS”
• Arabic ancestral names?
given middle surname-extended surname
given father grandfather family
?
Thank you!
CLDR Personal Name - Core Team
Mark Davis @ Google
Peter Edberg @ Apple
Rich Gillam @ Apple
Alex Kolisnychenko @ Microsoft
Kristi Lee @ Microsoft
Mike McKenna @ PayPal
<personName length="long" usage="sorting" style="informal">
{surname}, {given} {middle}
</personName>
CLDR – Name Fields compared
Standard | Prefix | Given name | Middle name | Surname | Second surname | Suffix | Nickname | Monogram | Other
CLDR | prefix | given | middle | surname | surname-extended | suffix | nickname | monogram-override | -
LDAP person obj | title | givenName | givenName {1,n} | sn, surname | - | generationQualifier {0,n} | - | initials | initials, cn, commonName
hCard | honorific-prefix | given-name | additional-name | family-name | - | honorific-suffix | nickname | - | fn, “abbr” attribute
HTML | honorific-prefix | given-name, cc-given-name | additional-name, cc-additional-name | family-name, cc-family-name | - | honorific-suffix | nickname | - | name, cc-name
OASIS xNL | PrecedingTitle {0,n}, Title {0,n} | FirstName {0,n} | MiddleName {0,n} | NamePrefix, LastName | LastName {1,n} | GenerationIdentifier {0,n}, Suffix {0,n}, GeneralSuffix | Othername {0,n} | - | NameLine, Alias
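The HTML row of the comparison above suggests a direct mapping between the HTML autofill field names and the proposed CLDR fields. The sketch below illustrates that round trip; the function names are hypothetical, and the handling of CLDR fields with no HTML counterpart (reporting them as dropped) is an assumption of this example, not something the proposal defines.

```python
# HTML autofill field name -> proposed CLDR field, taken from the table row.
HTML_TO_CLDR = {
    "honorific-prefix": "prefix",
    "given-name": "given",
    "additional-name": "middle",
    "family-name": "surname",
    "honorific-suffix": "suffix",
    "nickname": "nickname",
}
CLDR_TO_HTML = {cldr: html for html, cldr in HTML_TO_CLDR.items()}


def from_autofill(form):
    """Convert submitted autofill values into CLDR-style fields."""
    return {HTML_TO_CLDR[k]: v for k, v in form.items()
            if k in HTML_TO_CLDR and v}


def to_autofill(name):
    """Convert CLDR-style fields to autofill names.

    Returns the converted form plus the CLDR fields that have no HTML
    equivalent (an assumption: they are reported rather than silently lost).
    """
    form = {CLDR_TO_HTML[k]: v for k, v in name.items()
            if k in CLDR_TO_HTML and v}
    dropped = sorted(k for k, v in name.items()
                     if k not in CLDR_TO_HTML and v)
    return form, dropped


rivera = {"given": "Diego", "surname": "Rivera",
          "surname-extended": "Barrientos"}
form, dropped = to_autofill(rivera)
print(form)     # {'given-name': 'Diego', 'family-name': 'Rivera'}
print(dropped)  # ['surname-extended']
```

The second surname has no slot in the HTML autofill vocabulary, which is one of the gaps the comparison table points out.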
CLDR – Name Fields compared
Standard | Prefix | Given name | Middle name | Surname | Second surname | Suffix | Nickname | Monogram | Other
CLDR | prefix | given | middle | surname | surname-extended | suffix | nickname | monogram-override | -
OECD | PrecedingTitle, Title {0,n} | FirstName * | MiddleName {0,n} | NamePrefix, LastName * | - | GenerationIdentifier {0,n}, Suffix {0,n}, GeneralSuffix | - | - | PersonType
UPU S42 | form of address, prec qual | given name pt 1 | given name pt 2, given name pt 3, interm qual | surname prefix, surname [pt 1] | surname [pt 2] | name qualifier, qualification | - | - | patronymic
ADIS | honorific | - | middle name, 2nd middle name | last name prefix, last name | - | suffix, generation, post honorific | preferred name | - | full name
Google person | honorificPrefix, phoneticHonorificPrefix | givenName, phoneticGivenName | middleName, phoneticMiddleName | familyName, phoneticFamilyName | - | honorificSuffix, phoneticHonorificSuffix | Nickname: DEFAULT (nickname) or ALTERNATE_NAME | - | displayName, displayNameLastFirst, UnstructuredName, phoneticFullName
Facebook PersonInfo | - | first_name | middle_name | last_name | - | - | - | - | name, name_format
CLDR – Name Fields compared
Standard | Prefix | Given name | Middle name | Surname | Second surname | Suffix | Nickname | Monogram | Other
CLDR | prefix | given | middle | surname | surname-extended | suffix | nickname | monogram-override | -
Apple Contact | namePrefix | givenName, phoneticGivenName | middleName, phoneticMiddleName | familyName, phoneticFamilyName | - | nameSuffix | nickname | - | previousFamilyName
Microsoft personal contact | title | givenName, yomiGivenName | middleName | surname, yomiSurname | - | generation | nickName | initials | spouseName
Trulioo | - | FirstGivenName | MiddleName | FirstSurname | SecondSurname | - | - | - | ISOLatin1Name, AdditionalFields.FullName
Workday | Prefix_Data {Title, Salutation} | First_Name | Middle_Name | Last_Name | Secondary_Last_Name, Tertiary_Last_Name | Suffix_Data {Social, Academic, Hereditary, Honorary, Professional, Religious, Royal} | - | - | Local_Name_Detail_Data
References - standards
Standards
• LDAP – Lightweight Directory Access Protocol + RFC4519
https://www.iana.org/assignments/ldap-parameters/ldap-parameters.xhtml
https://docs.ldap.com/specs/rfc4519.txt
ITU X.520 https://www.itu.int/itu-t/recommendations/rec.aspx?rec=X.520
• hCard
http://microformats.org/wiki/hcard
• W3C HTML autofill fields
https://www.w3.org/TR/html52/sec-forms.html#autofill-field
• OASIS xNL
http://docs.oasis-open.org/ciq/v3.0/cs02/specs/ciq-specs-v3-cs2.html#_Toc213384944
• UPU S42, ADIS, ISO 19160
http://xml.coverpages.org/ISO-FocusPlus-AddressingStandards-2010-06.pdf
http://xml.coverpages.org/namesAndAddresses.html
http://xml.coverpages.org/ADIS-Address-2001-1.pdf
• Unicode Technical Standard #35 Unicode Locale Data Markup Language (LDML)
http://unicode.org/reports/tr35/
References - regulatory
Regulatory Standards
• OECD Standard for automatic exchange of financial account information in tax matters
http://www.oecd.org/tax/exchange-of-tax-information/standard-for-automatic-exchange-of-financial-account-information-in-tax-matters-second-edition-9789264267992-en.htm
– OECD Common Reporting Standard XML Schema
http://www.oecd.org/tax/automatic-exchange/common-reporting-standard/schema-and-user-guide/#d.en.345315
• ICAO Travel Document Standard – Doc 9303
https://www.icao.int/publications/pages/publication.aspx?docnum=9303
• UK Deed Poll name change requirements
https://www.gov.uk/government/publications/change-of-name-guidance/use-and-change-of-names#grounds-for-refusing-to-change-a-name-on-a-home-office-issued-document
References - industry
Industry Standards
• Google Person / Name Object
https://developers.google.com/people/api/rest/v1/people#name
• Apple CNMutableContact person object
https://developer.apple.com/documentation/contacts/cnmutablecontact
• Facebook User Object
https://developers.facebook.com/docs/graph-api/reference/user/#fields
• Microsoft Personal Contact Graph REST API
https://docs.microsoft.com/en-us/graph/api/resources/contact?view=graph-rest-1.0#properties
• Trulioo Identity Schema
https://developer.trulioo.com/reference#identity-verification-verify
• Workday Person_Name_Detail_Data
https://community.workday.com/sites/default/files/file-hosting/productionapi/Human_Resources/v35.0/Change_Legal_Name.html#Person_Name_Detail_DataType
References - other
Other References
• Wikipedia, Personal Name
https://en.wikipedia.org/wiki/Personal_name
• Wikipedia, Naming Conventions
https://en.wikipedia.org/wiki/Wikipedia:Naming_conventions_(people)
• Wikipedia Manual of Style, Biographical names
https://en.wikipedia.org/wiki/Wikipedia:Manual_of_Style/Biography#Names
• Wikidata personal name object
https://www.wikidata.org/wiki/Q1071027
• Graham Rhind, Global Sourcebook for International Data Management
https://www.grcdi.nl/gsb/global%20sourcebook.html
• A Guide to Names and Naming Practices
https://www.fbiic.gov/public/2008/nov/Naming_practice_guide_UK_2006.pdf
• Law Enforcement Guide to International Names
https://info.publicintelligence.net/ROCICInternationalNames.pdf
• Pan-Data: Names
https://github.com/pan-i18n/pan-data/blob/master/specs/Names.md

Contenu connexe

Similaire à CLDR: What’s in a Personal Name?

Paul2 ecn 2012
Paul2 ecn 2012Paul2 ecn 2012
Paul2 ecn 2012
ECNOfficer
 
How it works internet networking icann53
How it works internet networking icann53How it works internet networking icann53
How it works internet networking icann53
ICANN
 

Similaire à CLDR: What’s in a Personal Name? (18)

Linked Library Data @ DNB
Linked Library Data @ DNBLinked Library Data @ DNB
Linked Library Data @ DNB
 
Question Answering - Application and Challenges
Question Answering - Application and ChallengesQuestion Answering - Application and Challenges
Question Answering - Application and Challenges
 
Language, Culture, and Software
Language, Culture, and SoftwareLanguage, Culture, and Software
Language, Culture, and Software
 
Linked Open Data and Applications
Linked Open Data and Applications Linked Open Data and Applications
Linked Open Data and Applications
 
Large-Scale Semantic Search
Large-Scale Semantic SearchLarge-Scale Semantic Search
Large-Scale Semantic Search
 
Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...
Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...
Learning Multilingual Semantic Parsers for Question Answering over Linked Dat...
 
Schema and Identity for Linked Data
Schema and Identity for Linked DataSchema and Identity for Linked Data
Schema and Identity for Linked Data
 
Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)Persistent identifiers for digitized specimens (2013)
Persistent identifiers for digitized specimens (2013)
 
Morphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil LanguageMorphological Analyzer and Generator for Tamil Language
Morphological Analyzer and Generator for Tamil Language
 
Building DBpedia Japanese and Linked Data Cloud in Japanese
Building DBpedia Japanese and Linked Data Cloud in JapaneseBuilding DBpedia Japanese and Linked Data Cloud in Japanese
Building DBpedia Japanese and Linked Data Cloud in Japanese
 
ORCID Status and Plans: Tokyo Outreach Meeting 2014
ORCID Status and Plans: Tokyo Outreach Meeting 2014ORCID Status and Plans: Tokyo Outreach Meeting 2014
ORCID Status and Plans: Tokyo Outreach Meeting 2014
 
Semantic Search
Semantic SearchSemantic Search
Semantic Search
 
Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations Beyond document retrieval using semantic annotations
Beyond document retrieval using semantic annotations
 
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
 
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...Online Paper Submission - 4th International Conference on NLP & Data Mining (...
Online Paper Submission - 4th International Conference on NLP & Data Mining (...
 
Paul2 ecn 2012
Paul2 ecn 2012Paul2 ecn 2012
Paul2 ecn 2012
 
semantic web & natural language
semantic web & natural languagesemantic web & natural language
semantic web & natural language
 
How it works internet networking icann53
How it works internet networking icann53How it works internet networking icann53
How it works internet networking icann53
 

Dernier

Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
UXDXConf
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
FIDO Alliance
 

Dernier (20)

Structuring Teams and Portfolios for Success
Structuring Teams and Portfolios for SuccessStructuring Teams and Portfolios for Success
Structuring Teams and Portfolios for Success
 
WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024WebRTC and SIP not just audio and video @ OpenSIPS 2024
WebRTC and SIP not just audio and video @ OpenSIPS 2024
 
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
FDO for Camera, Sensor and Networking Device – Commercial Solutions from VinC...
 
Event-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream ProcessingEvent-Driven Architecture Masterclass: Challenges in Stream Processing
Event-Driven Architecture Masterclass: Challenges in Stream Processing
 
Microsoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - QuestionnaireMicrosoft CSP Briefing Pre-Engagement - Questionnaire
Microsoft CSP Briefing Pre-Engagement - Questionnaire
 
How we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdfHow we scaled to 80K users by doing nothing!.pdf
How we scaled to 80K users by doing nothing!.pdf
 
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdfWhere to Learn More About FDO _ Richard at FIDO Alliance.pdf
Where to Learn More About FDO _ Richard at FIDO Alliance.pdf
 
AI mind or machine power point presentation
AI mind or machine power point presentationAI mind or machine power point presentation
AI mind or machine power point presentation
 
Working together SRE & Platform Engineering
Working together SRE & Platform EngineeringWorking together SRE & Platform Engineering
Working together SRE & Platform Engineering
 
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
Human Expert Website Manual WCAG 2.0 2.1 2.2 Audit - Digital Accessibility Au...
 
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
Secure Zero Touch enabled Edge compute with Dell NativeEdge via FDO _ Brad at...
 
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
TEST BANK For, Information Technology Project Management 9th Edition Kathy Sc...
 
Your enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4jYour enemies use GenAI too - staying ahead of fraud with Neo4j
Your enemies use GenAI too - staying ahead of fraud with Neo4j
 
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties ReimaginedEasier, Faster, and More Powerful – Notes Document Properties Reimagined
Easier, Faster, and More Powerful – Notes Document Properties Reimagined
 
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdfThe Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
The Value of Certifying Products for FDO _ Paul at FIDO Alliance.pdf
 
Google I/O Extended 2024 Warsaw
Google I/O Extended 2024 WarsawGoogle I/O Extended 2024 Warsaw
Google I/O Extended 2024 Warsaw
 
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdfSimplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
Simplified FDO Manufacturing Flow with TPMs _ Liam at Infineon.pdf
 
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
Event-Driven Architecture Masterclass: Engineering a Robust, High-performance...
 
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdfIntroduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
Introduction to FDO and How It works Applications _ Richard at FIDO Alliance.pdf
 
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider  Progress from Awareness to Implementation.pptxTales from a Passkey Provider  Progress from Awareness to Implementation.pptx
Tales from a Passkey Provider Progress from Awareness to Implementation.pptx
 

CLDR: What’s in a Personal Name?

  • 1. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Yahoo! Confidential Mike McKenna mgm.globalization(at)gmail.com v.0.1 October 2020 CLDR: What’s in a Personal Name? Internationalization and Unicode Conference 44 – (on-line) – October 2020 2 Address Details How are names used in applications • “Hello! My name is: ….” • Mail merge • Human Resources and personnel data • Onboarding and KYC
  • 2. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 3 Address Details Common web standards Maybe standards can help us? • LDAP • hCard • HTML . . . Name: { honorific-prefix, given-name, additional-name, family-name, honorific-suffix, nickname } W3 HTML autofill fields Internationalization and Unicode Conference 44 – (on-line) – October 2020 4 Address Details Problem with names 安倍 晋三 Jane Doe A AZ Elizabeth == Liz ? Call me “Boomer” • First name last? • Mr, Mrs, Ms, Mx? • Which initials? • How many initials? • Middle name? • Legal name? • Informal name? • Nickname?
  • 3. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 5 Address Details Real word names - length Let’s look at some real names European names: Mr. Jonas Edward Salk, M.D. Jonas E. Salk Jonas Salk Mr. Salk Jonas JS Length: Which fields to use? • Full • Medium • Short • Initials length Internationalization and Unicode Conference 44 – (on-line) – October 2020 6 Address Details Real word names – how many names Let’s look at some real names Spanish names: Señor Diego María Rivera Barrientos Diego Rivera Barrientos Diego Rivera Diego | Panzòn DR Length: • TWO surnames (usually) father mother length
  • 4. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 7 Address Details Real word names – order of fields Let’s look at some real names Japanese names: 晋三安倍様 晋三安倍 晋三 安倍 His Excellency Shinzo ABE Shinzo ABE ABE Shinzo What order to use? family name given name given name family name length Internationalization and Unicode Conference 44 – (on-line) – October 2020 8 Address Details Real word names - formality Let’s look at some real names Japanese 松居慶子 松居様 松居先生 松居さん 慶子ちゃん Keiko Matsui Matsui-sama Matsui-sensei Matsui-san Keiko-chan Herr Dr. Albert Schweitzer, MD Herr Dr. Schweitzer Dr. Schweitzer Albert length formality less formal more formal German
  • 5. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 9 Address Details Name usage Nominative en-GB: Robert Jones ru-RU: Александр Исаевич Солженицын ja-JP: 松居慶子 ro-RO: Petre Țuțea Sorting • surname, given • given, surname Hello <name>! length formality u s a g e Vocative / case en-GB: Hi Bob! ru-RU: Привет Саша! ja-JP: 松居さん こんにちは! ro-RO: Salut domnule Țuțea! Internationalization and Unicode Conference 44 – (on-line) – October 2020 10 Address Details Mononyms – only one name • Indonesia • Tamil Nadu • Celebrities “Full name”
  • 6. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 11 Address Details Lineage names – Father, Mother • Patronymic – ex: ru_RU: <given> <patronymic> <surname> hisname father-ovich familyname Владимир Антонович Иванов • Matronymic names • Iceland – hisname father-son Jón Einarsson hername mother-dóttir Guðrún Mínervudóttir Internationalization and Unicode Conference 44 – (on-line) – October 2020 12 Address Details Ancestral naming • Example Saudi Arabia Farah bint Fadi bin Sajid Al-Tikriti ‫ﻓ‬ ‫ر‬ ‫ح‬ ‫ﺑ‬ ‫ﻧ‬ ‫ت‬ ‫ﻓ‬ ‫ﺎ‬ ‫د‬ ‫ي‬ ‫ﺑ‬ ‫ن‬ ‫ﺳ‬ ‫ﺎ‬ ‫ﺟ‬ ‫د‬ ‫ا‬ ‫ﻟ‬ ‫ﺗ‬ ‫ﻛ‬ ‫ر‬ ‫ﯾ‬ ‫ﺗ‬ ‫ﻲ‬
  • 7. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 13 Address Details More standards Industry • LDAP • hCard • HTML . . . • OASIS xNL • UPU S42 • Apple • Google • Microsoft • Facebook . . . • Trulio • Workday more complex Internationalization and Unicode Conference 44 – (on-line) – October 2020 14 Address Details Status of current standards • Give you a ”standard” envelope to move data • No information: – What order fields are in for each region and language – What to choose for different lengths – Formality presentation forms – Usage, sorting information
  • 8. CLDR Personal Names Proposal 44th Internationalization and Unicode Conference ‹#› - (on-line) - October 2020 Internationalization and Unicode Conference 44 – (on-line) – October 2020 15 Address Details CLDR – Proposal • Fields – Concise set of standard fields – Flexible to/from W3C • Three axis for context – Length – Formality – Usage • Syntax & Structure • Skeletons for most uses • Fall-back mechanism • Use locales to dictate script- specific behavior • Does NOT specify – Field validation – Linguistic inflections Internationalization and Unicode Conference 44 – (on-line) – October 2020 16 Address Details CLDR – Proposal: Fields • prefix Mr., Ms., Mx., Dr., Prof., … • given first name, given name • middle middle name(s), patronymic • surname last name, family name • surname-extended second last name, ancestral name • suffix Jr, III, MD, PhD, … • nickname Bob, Jack, Venki, Misha, … • monogram-override JR, LBJ, MGM, …
  • 9. CLDR – Proposal: Name Attributes
Context axes
• Length
– full
– long
– medium
– short
– monogram
– monogram-narrow
• Formality
– informal
– formal
• Usage
– nominative
– vocative
– sorting
(Figure: three axes labeled length, formality, usage)
CLDR – Proposal: LDML Structure
• LDML = Locale Data Markup Language (UTS #35)
• Structure
– Default layouts by context
• Skeletons
<personNames>
  <personName length="full" usage="nominative" style="formal">
    {prefix} {given} {middle} {surname} {suffix}
  </personName>
  …
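A skeleton such as the full formal one above is a pattern with {field} placeholders. A minimal sketch of filling one follows. The function name fill_skeleton is hypothetical, and the handling of missing fields (drop the placeholder and collapse the leftover spaces) is an assumption made here; the slides do not say how gaps are treated.

```python
import re


def fill_skeleton(pattern: str, name: dict) -> str:
    """Substitute {field} placeholders; absent fields become empty (assumed behavior)."""
    filled = re.sub(
        r"\{([a-z-]+)\}",
        lambda m: name.get(m.group(1)) or "",
        pattern,
    )
    # Collapse the gaps left behind by missing fields.
    return re.sub(r"\s+", " ", filled).strip()


FULL_FORMAL = "{prefix} {given} {middle} {surname} {suffix}"

print(fill_skeleton(FULL_FORMAL, {
    "prefix": "Mr.", "given": "Jonas", "middle": "Edward", "surname": "Salk",
}))
# prints "Mr. Jonas Edward Salk"
```

Punctuation inside a skeleton, such as parentheses around a missing nickname, would need extra cleanup rules that this sketch does not attempt.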
  • 10. CLDR – Proposal: LDML Structure
<personNames>
  <personName length="full" usage="nominative" style="formal">{prefix} {given} {middle} {surname} {suffix}</personName>
  <personName length="long" usage="nominative" style="formal">{prefix} {given} {middle-initial} {surname} {suffix}</personName>
  <personName length="medium" usage="nominative" style="formal">{prefix} {given} {surname} {suffix}</personName>
  <personName length="short" usage="nominative" style="formal">{prefix} {surname} {suffix}</personName>
  <personName length="monogram" usage="nominative" style="formal">{given-initial}{middle-initial}{surname-initial}</personName>
  <personName length="monogram-narrow" usage="nominative" style="formal">{surname-initial}</personName>
  <personName length="long" usage="nominative" style="informal">{given} {middle-initial} {surname}</personName>
  <personName length="medium" usage="nominative" style="informal">{given} ({nickname}) {surname}</personName>
  <personName length="short" usage="nominative" style="informal">{given}</personName>
  <personName length="monogram" usage="nominative" style="informal">{given-initial}{surname-initial}</personName>
  <personName length="monogram-narrow" usage="nominative" style="informal">{given-initial}</personName>
  <personName length="long" usage="sorting" style="informal">{surname}, {given} {middle}</personName>
  ...
  <personName length="short" usage="vocative" style="informal">{nickname}</personName>
</personNames>
CLDR – Proposal: Fall back
What if a skeleton can’t be found?
• length falls back to medium: full → long → medium ← short ← monogram
• usage falls back to nominative: vocative → nominative ← sorting
• style falls back to informal
Final fallback: <personName length="medium" usage="nominative" style="informal">
(Figure: three axes labeled length, formality, usage)
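The fall-back rules can be sketched as a lookup that relaxes one axis at a time. The slide gives the chain for each axis but not the order in which the axes are relaxed, so the nesting below (length first, then usage, then style) is an assumption, as are the names LENGTH_CHAIN, candidates and resolve. The monogram-narrow length is not on the fall-back slide; here it simply falls through to medium.

```python
from typing import Dict, Iterator, Optional, Tuple

Key = Tuple[str, str, str]  # (length, usage, style)

# Chains read off the slide: full → long → medium ← short ← monogram.
LENGTH_CHAIN = {
    "full": ["full", "long", "medium"],
    "long": ["long", "medium"],
    "medium": ["medium"],
    "short": ["short", "medium"],
    "monogram": ["monogram", "short", "medium"],
}


def candidates(length: str, usage: str, style: str) -> Iterator[Key]:
    """Yield lookup keys from most to least specific (axis order is assumed)."""
    styles = [style] if style == "informal" else [style, "informal"]
    usages = [usage] if usage == "nominative" else [usage, "nominative"]
    lengths = LENGTH_CHAIN.get(length, [length, "medium"])
    for s in styles:
        for u in usages:
            for l in lengths:
                yield (l, u, s)


def resolve(skeletons: Dict[Key, str], length: str, usage: str, style: str) -> Optional[str]:
    """Return the first skeleton found along the fall-back path, or None."""
    for key in candidates(length, usage, style):
        if key in skeletons:
            return skeletons[key]
    return None


# Three informal skeletons copied from the LDML Structure slide.
SKELETONS = {
    ("medium", "nominative", "informal"): "{given} ({nickname}) {surname}",
    ("long", "sorting", "informal"): "{surname}, {given} {middle}",
    ("short", "vocative", "informal"): "{nickname}",
}

print(resolve(SKELETONS, "full", "sorting", "formal"))
# prints "{surname}, {given} {middle}"
```

With only these three skeletons loaded, a request for the full, formal, nominative form ends at the medium, nominative, informal entry, which is the final fallback named on the slide.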
  • 11. CLDR – Proposal: phonetics and name types
Japanese expects names:
• Japanese Kanji and kana
• Phonetic kana
• Romaji (Latin) "international"
…
Styles by language and script
CLDR – use case
Names in Person object
personNames
• Kanji
• Kana (phonetic)
• Romaji (Latin)
  • 12. CLDR – personName: Current status
• Preparing for submission to Unicode Consortium
• Hope to submit PRI (Public Review Issue) for review during CLDR v39 cycle
• Need to finalize use cases and edge cases
More to consider…
• Name types: birth, legal, preferred, alias, aka, cc, phonetic, …
• Casing: e.g. "ABE Shinzo"
• Generation vs honorific: e.g. "J. Jones Jr." vs "J. Jones Jr. PhD"
• Name prefixes: e.g. "van de", "de la", "bin", "bint"
  van der Poel, de la Salle, bin Abdallah
• Joint names: e.g. "Sue and John Jones JTWROS"
• Arabic ancestral names?
  given middle surname-extended surname
  given father grandfather family ?
  • 13. Thank you!
CLDR Personal Name - Core Team
Mark Davis @ Google
Peter Edberg @ Apple
Rich Gillam @ Apple
Alex Kolisnychenko @ Microsoft
Kristi Lee @ Microsoft
Mike McKenna @ PayPal
<personName length="long" usage="sorting" style="informal">
  {surname}, {given} {middle}
</personName>
CLDR – Name Fields compared
Columns: Standard, Prefix, Given name, Middle name, Surname, Second surname, Suffix, Nickname, Monogram, Other
CLDR: prefix given middle surname surname-extended suffix nickname monogram-override
LDAP person obj: title givenName givenName {1,n} sn, surname generationQualifier {0,n} initials initials, cn, commonName
hCard: honorific-prefix given-name additional-name family-name honorific-suffix nickname fn, "abbr" attribute
HTML: honorific-prefix given-name, cc-given-name additional-name, cc-additional-name family-name, cc-family-name honorific-suffix nickname name, cc-name
OASIS xNL: PrecedingTitle {0,n} Title {0,n} FirstName {0,n} MiddleName {0,n} NamePrefix LastName LastName {1,n} GenerationIdentifier {0,n} Suffix {0,n} GeneralSuffix Othername {0,n} NameLine Alias
  • 14. CLDR – Name Fields compared
Columns: Standard, Prefix, Given name, Middle name, Surname, Second surname, Suffix, Nickname, Monogram, Other
CLDR: prefix given middle surname surname-extended suffix nickname monogram-override
OECD: PrecedingTitle, Title{0,n} FirstName * MiddleName {0,n} NamePrefix, LastName * GenerationIdentifier {0,n} Suffix {0,n}, GeneralSuffix PersonType
UPU S42: form of address prec qual given name pt 1 given name pt 2 given name pt 3 interm qual surname prefix surname [pt 1] surname [pt 2] name qualifier qualification patronymic
ADIS: honorific middle name 2nd middle name last name prefix last name suffix generation post honorific preferred name full name
Google person: honorificPrefix phoneticHonorificPrefix givenName phoneticGivenName middleName phoneticMiddleName familyName phoneticFamilyName honorificSuffix phoneticHonorificSuffix Nickname DEFAULT (nickname) or ALTERNATE_NAME displayName displayNameLastFirst UnstructuredName phoneticFullName
Facebook PersonInfo: first_name middle_name last_name name name_format
Apple Contact: namePrefix givenName phoneticGivenName middleName phoneticMiddleName familyName phoneticFamilyName nameSuffix nickname previousFamilyName
Microsoft personal contact: title givenName yomiGivenName middleName surname yomiSurname generation nickName initials spouseName
Trulioo: FirstGivenName MiddleName FirstSurname SecondSurname ISOLatin1Name, AdditionalFields.FullName
Workday: Prefix_Data { Title, Salutation } First_Name Middle_Name Last_Name Secondary_Last_Name Tertiary_Last_Name Suffix_Data { Social, Academic, Hereditary, Honorary, Professional, Religious, Royal } Local_Name_Detail_Data
  • 15. References - standards
Standards
• LDAP – Lightweight Directory Access Protocol + RFC4519
  https://www.iana.org/assignments/ldap-parameters/ldap-parameters.xhtml
  https://docs.ldap.com/specs/rfc4519.txt
  ITU X.520 https://www.itu.int/itu-t/recommendations/rec.aspx?rec=X.520
• hCard
  http://microformats.org/wiki/hcard
• W3C HTML autofill fields
  https://www.w3.org/TR/html52/sec-forms.html#autofill-field
• OASIS xNL
  http://docs.oasis-open.org/ciq/v3.0/cs02/specs/ciq-specs-v3-cs2.html#_Toc213384944
• UPU S42, ADIS, ISO 19160
  http://xml.coverpages.org/ISO-FocusPlus-AddressingStandards-2010-06.pdf
  http://xml.coverpages.org/namesAndAddresses.html
  http://xml.coverpages.org/ADIS-Address-2001-1.pdf
• Unicode Technical Standard #35, Unicode Locale Data Markup Language (LDML)
  http://unicode.org/reports/tr35/
References - regulatory
Regulatory Standards
• OECD Standard for automatic exchange of financial account information in tax matters
  http://www.oecd.org/tax/exchange-of-tax-information/standard-for-automatic-exchange-of-financial-account-information-in-tax-matters-second-edition-9789264267992-en.htm
– OECD Common Reporting Standard XML Schema
  http://www.oecd.org/tax/automatic-exchange/common-reporting-standard/schema-and-user-guide/#d.en.345315
• ICAO Travel Document Standard – Doc 9303
  https://www.icao.int/publications/pages/publication.aspx?docnum=9303
• UK Deed Poll name change requirements
  https://www.gov.uk/government/publications/change-of-name-guidance/use-and-change-of-names#grounds-for-refusing-to-change-a-name-on-a-home-office-issued-document
  • 16. References - industry
Industry Standards
• Google Person / Name Object
  https://developers.google.com/people/api/rest/v1/people#name
• Apple CNMutableContact person object
  https://developer.apple.com/documentation/contacts/cnmutablecontact
• Facebook User Object
  https://developers.facebook.com/docs/graph-api/reference/user/#fields
• Microsoft Personal Contact Graph REST API
  https://docs.microsoft.com/en-us/graph/api/resources/contact?view=graph-rest-1.0#properties
• Trulioo Identity Schema
  https://developer.trulioo.com/reference#identity-verification-verify
• Workday Person_Name_Detail_Data
  https://community.workday.com/sites/default/files/file-hosting/productionapi/Human_Resources/v35.0/Change_Legal_Name.html#Person_Name_Detail_DataType
References - other
Other References
• Wikipedia, Personal Name
  https://en.wikipedia.org/wiki/Personal_name
• Wikipedia, Naming Conventions
  https://en.wikipedia.org/wiki/Wikipedia:Naming_conventions_(people)
• Wikipedia Manual of Style, Biographical names
  https://en.wikipedia.org/wiki/Wikipedia:Manual_of_Style/Biography#Names
• Wikidata personal name object
  https://www.wikidata.org/wiki/Q1071027
• Graham Rhind, Global Sourcebook for International Data Management
  https://www.grcdi.nl/gsb/global%20sourcebook.html
• A Guide to Names and Naming Practices
  https://www.fbiic.gov/public/2008/nov/Naming_practice_guide_UK_2006.pdf
• Law Enforcement Guide to International Names
  https://info.publicintelligence.net/ROCICInternationalNames.pdf
• Pan-Data: Names
  https://github.com/pan-i18n/pan-data/blob/master/specs/Names.md