2. Founder Bio
2
Elian CARSENAT, a computer scientist trained at ENSIIE/INRIA, started
his career at JP Morgan in Paris in 1997. He later worked as
consultant and managed business & IT projects in London, Paris,
Moscow and Shanghai.
In 2012, Elian created NamSor, a piece of sociolinguistics software to
mine the 'Big Data' and better understand international flows of
money, ideas and people. NamSor helps answer the perennial
question all countries ask about their diasporas – who are they,
where are they and what are they doing.
NamSor has been used to attract Foreign Direct Investments (FDI), to
build-up international collaboration within scientific communities, to
attract and facilitate Diaspora investment in Start-ups...
as well as other use cases.
http://fr.linkedin.com/in/eliancarsenat/en
3. NamSor sorts Names
3
Names are meaningful : we use sociolinguistics to extract their
semantics and deliver actionable intelligence.
Names reflect cultural Identity
NamSor data mining software
recognizes the linguistic or cultural
origin of names in any alphabet /
language, with fine grain and high
accuracy.
6. Diasporas in Science
(in collaboration with French INSERM)
6
Thomson Reuters WebOfScience (6 countries, 250k scientists, 50k papers)
“Analysts uncovered amazing patterns in the way scientists’ names correlate with whom they publish, and who
they cite in their papers - not just in case of a particular country, but globally. Tania Vichnevskaia of the French
National Institute for Health (INSERM) presented the paper ‘Applying onomastics to scientometrics‘ at IREG
International symposium 2015 organised by University of Maribor and Shanghai Jiao Tong University. The
paper was prepared jointly with NamSor, a private start-up company specialized in mapping international
Diasporas.”
Source: WoS; Data Mining: INSERM with NamSor
7. Scholar names in some Canadian Universities
Chinese, Indian, Iranian, Moroccan, Italian names
7
Canadian Science Policy Conference - CSPC2015
9. US Census vs NamSor geo-demographics
9
In July 2015, the US Government announced new
rules that will require all cities and towns receiving
federal housing funds to assess patterns of
segregation.
The NY Times has published interactive maps of
Boston geo-demographics, which we can compare
with the information inferred by NamSor
10. US Census Race Map of Boston
10
http://www.nytimes.com/interactive/2015/07/08/us/census-race-map.html
11. Using Voters List
US Census:
1pixel = 40 inhabitants
Voters List:
1 pixel = 1 voter
11
Source: Boston Voters List
Visualization : ESRI
Data Mining: NamSor+RapidMiner
12. Breaking down ‘White’ and ‘Asian’ into
Portuguese, Spanish, Italian, India, Pakistan, China, ...
12
Source: Boston Voters List
Visualization : ESRI
Data Mining: NamSor+RapidMiner
14. Who OWNS in Brooklyn, NY?
Inferring origin in NYC ACRIS (Real Estate OpenData)
14
> Brooklyn zip codes
>NamSororigins
15. Who OWNS in Brooklyn, NY?
Inferring origin in NYC ACRIS (Real Estate OpenData)
15
Interesting ‘Little’ spots
ZIP 11209 : Irish
ZIP 11219 : Jewish
ZIP 11233 : African American
ZIP 11228 : Italian
ZIP 11208 : Hispanic
ZIP 11214 : Chinese
ZIP 11235 : Ukrainian/Russian
ZIP 11416 : Indian
ZIP 11222 : Polish
22. Applications to an Airline’s customer intelligence
22
A global airline :
‘For 93% of our customers, when
NamSor recognizes an Indian
name, the client has travelled to
India in the past.’
Finer grain segmentation using
names brings insights about
diasporas travel pattern
visiting family and friends in
their home country, as well as
their specific needs.