Lingsoft Language Management Central
Language Management Central
● LS LMC is a centralised language management
  solution

● All of Lingsoft's and many third-party language
  solutions are offered through this platform
  ○ Spell checking, grammar checking, stylistic checking
  ○ Hyphenation
  ○ Inflective synonym lookup
  ○ Morphological and dependency analysis
  ○ Terminology management and look-up
  ○ Translations
  ○ Speech recognition and synthesis

● New services are created and expanded
  ○ New tools can be added to the platform

● Flexible and robust
Language Management Central
● Software as a Service model (SaaS)

● Each service exposes an interface for integration
  ○ REST/JSON
  ○ WebServices

● .NET server solution

● Successful integration in a variety of settings
  ○ Desktop publishing
  ○ Search engines
  ○ Online stores
  ○ Terminology management solutions
  ○ Natural language processing solutions
  ○ Translation services
  ○ Subtitling services
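
As a rough illustration of what a REST/JSON integration could look like from the client side, the sketch below prepares a JSON POST request without sending it. The base URL, the service path and the field names ("language", "text") are assumptions made up for this example; the actual interface is described in the API documentation available from Lingsoft.

```python
import json
from urllib.request import Request

# Hypothetical base URL and field names, invented for illustration only.
# The real LMC API may use different paths, fields and authentication.
BASE_URL = "https://lmc.example.invalid/api"


def build_request(service: str, language: str, text: str) -> Request:
    """Prepare (but do not send) a JSON POST request for one service."""
    payload = {"language": language, "text": text}
    body = json.dumps(payload, ensure_ascii=False).encode("utf-8")
    return Request(
        url=f"{BASE_URL}/{service}",
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


req = build_request("proofing", "fi", "Tämä on esimerkki.")
print(req.get_method(), req.full_url)
print(req.data.decode("utf-8"))
```

Keeping request construction separate from sending makes the integration easy to test offline and to point at a different address later.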
Proofing Service in LMC
● Several proofreading services available
   ○ Spell checking
   ○ Grammar checking
   ○ Stylistic checking
   ○ Hyphenation

● Available languages
   ○ Finnish
   ○ Swedish
   ○ Norwegian Bokmål
   ○ Norwegian Nynorsk
   ○ Danish
   ○ German
   ○ English (spelling)

● Lingsoft proofing tools have previously been integrated as the
  standard checkers in, for example, Microsoft Word
Proofing Service in LMC
● Domain-specific spell and style checkers
  ○ Specialised checkers optimised for specific language domains
  ○ Contain the standard language plus additional specialised vocabulary
    and grammar rules
  ○ Examples
     ■ Finnish medical
     ■ Finnish EU language
     ■ Finnish IT
     ■ Swedish Finlandisms

● Custom vocabulary and checker customisation
  ○ Lingsoft can also create checkers containing your vocabulary
Integrating the Proofing Service
Morphology Service in LMC
● Offers morphological and sentence analysis
  ○ Analyse words and return dictionary forms
      ■ crucial for processing of inflecting languages
      ■ example from English: left returns left (adverb), left
        (adjective) and leave (verb)
      ■ example from Finnish: kädessäni 'in my hand' finds käsi
        'hand'
  ○ If a word returns multiple dictionary forms, sentence
    disambiguation can identify the desired baseform based on
    sentence structure
      ■ example from English: left in "he left for school" returns leave
  ○ Generate word forms
      ■ example from Finnish: requesting the inessive singular form
        with possessive suffix of käsi 'hand' returns kädessäni
      ■ example from English: requesting past participle of leave
        returns left
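
The three operations above (analysis, disambiguation, generation) can be sketched with a toy lexicon. Everything here is an assumption for illustration: the function names, the tiny lookup tables and especially the disambiguation rule, which is a naive stand-in for real sentence analysis and not the method used by the Morphology Service.

```python
# Toy lexicon: surface form -> list of (base form, part of speech).
ANALYSES = {
    "left": [("left", "adverb"), ("left", "adjective"), ("leave", "verb")],
    "kädessäni": [("käsi", "noun")],
}

# Toy generation table: (base form, requested form) -> surface form.
GENERATED = {
    ("leave", "past participle"): "left",
    ("käsi", "inessive singular + possessive suffix"): "kädessäni",
}

SUBJECT_PRONOUNS = {"i", "you", "he", "she", "it", "we", "they"}


def analyse(word):
    """Return every known (base form, part of speech) reading of a word."""
    return ANALYSES.get(word.lower(), [])


def disambiguate(tokens, index):
    """Pick one reading with a deliberately naive context rule:
    prefer the verb reading directly after a subject pronoun."""
    readings = analyse(tokens[index])
    if not readings:
        return None
    previous = tokens[index - 1].lower() if index > 0 else ""
    if previous in SUBJECT_PRONOUNS:
        for reading in readings:
            if reading[1] == "verb":
                return reading
    return readings[0]


def generate(base_form, requested_form):
    """Return the requested word form, or None if the toy table lacks it."""
    return GENERATED.get((base_form, requested_form))


print(analyse("left"))
print(disambiguate("he left for school".split(), 1))
print(generate("leave", "past participle"))
```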
Morphology Service in LMC
● Available languages
  ○ Finnish
  ○ Swedish
  ○ Norwegian Bokmål
  ○ Norwegian Nynorsk
  ○ Danish
  ○ German
  ○ English
  ○ Russian
  ○ Swahili

● Applications include
   ○ search systems
   ○ document indexing
   ○ language learning
   ○ supporting tasks for other language processing
Search Systems and LMC
● Search expansion using Morphology Service
  ○ When searching for a word or term, inflected forms of the word
    can automatically be searched for as well
  ○ Can also find compound words
     ■ For example, a search for käsi 'hand' in Finnish can find
       käsikirja 'handbook/manual'
  ○ Greatly simplifies search queries and expands the results pool
     ■ for example in Finnish, where a single word may have
       thousands of different forms

● Search correction using Proofing Service
  ○ If a user misspells a search term, it is possible to offer "Did
    you mean?" functionality by offering suggestions from the
    proofing service
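
A minimal sketch of the two ideas on this slide, using invented toy data in place of the Morphology and Proofing Services. The function names and tables are hypothetical, and Python's difflib is used only as a stand-in for real spelling suggestions.

```python
import difflib

# Toy data invented for illustration; a real system would ask the
# Morphology Service for forms and the Proofing Service for suggestions.
INFLECTED_FORMS = {"käsi": ["käsi", "käden", "kädessä", "kädessäni"]}
COMPOUNDS = {"käsi": ["käsikirja"]}
VOCABULARY = ["käsi", "käsikirja", "kirja"]


def expand_query(term, include_compounds=True):
    """Return the term's known inflected forms, optionally with compounds."""
    forms = list(INFLECTED_FORMS.get(term, [term]))
    if include_compounds:
        forms += COMPOUNDS.get(term, [])
    return forms


def did_you_mean(term, limit=3):
    """Suggest close vocabulary entries for a term that is not recognised."""
    if term in VOCABULARY:
        return []
    return difflib.get_close_matches(term, VOCABULARY, n=limit, cutoff=0.6)


print(expand_query("käsi"))
print(did_you_mean("käsikirha"))
```

The expanded list can be OR-ed together in the search backend's query syntax, so the user still types a single word.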
Search Systems and LMC




Solution – custom library for Lucene programming

Lingsoft Custom Analyzer for Lucene
 ● A Java library for Lucene
 ● Calls LS LMC (address configured from settings)
 ● Package contains a Java library with a settings file and some example code
   (open source)
 ● A customer would program their own solution on top of this custom analyzer
 ● Lingsoft provides LS LMC services and consulting for searching and indexing
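
The general idea of an analyzer that reduces words to base forms at indexing time can be sketched as follows. This is a Python stand-in, not the Lucene API and not Lingsoft's Java library; the class, the function names and the toy lemma table are assumptions, and in a real setup the lemmatize step would call LS LMC.

```python
import re
from collections import defaultdict

# Toy lemma table standing in for a call to the morphology service.
TOY_LEMMAS = {"kädessäni": "käsi", "käden": "käsi"}


def toy_lemmatize(token):
    """Map a token to its base form, or return it unchanged if unknown."""
    return TOY_LEMMAS.get(token, token)


class LemmatizingAnalyzer:
    """Tokenise text and replace each token with its base form."""

    def __init__(self, lemmatize=toy_lemmatize):
        self.lemmatize = lemmatize

    def analyze(self, text):
        tokens = re.findall(r"\w+", text.lower())
        return [self.lemmatize(token) for token in tokens]


def build_index(docs, analyzer):
    """Build an inverted index: base form -> set of document ids."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in analyzer.analyze(text):
            index[term].add(doc_id)
    return index


analyzer = LemmatizingAnalyzer()
docs = {1: "Kirja on kädessäni", 2: "Käden jälki"}
index = build_index(docs, analyzer)
print(sorted(index["käsi"]))
```

Running the same analyzer over the query string means a search for any inflected form lands on the same index entry.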
Terminology Management and LMC
 ● LS LMC offers a fully fledged terminology
   management system
   ○ Administration and editing of customer terms and
     customer termbases
      ■ The entire term management workflow is supported
   ○ Term search and proposals
   ○ Multilingual termbases supported
   ○ Brand language management
      ■ Accepted and rejected terms
   ○ Several tools supported
      ■ TermWeb, Trados Multiterm...
      ■ More can be added
      ■ Standard format, TBX
Terminology Management and LMC
 ● Intelligent Term Search
   ○ Intelligent term search when using Lingsoft's language tools
     (search expansion)
   ○ Search targets
       ■ custom termbase
       ■ static termbase (for example MOT dictionaries)
       ■ public terminologies (e.g. WordNet)
       ■ Lingsoft language tools functioning as terminologies (for
         example synonym dictionaries)


 ● Term Highlighting
   ○ Recognizes terms in written text and highlights them
   ○ Can use the accepted/rejected division
   ○ Interface returns positional and other term-related information
   ○ Possibility to select target termbases
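
To make "positional and other term-related information" concrete, here is a small sketch that scans a text for termbase entries and reports character offsets together with each term's status. The termbase contents, the result fields and the function name are assumptions for illustration, not the format returned by LS LMC.

```python
import re

# Hypothetical termbase: term -> status in the accepted/rejected division.
TERMBASE = {"handbook": "accepted", "user guide": "rejected"}


def find_terms(text, termbase=TERMBASE):
    """Return term hits with character offsets, ordered by position."""
    hits = []
    for term, status in termbase.items():
        pattern = r"\b" + re.escape(term) + r"\b"
        for match in re.finditer(pattern, text, flags=re.IGNORECASE):
            hits.append({
                "term": term,
                "status": status,
                "start": match.start(),
                "end": match.end(),
            })
    return sorted(hits, key=lambda hit: hit["start"])


text = "See the handbook, not the user guide."
hits = find_terms(text)
for hit in hits:
    print(hit["status"], repr(text[hit["start"]:hit["end"]]))
```

An editor integration can use the offsets to underline each hit and the status to choose the highlight colour.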
Terminology Management and LMC
 ● Term Suggestion
   ○ If a term is unrecognized, the customer can propose it
   ○ Possibility to
       ■ select target termbase for the suggestion
       ■ add basic information about the suggestion
   ○ Terminology admin will see the suggestions (unprocessed terms)
   ○ If the administrator approves the term, LS LMC publishes it for
     use across the platform
 ● Other language tools
   ○ LMC can utilize termbases for other language tools
   ○ Examples
      ■ Accepted/rejected terms can be included in the proofreader,
        so they are also applied through the Proofing Interface
        (style info/errors)
      ■ Key terminology can be utilized when creating a search engine
        index
      ■ Terminologies can be used to create a fully customized
        spellchecker
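
The term suggestion workflow described on this slide (propose, review, publish) can be modelled as a small state machine. The class names, fields and status labels below are hypothetical and only illustrate the flow; they are not taken from the LS LMC API.

```python
from dataclasses import dataclass, field


@dataclass
class Suggestion:
    term: str
    termbase: str               # target termbase chosen by the customer
    note: str = ""              # basic information about the suggestion
    status: str = "unprocessed"


@dataclass
class TermWorkflow:
    queue: list = field(default_factory=list)
    published: dict = field(default_factory=dict)  # termbase -> set of terms

    def propose(self, term, termbase, note=""):
        """Customer proposes an unrecognized term for a target termbase."""
        suggestion = Suggestion(term, termbase, note)
        self.queue.append(suggestion)
        return suggestion

    def unprocessed(self):
        """Suggestions the terminology admin has not reviewed yet."""
        return [s for s in self.queue if s.status == "unprocessed"]

    def review(self, suggestion, approve):
        """Admin decision; approved terms are published to the termbase."""
        suggestion.status = "approved" if approve else "rejected"
        if approve:
            self.published.setdefault(suggestion.termbase, set()).add(suggestion.term)


workflow = TermWorkflow()
first = workflow.propose("käsikirja", "product-terms", note="manual")
second = workflow.propose("ohjekirja", "product-terms")
workflow.review(first, approve=True)
print(workflow.published, [s.term for s in workflow.unprocessed()])
```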
Integrations
● CKEditor




● MS Word 2007-2013
LS LMC

 ● Full API description and trial accounts available on
   request

 ● For more information, contact us at info@lingsoft.fi

 ● Visit us at www.lingsoft.fi

 ● Follow us at @Lingsoft
