How do users interact with search engines? What can we learn from their behavior? How can we make search engines better? How do we measure the quality of search results, and what are the key metrics? Do you even measure the quality of your search? Let's stand on the shoulders of giants like Google, Yahoo, and Yandex and look at recent advances in search research.
2. About me
● researcher and teacher at Slovak University of Technology in Bratislava
● developer @ synopsi.tv, searchd.co
● co-owner of minio, s.r.o.
○ otvorenezmluvy.sk, govdata.sk
3. Search
as seen by developers
{
  "query": {
    "query_string": {
      "query": "elasticsearch book"
    }
  }
}
return response.hits.hits
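A minimal sketch of the same call from Python, assuming the official elasticsearch-py client (7.x-style API) and a hypothetical index named books:

from elasticsearch import Elasticsearch

es = Elasticsearch()  # assumes a local node on the default port

response = es.search(
    index="books",  # hypothetical index name
    body={"query": {"query_string": {"query": "elasticsearch book"}}},
)

# To the developer, the search is done once the hits come back.
hits = response["hits"]["hits"]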
4. Search
as experienced by users
query: elasticsarch
Typo in query.
No results.
query: elasticsearch
Too many hits.
Not relevant.
query: elasticsearch book
Click!
Success! Or is it?
7. Accurately interpreting clickthrough
data as implicit feedback
Thorsten Joachims, Laura Granka, Bing Pan, Helene Hembrooke, and Geri Gay. Accurately interpreting clickthrough data as implicit feedback. In Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, SIGIR ’05, pages 154–161, New York, NY, USA, 2005. ACM.
9. Search quality metrics
● Mean Average Precision @ N
○ mean over queries of the average precision at each relevant result in the top N
● Mean Reciprocal Rank
○ mean over queries of 1 / rank of the first relevant result
● Normalized Discounted Cumulative Gain
○ graded relevance, discounted logarithmically by rank (see the sketch below)
● Expected Reciprocal Rank
○ cascade model: expected 1 / rank at which the user stops
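Hedged sketches of two of these metrics in Python; each query's results are given as graded relevance labels in ranked order (an assumption about the input format):

import math

def mean_reciprocal_rank(rankings):
    # MRR: mean over queries of 1 / rank of the first relevant result.
    total = 0.0
    for labels in rankings:
        for rank, rel in enumerate(labels, start=1):
            if rel > 0:
                total += 1.0 / rank
                break
    return total / len(rankings)

def ndcg_at_n(labels, n):
    # NDCG@N: DCG of the shown ranking divided by DCG of the ideal one.
    def dcg(ls):
        return sum((2 ** rel - 1) / math.log2(rank + 1)
                   for rank, rel in enumerate(ls, start=1))
    ideal = dcg(sorted(labels, reverse=True)[:n])
    return dcg(labels[:n]) / ideal if ideal else 0.0

print(mean_reciprocal_rank([[0, 1, 0], [1, 0, 0]]))  # (1/2 + 1) / 2 = 0.75
print(ndcg_at_n([0, 2, 1, 0], n=3))                  # ~0.66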
10. Search KPIs
● CTR trend
● # of queries w/o results or clicks
● # of searches per session
● Search engine latency
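A sketch of computing the first two KPIs from a hypothetical search log, one record per query with result and click counts (the field names and figures are assumptions, reusing the deck's example queries):

log = [
    {"query": "elasticsarch", "results": 0, "clicks": 0},
    {"query": "elasticsearch", "results": 4200, "clicks": 0},
    {"query": "elasticsearch book", "results": 12, "clicks": 1},
]

# CTR: share of queries that received at least one click.
ctr = sum(1 for q in log if q["clicks"] > 0) / len(log)

# Queries that returned nothing or were abandoned without a click.
dead_ends = sum(1 for q in log if q["results"] == 0 or q["clicks"] == 0)

print(f"CTR: {ctr:.0%}")                 # 33%
print(f"dead-end queries: {dead_ends}")  # 2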
12. Optimizing search engines using
clickthrough data
Thorsten Joachims. Optimizing search engines using clickthrough data. In
Proceedings of the eighth ACM SIGKDD international conference on
Knowledge discovery and data mining, KDD ’02, pages 133–142, New York,
NY, USA, 2002. ACM.
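The core idea of the paper, sketched: clicks yield relative preferences, e.g. a clicked result is preferred over higher-ranked results that were skipped, and these pairs become training constraints for a ranking SVM. The function below is illustrative, not the paper's code:

def preferences_from_clicks(ranking, clicked):
    # ranking: document ids in displayed order; clicked: set of clicked ids.
    # A clicked result is preferred over every skipped result shown above it.
    pairs = []
    for i, doc in enumerate(ranking):
        if doc in clicked:
            pairs.extend((doc, skipped) for skipped in ranking[:i]
                         if skipped not in clicked)
    return pairs

print(preferences_from_clicks(["d1", "d2", "d3"], clicked={"d3"}))
# [('d3', 'd1'), ('d3', 'd2')]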
14. Query chains: learning to rank from
implicit feedback
Filip Radlinski and Thorsten Joachims. Query chains: learning to rank from implicit feedback. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, KDD ’05, pages 239–248, New York, NY, USA, 2005. ACM.
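One preprocessing step query-chain methods rely on, sketched here: grouping a user's consecutive queries into chains when they fall within a time gap (the 30-second threshold is an assumption, not the paper's value):

from datetime import datetime, timedelta

def query_chains(events, gap=timedelta(seconds=30)):
    # events: (timestamp, query) tuples for one user, sorted by time.
    chains, current, last = [], [], None
    for ts, query in events:
        if last is not None and ts - last > gap:
            chains.append(current)
            current = []
        current.append(query)
        last = ts
    if current:
        chains.append(current)
    return chains

events = [
    (datetime(2013, 1, 1, 12, 0, 0), "elasticsarch"),
    (datetime(2013, 1, 1, 12, 0, 10), "elasticsearch"),
    (datetime(2013, 1, 1, 12, 0, 20), "elasticsearch book"),
    (datetime(2013, 1, 1, 14, 0, 0), "lucene"),
]
print(query_chains(events))
# [['elasticsarch', 'elasticsearch', 'elasticsearch book'], ['lucene']]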
15. Fighting Search Engine Amnesia:
Reranking Repeated Results
Milad Shokouhi, Ryen W. White, Paul Bennett, and Filip Radlinski. Fighting
search engine amnesia: reranking repeated results. In Proceedings of the
36th international ACM SIGIR conference on Research and development in
information retrieval, SIGIR ’13, pages 273–282, New York, NY, USA, 2013.
ACM.
From the abstract: “In this paper, we observed that the same results are often shown to users multiple times during search sessions. We showed that there are a number of effects at play, which can be leveraged to improve information retrieval performance. In particular, previously skipped results are much less likely to be clicked, and previously clicked results may or may not be re-clicked depending on other factors of the session.”
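A hedged sketch of the effect described above, not the paper's model: within a session, demote results the user has already seen and skipped (the 0.5 penalty is an arbitrary assumption):

def rerank_repeats(scored, skipped, clicked, skip_penalty=0.5):
    # scored: (doc_id, score) pairs; skipped/clicked: ids from earlier
    # queries in the same session. Previously skipped results rarely
    # attract clicks, so their scores are damped before re-sorting.
    def adjust(doc, score):
        if doc in skipped and doc not in clicked:
            return score * skip_penalty
        return score
    return sorted(((d, adjust(d, s)) for d, s in scored),
                  key=lambda pair: pair[1], reverse=True)

print(rerank_repeats([("d1", 0.9), ("d2", 0.8)], skipped={"d1"}, clicked=set()))
# [('d2', 0.8), ('d1', 0.45)]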
21. searchd.co
Search Analytics
● Identify and fix key search problems
● KPIs for site search
● Actionable tips for search tuning
● Easy setup
a. Add our hosted JavaScript
b. Annotate search results with HTML5 tags
c. Done.
● Currently in private beta
22. A bad search experience is a lost opportunity. Let's fix it.
searchd.co
Search Analytics
www.searchd.co
info@searchd.co