Developing Document Image Retrieval System

•Télécharger en tant que PPTX, PDF•

2 j'aime•1,147 vues

A system was developed able to retrieve specific documents from a document collection. In this system the query is given in text by the user and then transformed into image. Appropriate features were in order to capture the general shape of the query, and ignore details due to noise or different fonts. In order to demonstrate the effectiveness of our system, we used a collection of noisy documents and we compared our results with those of a commercial OCR package.

Technologie Art & Photos

K. Zagoris, K. Ergina and N. Papamarkos
Image Processing and Multimedia Laboratory
Department of Electrical & Computer Engineering
Democritus University of Thrace,
67100 Xanthi, Greece

 Phenomenal growth of the size of multimedia data
and especially document images
 Caused by the easiness to create such images using
scanners or digital cameras
 Huge quantities of document images are created and
stored in image archives without having any indexing
information

Theoverall structureof the DocumentImage Retrieval System

 Binarization
(Otsu Technique)
 Original Document
 Median Filter

 Indentify all the Connected Components (CCs)
 Calculate the most common height of the
document CCs (CCch)
 Reject the CCs with height less than 70% of the
CCch. That only reject areas of punctuation
points and noise.
 Expand the left and right sides of the resulted
CCs by 20% of the CCch
 The words are the merged overlapping CCs
Using the Connected
Components Labeling and
Filtering method
Word
Segmentation

 Width to Height Ratio
 Word Area Density. The percentage of the black
pixels included in the word-bounding box
 Center of Gravity. The Euclidean distance from the
word’s center of gravity to the upper left corner of the
bounding box:
(1,0) (0,1)
(0,0) (0,0)
,x y
M M
C C
M M
 
( , )
qp
pq
x y
x y
M f x y
width height
  
   
   


 Vertical Projection. The first twenty (20) coefficients
of the Discrete Cosine Transform (DCT) of the
smoothed and normalized vertical projection.
 Original Image
 The Vertical
Projection
 Smoothed and
normalized

 Top – Bottom Shape Projections. A vector of 50 elements
 The first 25 values are the first 25 coefficients of the smoothed
and normalized Top Shape Projection DCT
 The rest 25 values are equal to the first 25 coefficients of the
smoothed and normalized Bottom Shape Projection DCT.

 Upper Grid Features is a ten element vector with
binary values which are extracted from the upper part
of each word image.
 Down Grid Features is a ten element vector with
binary values which are extracted from the lower part
of the word image.

[0,0,0,1195 ,0,0,0,0,0,0]
[0,0,0,1 ,0,0,0,0,0,0]
[0,0,0,0 ,0,0,0, 598 , 50 , 33 ]
[0,0,0,0 ,0,0,0,1,1,0]

Descriptor
The Structure of the
Descriptor

 User enters a query word
 The proposed system creates an image of the query
word with font height equal to the average height of all
the word-boxes obtained through the Word
Segmentation stage of the Offline operation.
 For our experimental set the average height is 50
 The font type of the query image is Arial
 The smoothing and normalizing of the various
features described before, suppress small differences
between various types of fonts

 100 image documents created artificially from various
texts
 Then Gaussian and “Salt and Pepper” noise was added
 Implement in parallel a text search engine which
makes easier the verification and evaluation of the
search results of the proposed system

Implementation
o Visual Studio 2008
o Microsoft .NET
Framework 2.0
o C# Language
o Microsoft SQL
Server 2005
http://orpheus.ee.duth.gr/irs2_5/

Evaluation
o Precision and the
Recall metrics
o 30 searches in 100
document images
o Font Query: Arial
 Mean Precision: 87.8%
 Mean Recall: 99.26%

FineReader® 9.0 OCR Program Query Font Name “Tahoma”.
 Mean Precision: 76.67%
 Mean Recall: 58.42%
 Mean Precision: 89.44%
 Mean Recall: 88.05%

 The query word is given in text and then transformed
to word image
 The proposed system extract nine (9) powerful
features for the description of the word images
 These features describe satisfactorily the shape of the
words while at the same moment they suppress small
differences due to noise, size and type of fonts
 Based on our experiments the proposed system
performs better in the same database than a
commercial OCR package

Developing Document Image Retrieval System

Contenu connexe

Tendances

Content-based image retrieval (CBIR) with global features is notoriously noisy, especially for image queries with low percentages of relevant images in a collection. Moreover, CBIR typically ranks the whole collection, which is inefficient for large databases. We experiment with a method for image retrieval from multimodal databases, which improves both the effectiveness and efficiency of traditional CBIR by exploring secondary modalities. We perform retrieval in a two-stage fashion: first rank by a secondary modality, and then perform CBIR only on the top-K items. Thus, effectiveness is improved by performing CBIR on a ‘better’ subset. Using a relatively ‘cheap’ first stage, efficiency is also improved via the fewer CBIR operations performed. Our main novelty is that K is dynamic, i.e. estimated per query to optimize a predefined effectiveness measure. We show that such dynamic two-stage setups can be significantly more effective and robust than similar setups with static thresholds previously proposed.

Dynamic Two-Stage Image Retrieval from Large Multimodal Databases

Konstantinos Zagoris

Self-Directing Text Detection and Removal from Images with Smoothing

Priyanka Wagh

Self-organizing map

Tarat Diloksawatdikul

Btv thesis defense_v1.02-final

Vinh Bui

Sub1586

International Journal of Science and Research (IJSR)

Proposed Model: [I] Pre-processing of server logs: Our web-site server log file analyser performs the following steps when provided with a log file: 1) It scans the entries in the log files to help identify unique visitor’s sessions. 2) For each identified sessions, the analyser has to examine its key matching features to generate the session’s dimensional feature-vector representation. [II] Session identification: In this process of dividing a web-site server access log enters into sessions. Session identification is performed by: 1) Grouping all HTTP requests on web-sites that originate from the same IP address that matches the visitor and also are described by the same user-agent strings. 2) By applying a timeout approach to divide into unique sessions to avoid any mishaps. [III] Dataset labelling: labels each feature-vector as belonging to one of the following four categories: 1. Human visitor’s normal Known. 2. well-behaved web-site attackers. 3. malicious attackers. 4. unknown visitors unidentified. Thus, allow a better understanding of the cluster’s nature and significance results can be generated. Techniques: SOM Algorithm, NNtool, MATLAB, WEKA toolkit, KDD Data-set.

Intrusion Detection Model using Self Organizing Maps.

Tushar Shinde

Image classification is perhaps the most important part of digital image analysis. In this paper, we compare the most widely used model CNN Convolutional Neural Network , and MLP Multilayer Perceptron . We aim to show how both models differ and how both models approach towards the final goal, which is image classification. Souvik Banerjee | Dr. A Rengarajan "Hand-Written Digit Classification" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-5 | Issue-4 , June 2021, URL: https://www.ijtsrd.compapers/ijtsrd42444.pdf Paper URL: https://www.ijtsrd.comcomputer-science/artificial-intelligence/42444/handwritten-digit-classification/souvik-banerjee

Hand Written Digit Classification

Ijetcas14 527

O017429398

Pillar k means

TensorFlow Korea 논문읽기모임 PR12 258번째 논문 review입니다. 이번 논문은 MIT에서 나온 From ImageNet to Image Classification: Contextualizing Progress on Benchmarks입니다. Deep Learning 하시는 분들이면 ImageNet 모르시는 분들이 없을텐데요, 이 논문은 ImageNet의 labeling 방법의 한계와 문제점에 대해서 얘기하고 top-1 accuracy 기반의 평가 방법에도 문제가 있을 수 있음을 지적하고 있습니다. ImageNet data의 20% 이상이 multi object를 포함하고 있지만 그 중에 하나만 정답으로 인정되는 문제가 있고, annotation 방법의 한계로 인하여 실제로 사람이 생각하는 것과 다른 class가 정답으로 labeling되어 있는 경우도 많았습니다. 또한 terrier만 20종이 넘는 등 전문가가 아니면 판단하기 어려운 label도 많다는 문제도 있었구요. 이 밖에도 다양한 실험을 통해서 정량적인 분석과 함께 human-in-the-loop을 이용한 평가로 현재 model들의 성능이 어디까지 와있는지, 그리고 앞으로 더 높은 성능을 내기 위해서 data labeling 측면에서 해결해야할 과제는 무엇인지에 대해서 이야기하고 있습니다. 논문이 양이 좀 많긴 하지만 기술적인 내용이 별로 없어서 쉽게 읽으실 수 있는데요, 자세한 내용이 궁금하신 분들은 영상을 참고해주세요! 논문링크: https://arxiv.org/abs/2005.11295 발표영상링크: https://youtu.be/CPMgX5ikL_8

PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...

Jinwon Lee

This paper proposes a steganalysis technique for both grayscale and color images. It uses the feature vectors derived from gray level co-occurrence matrix (GLCM) in spatial domain, which is sensitive to data embedding process. This GLCM matrix is derived from an image. Several combinations of diagonal elements of GLCM are considered as features. There is difference between the features of stego and non-stego images and this characteristic is used for steganalysis. Distance measures like Absolute distance, Euclidean distance and Normalized Euclidean distance are used for classification. Experimental results demonstrate that the proposed scheme outperforms the existing steganalysis techniques in attacking LSB steganographic schemes applied to spatial domain.

Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix

CSCJournals

Enhanced characterness for text detection in the wild

Prerana Mukherjee

Introduction to Convolutional Neural Networks

ParrotAI

Sefl Organizing Map

Nguyen Van Chuc

J017426467

IOSR Journals

Radial Thickness Calculation and Visualization for Volumetric Layers-8397

Kitware Kitware

IRJET- Object Detection using Hausdorff Distance

IRJET Journal

Convolutional Neural Networks: Part 1

ananth

Brief introduction of NAS Review of EfficientNet (Google Brain), RandWire (FAIR) papers NAS flow slide from KihoSuh's slideshare (https://www.slideshare.net/KihoSuh/neural-architecture-search-with-reinforcement-learning-76883153) [References] [1] EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks (https://arxiv.org/abs/1905.11946) [2] Exploring Randomly Wired Neural Networks for Image Recognition (https://arxiv.org/abs/1904.01569)

201907 AutoML and Neural Architecture Search

DaeJin Kim

Tendances (20)

Dynamic Two-Stage Image Retrieval from Large Multimodal Databases

Self-Directing Text Detection and Removal from Images with Smoothing

Self-organizing map

Btv thesis defense_v1.02-final

Sub1586

Intrusion Detection Model using Self Organizing Maps.

Hand Written Digit Classification

Ijetcas14 527

O017429398

Pillar k means

PR-258: From ImageNet to Image Classification: Contextualizing Progress on Be...

Steganalysis of LSB Embedded Images Using Gray Level Co-Occurrence Matrix

Enhanced characterness for text detection in the wild

Introduction to Convolutional Neural Networks

Sefl Organizing Map

J017426467

Radial Thickness Calculation and Visualization for Volumetric Layers-8397

IRJET- Object Detection using Hausdorff Distance

Convolutional Neural Networks: Part 1

201907 AutoML and Neural Architecture Search

Similaire à Developing Document Image Retrieval System

Text extraction from different kind of images document, caption and scene text images. Discret wavelet transform was used to exract horizontal, vertical and diagonal features and k-means clustering was used to cluster the features into text and background cluster. For simple images k = 2 worked i.e. text and backgroud cluster while for complex images k=3 was used i.e. text cluster, complex background ad simple background.

Texture features based text extraction from images using DWT and K-means clus...

Divya Gera

Optical character recognition, usually abbreviated to OCR, is the mechanical or electronic conversion of scanned images of handwritten, typewritten or printed text into machine-encoded text. It is a system that provides a full alphanumeric recognition of printed or handwritten characters at electronic speed by simply scanning the form. It is widely used as a form of data entry from some sort of original paper data source, whether documents, sales receipts, mail, or any number of printed records. It is a common method of digitizing printed texts so that they can be electronically searched, stored more compactly, displayed on-line, and used in machine processes such as machine translation, text-to-speech and text mining.OCR is a field of research in pattern recognition, artificial intelligence and computer vision. More recently, the term Intelligent Character Recognition(ICR) has been used to describe the process of interpreting image data, in particular alphanumeric text .

Opticalcharacter recognition

Shobhit Saxena

poster

Anik Biswas

The Computer Aided Design Concept in the Concurrent Engineering Context.

Nareshkumar Kannathasan

A complex engineering system such as a xerographic marking engine is an aggregate of interacting subsystems that are coupled through a large number of constraints and design variables. The traditional way of designing these systems is to decouple the overall design into smaller subsystems and assign teams to work on these subsystems. This approach is critical to making the project manageable and enabling concurrent development. However, if the goal is to design systems that can deliver best possible performance, i.e. if the performance limits are being pushed to the extreme, characterizing the interactions becomes critical. Multiobjective optimization is a design methodology that addresses the issue of designing large systems where the goal is to simultaneously optimize a finite number of performance criteria that come from one or more disciplines and are coupled through a set of design variables and constraints. This approach to design makes explicit and quantitative the inherent trade-offs that need to be made in doing coupled system design. It also enables the determination of the attainable limits of performance from a given system. This paper will discuss the multiobjective optimization methodology and optimal methods of performing quantitative trade-off analysis. These design methods will be applied to problems from the xerographic design domain and results will be presented.

MULTIOBJECTIVE OPTIMIZATION AND QUANTITATIVE TRADE-OFF ANALYSIS IN XEROGRAPHI...

Sudhendu Rai

The extension to the content based image retrieval (CBIR) technique based on row mean of transformed columns of image is presented here. As compared to earlier contemplation three image transforms, now the performance appraise of proposed CBIR technique is done using seven different image transforms like Discrete Cosine Transform (DCT), Discrete Sine Transform (DST), Hartley Transform, Haar Transform, Kekre Transform, Walsh Transform and Slant Transform. The generic image database with 1000 images spread across 11 categories is used to test the performance of proposed CBIR techniques. For each transform 55 queries (5 per category) were fired on the image database. Every technique is tested on both the color and grey version of image database. To compare the performance of image retrieval technique across transforms average precision and recall are computed of all queries. The results have shown the performance improvement (higher precision and recall values) with proposed methods compared to all pixel data of image at reduced computations resulting in faster retrieval in both gray as well as color versions of image database. Even the variation of considering DC component of transformed columns as part of feature vector and excluding it are also tested and it is found that presence of DC component in feature vector improvises the results in image retrieval. The ranking of transforms for performance in proposed gray CBIR techniques with DC component consideration can be given as DST, Haar, Hartley, DCT, Walsh, Slant and Kekre. In color variants of proposed techniques with DC component, the performance ranking of image transforms starting from best can be listed as DCT, Haar, Walsh, Slant, DST, Hartley and Kekre transform.

Extended Performance Appraise of Image Retrieval Using the Feature Vector as ...

Waqas Tariq

Xuedong Huang - Deep Learning and Intelligent Applications

Machine Learning Prague

Text Detection and Recognition in Natural Images

IRJET Journal

IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...

IRJET Journal

Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...

IRJET Journal

Basic group of visual techniques such as color, shape, texture are used in Content Based Image Retrievals (CBIR) to retrieve query image or sub region of image to find similar images in image database. To improve query result, relevance feedback is used many times in CBIR to help user to express their preference and improve query results. In this paper, a new approach for image retrieval is proposed which is based on the features such as Color Histogram, Eigen Values and Match Point. Images from various types of database are first identified by using edge detection techniques .Once the image is identified, then the image is searched in the particular database, then all related images are displayed. This will save the retrieval time. Further to retrieve the precise query image, any of the three techniques are used and comparison is done w.r.t. average retrieval time. Eigen value technique found to be the best as compared with other two techniques.

A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL

cscpconf

A comparative analysis of retrieval techniques in content based image retrieval

csandit

Presentation vision transformersppt.pptx

htn540

AN INTEGRATED APPROACH TO CONTENT BASED IMAGERETRIEVAL by Madhu

Madhu Rock

Software systems continuously change and developers spent a large portion of their time in keeping track and understanding changes and their effects. Current development tools provide only limited support. Most of all, they track changes in source files only on the level of textual lines lacking semantic and context information on changes. Developers frequently need to reconstruct this information manually which is a time consuming and error prone task. In this talk, I present three techniques to address this problem by extracting detailed syntactical information from changes in various source files. I start with introducing ChangeDistiller, a tool and approach to extract information on source code changes on the level of ASTs. Next, I present the WSDLDiff approach to extract information on changes in web services interface description files. Finally, I present FMDiff, an approach to extract changes from feature models defined with the linux Kconfig language. For each approach I report on cases studies and experiments to highlight the benefits of our techniques. I also point out several research opportunities opened by our techniques and tools, and the detailed data on changes extracted by them.

Analyzing Changes in Software Systems From ChangeDistiller to FMDiff

Martin Pinzger

The desire of better and faster retrieval techniques has always fuelled to the research in content based image retrieval (CBIR). The extended comparison of innovative content based image retrieval (CBIR) techniques based on feature vectors as fractional coefficients of transformed images using various orthogonal transforms is presented in the paper. Here the fairly large numbers of popular transforms are considered along with newly introduced transform. The used transforms are Discrete Cosine, Walsh, Haar, Kekre, Discrete Sine, Slant and Discrete Hartley transforms. The benefit of energy compaction of transforms in higher coefficients is taken to reduce the feature vector size per image by taking fractional coefficients of transformed image. Smaller feature vector size results in less time for comparison of feature vectors resulting in faster retrieval of images. The feature vectors are extracted in fourteen different ways from the transformed image, with the first being all the coefficients of transformed image considered and then fourteen reduced coefficients sets are considered as feature vectors (as 50%, 25%, 12.5%, 6.25%, 3.125%, 1.5625% ,0.7813%, 0.39%, 0.195%, 0.097%, 0.048%, 0.024%, 0.012% and 0.06% of complete transformed image coefficients). To extract Gray and RGB feature sets the seven image transforms are applied on gray image equivalents and the color components of images. Then these fourteen reduced coefficients sets for gray as well as RGB feature vectors are used instead of using all coefficients of transformed images as feature vector for image retrieval, resulting into better performance and lower computations. The Wang image database of 1000 images spread across 11 categories is used to test the performance of proposed CBIR techniques. 55 queries (5 per category) are fired on the database o find net average precision and recall values for all feature sets per transform for each proposed CBIR technique. The results have shown performance improvement (higher precision and recall values) with fractional coefficients compared to complete transform of image at reduced computations resulting in faster retrieval. Finally Kekre transform surpasses all other discussed transforms in performance with highest precision and recall values for fractional coefficients (6.25% and 3.125% of all coefficients) and computation are lowered by 94.08% as compared to Cosine or Sine or Hartlay transforms.

Comprehensive Performance Comparison of Cosine, Walsh, Haar, Kekre, Sine, Sla...

CSCJournals

Document Analysis and Recognition (DAR) aims to extract automatically the information in the document and also addresses to human comprehension. The automatic processing of degraded historical documents are applications of document image analysis field which is confronted with many difficulties due to the storage condition and the complexity of the script. The main interest of enhancement of historical documents is to remove undesirable statistics that appear in the background and highlight the foreground, so as to enable automatic recognition of documents with high accuracy. This paper addresses pre-processing and segmentation of ancient scripts, as an initial step to automate the task of an epigraphist in reading and deciphering inscriptions. Pre-processing involves, enhancement of degraded ancient document images which is achieved through four different Spatial filtering methods for smoothing or sharpening namely Median, Gaussian blur, Mean and Bilateral filter, with different mask sizes. This is followed by binarization of the enhanced image to highlight the foreground information, using Otsu thresholding algorithm. In the second phase Segmentation is carried out using Drop Fall and WaterReservoir approaches, to obtain sampled characters, which can be used in later stages of OCR. The system showed good results when tested on the nearly 150 samples of varying degraded epigraphic images and works well giving better enhanced output for, 4x4 mask size for Median filter, 2x2 mask size for Gaussian blur, 4x4 mask size for Mean and Bilateral filter. The system can effectively sample characters from enhanced images, giving a segmentation rate of 85%-90% for Drop Fall and 85%-90% for Water Reservoir techniques respectively

Enhancement and Segmentation of Historical Records

csandit

Speech Emotion Recognition Using Machine Learning

IRJET Journal

MOVIE RECOMMENDATION SYSTEM.pptx

Ayushkumar417871

A tale of bug prediction in software development

Martin Pinzger

Similaire à Developing Document Image Retrieval System (20)

Texture features based text extraction from images using DWT and K-means clus...

Opticalcharacter recognition

poster

The Computer Aided Design Concept in the Concurrent Engineering Context.

MULTIOBJECTIVE OPTIMIZATION AND QUANTITATIVE TRADE-OFF ANALYSIS IN XEROGRAPHI...

Extended Performance Appraise of Image Retrieval Using the Feature Vector as ...

Xuedong Huang - Deep Learning and Intelligent Applications

Text Detection and Recognition in Natural Images

IRJET- Document Layout analysis using Inverse Support Vector Machine (I-SV...

Document Layout analysis using Inverse Support Vector Machine (I-SVM) for Hin...

A COMPARATIVE ANALYSIS OF RETRIEVAL TECHNIQUES IN CONTENT BASED IMAGE RETRIEVAL

A comparative analysis of retrieval techniques in content based image retrieval

Presentation vision transformersppt.pptx

AN INTEGRATED APPROACH TO CONTENT BASED IMAGERETRIEVAL by Madhu

Analyzing Changes in Software Systems From ChangeDistiller to FMDiff

Comprehensive Performance Comparison of Cosine, Walsh, Haar, Kekre, Sine, Sla...

Enhancement and Segmentation of Historical Records

Speech Emotion Recognition Using Machine Learning

MOVIE RECOMMENDATION SYSTEM.pptx

A tale of bug prediction in software development

Dernier

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Product Anonymous

Discord is a free app offering voice, video, and text chat functionalities, primarily catering to the gaming community. It serves as a hub for users to create and join servers tailored to their interests. Discord’s ecosystem comprises servers, each functioning as a distinct online community with its own channels dedicated to specific topics or activities. Users can engage in text-based discussions, voice calls, or video chats within these channels. Understanding Discord Servers Discord servers are virtual spaces where users congregate to interact, share content, and build communities. Servers may revolve around gaming, hobbies, interests, or fandoms, providing a platform for like-minded individuals to connect. Communication Features Discord offers a range of communication tools, including text channels for messaging, voice channels for real-time audio conversations, and video channels for face-to-face interactions. These features facilitate seamless communication and collaboration. What Does NSFW Mean? The acronym NSFW stands for “Not Safe For Work,” indicating content that may be inappropriate for professional or public settings. NSFW Content NSFW content encompasses material that is sexually explicit, violent, or otherwise graphic in nature. It often includes nudity, profanity, or depictions of sensitive topics.

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

UK Journal

A Domino Admins Adventures (Engage 2024)

Gabriella Davis

This presentations targets students or working professionals. You may know Google for search, YouTube, Android, Chrome, and Gmail, but did you know Google has many developer tools, platforms & APIs? This comprehensive yet still high-level overview outlines the most impactful tools for where to run your code, store & analyze your data. It will also inspire you as to what's possible. This talk is 50 minutes in length.

Powerful Google developer tools for immediate impact! (2023-24 C)

wesley chun

Strategies for Landing an Oracle DBA Job as a Fresher

Remote DBA Services

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

The Digital Insurer

Building Digital Trust in a Digital Economy Veronica Tan, Director - Cyber Security Agency of Singapore Apidays Singapore 2024: Connecting Customers, Business and Technology (April 17 & 18, 2024) ------ Check out our conferences at https://www.apidays.global/ Do you want to sponsor or talk at one of our conferences? https://apidays.typeform.com/to/ILJeAaV8 Learn more on APIscene, the global media made by the community for the community: https://www.apiscene.io Explore the API ecosystem with the API Landscape: https://apilandscape.apiscene.io/

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

apidays

Finology Group – Insurtech Innovation Award 2024

The Digital Insurer

How to Troubleshoot Apps for the Modern Connected Worker

ThousandEyes

Histor y of HAM Radio presentation slide

vu2urc

Heather Hedden, Senior Consultant at Enterprise Knowledge, presented “The Role of Taxonomy and Ontology in Semantic Layers” at a webinar hosted by Progress Semaphore on April 16, 2024. Taxonomies at their core enable effective tagging and retrieval of content, and combined with ontologies they extend to the management and understanding of related data. There are even greater benefits of taxonomies and ontologies to enhance your enterprise information architecture when applying them to a semantic layer. A survey by DBP-Institute found that enterprises using a semantic layer see their business outcomes improve by four times, while reducing their data and analytics costs. Extending taxonomies to a semantic layer can be a game-changing solution, allowing you to connect information silos, alleviate knowledge gaps, and derive new insights. Hedden, who specializes in taxonomy design and implementation, presented how the value of taxonomies shouldn’t reside in silos but be integrated with ontologies into a semantic layer. Learn about: - The essence and purpose of taxonomies and ontologies in information and knowledge management; - Advantages of semantic layers leveraging organizational taxonomies; and - Components and approaches to creating a semantic layer, including the integration of taxonomies and ontologies

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Enterprise Knowledge

With more memory available, system performance of three Dell devices increased, which can translate to a better user experience Conclusion When your system has plenty of RAM to meet your needs, you can efficiently access the applications and data you need to finish projects and to-do lists without sacrificing time and focus. Our test results show that with more memory available, three Dell PCs delivered better performance and took less time to complete the Procyon Office Productivity benchmark. These advantages translate to users being able to complete workflows more quickly and multitask more easily. Whether you need the mobility of the Latitude 5440, the creative capabilities of the Precision 3470, or the high performance of the OptiPlex Tower Plus 7010, configuring your system with more RAM can help keep processes running smoothly, enabling you to do more without compromising performance.

Boost PC performance: How more available memory can improve productivity

Principled Technologies

In this session, we will delve into strategic approaches for optimizing knowledge management within Microsoft 365, amidst the evolving landscape of Copilot. From leveraging automatic metadata classification and permission governance with SharePoint Premium, to unlocking Viva Engage for the cultivation of knowledge and communities, you will gain actionable insights to bolster your organization's knowledge-sharing initiatives. In this session, we will also explore how to facilitate solutions to enable your employees to find answers and expertise within Microsoft 365. You will leave equipped with practical techniques and a deeper understanding of how there is more to effective knowledge management than just enabling Copilot, but building actual solutions to prepare the knowledge that Copilot and your employees can use.

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Drew Madelung

MySQL Webinar, presented on the 25th of April, 2024. Summary: MySQL solutions enable the deployment of diverse Database Architectures tailored to specific needs, including High Availability, Disaster Recovery, and Read Scale-Out. With MySQL Shell's AdminAPI, administrators can seamlessly set up, manage, and monitor these solutions, ensuring efficiency and ease of use in their administration. MySQL Router, on the other hand, provides transparent routing from the application traffic to the backend servers in the architectures, requiring minimal configuration. Completely built in-house and supported by Oracle, these solutions have been adopted by enterprises of all sizes for their business-critical applications. In this presentation, we'll delve into various database architecture solutions to help you choose the right one based on your business requirements. Focusing on technical details and the latest features to maximize the potential of these solutions.

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Miguel Araújo

Scaling API-first – The story of a global engineering organization

Radu Cotescu

How to Troubleshoot Apps for the Modern Connected Worker

ThousandEyes

GenAI Risks & Security Meetup 01052024.pdf

lior mazor

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Rafal Los

Automating Google Workspace (GWS) & more with Apps Script

wesley chun

Imagine a world where information flows as swiftly as thought itself, making decision-making as fluid as the data driving it. Every moment is critical, and the right tools can significantly boost your organization’s performance. The power of real-time data automation through FME can turn this vision into reality. Aimed at professionals eager to leverage real-time data for enhanced decision-making and efficiency, this webinar will cover the essentials of real-time data and its significance. We’ll explore: FME’s role in real-time event processing, from data intake and analysis to transformation and reporting An overview of leveraging streams vs. automations FME’s impact across various industries highlighted by real-life case studies Live demonstrations on setting up FME workflows for real-time data Practical advice on getting started, best practices, and tips for effective implementation Join us to enhance your skills in real-time data automation with FME, and take your operational capabilities to the next level.

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Safe Software

Dernier (20)

ProductAnonymous-April2024-WinProductDiscovery-MelissaKlemke

Understanding Discord NSFW Servers A Guide for Responsible Users.pdf

A Domino Admins Adventures (Engage 2024)

Powerful Google developer tools for immediate impact! (2023-24 C)

Strategies for Landing an Oracle DBA Job as a Fresher

Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024

Apidays Singapore 2024 - Building Digital Trust in a Digital Economy by Veron...

Finology Group – Insurtech Innovation Award 2024

How to Troubleshoot Apps for the Modern Connected Worker

Histor y of HAM Radio presentation slide

The Role of Taxonomy and Ontology in Semantic Layers - Heather Hedden.pdf

Boost PC performance: How more available memory can improve productivity

Strategies for Unlocking Knowledge Management in Microsoft 365 in the Copilot...

Mastering MySQL Database Architecture: Deep Dive into MySQL Shell and MySQL R...

Scaling API-first – The story of a global engineering organization

How to Troubleshoot Apps for the Modern Connected Worker

GenAI Risks & Security Meetup 01052024.pdf

The 7 Things I Know About Cyber Security After 25 Years | April 2024

Automating Google Workspace (GWS) & more with Apps Script

From Event to Action: Accelerate Your Decision Making with Real-Time Automation

Developing Document Image Retrieval System

1. K. Zagoris, K. Ergina and N. Papamarkos Image Processing and Multimedia Laboratory Department of Electrical & Computer Engineering Democritus University of Thrace, 67100 Xanthi, Greece

2.  Phenomenal growth of the size of multimedia data and especially document images  Caused by the easiness to create such images using scanners or digital cameras  Huge quantities of document images are created and stored in image archives without having any indexing information

3. Theoverall structureof the DocumentImage Retrieval System

4.  Binarization (Otsu Technique)  Original Document  Median Filter

5.  Indentify all the Connected Components (CCs)  Calculate the most common height of the document CCs (CCch)  Reject the CCs with height less than 70% of the CCch. That only reject areas of punctuation points and noise.  Expand the left and right sides of the resulted CCs by 20% of the CCch  The words are the merged overlapping CCs Using the Connected Components Labeling and Filtering method Word Segmentation

6.  Width to Height Ratio  Word Area Density. The percentage of the black pixels included in the word-bounding box  Center of Gravity. The Euclidean distance from the word’s center of gravity to the upper left corner of the bounding box: (1,0) (0,1) (0,0) (0,0) ,x y M M C C M M   ( , ) qp pq x y x y M f x y width height            

7.  Vertical Projection. The first twenty (20) coefficients of the Discrete Cosine Transform (DCT) of the smoothed and normalized vertical projection.  Original Image  The Vertical Projection  Smoothed and normalized

8.  Top – Bottom Shape Projections. A vector of 50 elements  The first 25 values are the first 25 coefficients of the smoothed and normalized Top Shape Projection DCT  The rest 25 values are equal to the first 25 coefficients of the smoothed and normalized Bottom Shape Projection DCT.

9.  Upper Grid Features is a ten element vector with binary values which are extracted from the upper part of each word image.  Down Grid Features is a ten element vector with binary values which are extracted from the lower part of the word image.

10. [0,0,0,1195 ,0,0,0,0,0,0] [0,0,0,1 ,0,0,0,0,0,0] [0,0,0,0 ,0,0,0, 598 , 50 , 33 ] [0,0,0,0 ,0,0,0,1,1,0]

11. Descriptor The Structure of the Descriptor

12.  User enters a query word  The proposed system creates an image of the query word with font height equal to the average height of all the word-boxes obtained through the Word Segmentation stage of the Offline operation.  For our experimental set the average height is 50  The font type of the query image is Arial  The smoothing and normalizing of the various features described before, suppress small differences between various types of fonts

13. The Matching Process

14.  100 image documents created artificially from various texts  Then Gaussian and “Salt and Pepper” noise was added  Implement in parallel a text search engine which makes easier the verification and evaluation of the search results of the proposed system

15. Implementation o Visual Studio 2008 o Microsoft .NET Framework 2.0 o C# Language o Microsoft SQL Server 2005 http://orpheus.ee.duth.gr/irs2_5/

16. Evaluation o Precision and the Recall metrics o 30 searches in 100 document images o Font Query: Arial  Mean Precision: 87.8%  Mean Recall: 99.26%

17. FineReader® 9.0 OCR Program Query Font Name “Tahoma”.  Mean Precision: 76.67%  Mean Recall: 58.42%  Mean Precision: 89.44%  Mean Recall: 88.05%

18.  The query word is given in text and then transformed to word image  The proposed system extract nine (9) powerful features for the description of the word images  These features describe satisfactorily the shape of the words while at the same moment they suppress small differences due to noise, size and type of fonts  Based on our experiments the proposed system performs better in the same database than a commercial OCR package

Developing Document Image Retrieval System

Recommandé

Recommandé

Contenu connexe

Tendances

Tendances (20)

Similaire à Developing Document Image Retrieval System

Similaire à Developing Document Image Retrieval System (20)

Dernier

Dernier (20)

Developing Document Image Retrieval System