1. Proposal for HP – Dspace (Digital Repository
Solution) from HP India Sales Pvt Ltd
May 11, 2009
2. HP India Sales Pvt Ltd
No. 2 Harrington Road, HP Towers
North Tower, Chetpet
Chennai 600031
www.hp.com
Satyanarayana May 11, 2009
Venkatarao
Attn:
Customer Details
Tel: +91 44 28365566
kantipudi.satya@hp.com
Dear Chairman,
HP India Sales Pvt Ltd is pleased to offer a proposal for Digital Library Solution
based on leading-edge information technology products and services -a broad
portfolio of market-leading products that offer flexibility, investment protection and
superior performance
Benefits of Digital Repository solution for the Institution:
• Preservation and conservation, an exact copy of original copy can be made any
number of times with out performance degradation.
• Round the clock availability of data.
• Digitally store TV Programs, News paper articles, photos, Audio Lectures, Video
Lectures and Sound.
• Fast and easy Information retrieval whereby the user is able to use any
search term, word, phrase, title, name or subject to search the entire
collection.
• Provide very user-friendly interfaces, giving clickable access to its resources.
The Institution can have confidence in the proposed approach because HP India
Sales Pvt Ltd has assisted customers worldwide with the successful deployment of
similar business solutions.
HP India Sales Pvt Ltd is committed to Institution’s success and is confident that
solution addresses all critical requirements. We look forward to meeting with you to
review our capabilities, to discuss the benefits of our proposed solution, and to
explore the next steps in forging a strong and mutually beneficial relationship.
Sincerely,
HP India Sales Pvt Ltd
Satyanarayana Venkatarao
3. HP Proposal for Digital Repository
May 11, 2009
Digital Repository Solution Framework
Background
Digital repository, also known as digital libraries are important for
organizations in helping to manage and capture intellectual assets as part of
their information strategy. A digital repository can hold wide range of
materials for a variety of purposes and users. It can support research,
learning, and administrative processes. Broadly speaking, a digital repository
is a library in which collections are stored in digital formats (as opposed to
print, microform, or other media) and accessible by computers. The digital
content may be stored locally, or accessed remotely via computer networks.
A digital library is a type of information retrieval system.
Digital Repositories can immediately adopt innovations in technology
providing users with improvements in electronic and audio book
technology as well as presenting new forms of communication such as
wikis and blogs.
No physical boundary. The user of a digital repository/library need not look for
physical paper files; people from across the locations can gain access to the
same information, as long as an Internet connection is available.
Round the clock availability. A major advantage of digital libraries/repository is
that people can gain access to the information at any time, night or day.
Multiple accesses. The same resources can be used at the same time by a
number of users.
Structured approach. Digital libraries/repository provide access to much richer
content in a more structured manner, i.e. we can easily move from the catalog
to the particular book then to a particular chapter and so on.
Information retrieval. The user is able to use any search term
(word, phrase, title, name, subject) to search the entire collection.
Digital libraries/repository can provide very user-friendly interfaces,
giving clickable access to its resources.
Preservation and conservation. An exact copy of the original can be made any
number of times without any degradation in quality.
Networking. A particular digital library can provide a link to any other
resources of other digital libraries very easily; thus a seamlessly
integrated resource sharing can be achieved.
Page-3
4. HP Proposal for Digital Repository
May 11, 2009
Digital Repository Architecture
Digital Repository Solution is based on multi-tier architecture, categorized broadly into
three tiers:
1. Presentation Tier (Web Server) Top most level of library
solution is the user interface. It comprises of a web server,
acting as an interface with library user(s).
2. Logic Tier (Application Server) : This layer processes commands, retrieves data
and sends to presentation layer to be shown to users. Application Servers like Tomcat
are typically part of this layer.
3. Data Tier (Database & Storage Server) : This layer stores library information for
retrieval by Application server which processes it and send to web server for display to
users. It consists of a database server, storage and backup devices for data integrity.
Page-4
5. HP Proposal for Digital Repository
May 11, 2009
Objective of the Proposal
This Proposal is for supply and deployment of the Dspace Digital Repository software
for the Institution.
This Proposal facilitates creation of a platform for a Digital Web Based knowledge
repository accessible by the community. The aim is to capture, store, index, preserve
and redistribute the knowledge assets of the Institution for the overall development
of the community on long-term basis.
The implementation of this proposal will bring years of joint Research between
Massachusetts Institute of Technology (MIT) and HP in developing Dspace to assist
the Institution in creation of a world-class knowledge repository for the benefit of the
community.
Benefits to the Institution
• Implementation of the Digital Repository provides an opportunity for the
Institution to achieve and sustain leadership position in Education,
Consultancy, Research and Development.
• Implementation of Digital Repository would help preserve knowledge
generated within the institution and make it accessible to all for the
overall development of the community.
• Implementation of the Digital Repository provides an opportunity on
updating the skills and knowledge of the community.
• Implementation of the Digital Repository provides an opportunity for the
Institution to provide to its community the latest technologies as per the
industry requirements.
• Implementation of Digital Repository will facilitate collaboration and
sharing of Knowledge amongst the community on long-term basis.
Specifically, the Digital Repository provides the following benefits:
• Provides the capability to publish research results out quickly.
• Helps in reaching the community through exposure to the omnipresent
Internet.
• Archiving and distributing material normally currently stored in the library.
• Keeping track of publications/bibliography
• Having a persistent network identifier for your work, as shown in this
image:
Page-5
6. HP Proposal for Digital Repository
May 11, 2009
1. The Dspace Digital Repository
A Digital institutional repository is a set of services that an institute offers to the
members of its community for the management and dissemination of digital
materials created by the institution and its community members. It is most
essentially an organizational commitment to the stewardship of these digital
materials, including long-term preservation where appropriate, as well as
organization and access or distribution.
To collect, distribute, and preserve research materials in increasingly complex digital
formats are a time-consuming and expensive chore for individual faculty and their
departments, labs, and centers to manage themselves. The DSpace system provides
a way to manage these research materials and publications in a professionally
maintained repository to give them greater visibility and accessibility over time.
DSpace is a groundbreaking digital library system to capture, store, index, preserve,
and redistribute all scholarly research material in digital formats.
One can share research findings quickly with a worldwide audience and preserve
materials in perpetuity.
DSpace captures data in any format – in text, video, audio, and data. It distributes
it over the web. It indexes the work, so users can search and retrieve items. It
preserves digital work over the long term.
DSpace provides a way to manage research materials and publications in a
professionally maintained repository to give them greater visibility and accessibility
over time.
Adding content in Dspace
DSpace is easy to use. You use your web browser to submit content and search or
browse its collections.
Page-6
7. HP Proposal for Digital Repository
May 11, 2009
To submit content, you upload the file(s) and add descriptive information including
title, author, publication information, and keywords. This descriptive data is known
as metadata.
To add your content, though, you must belong to a DSpace community. Speak with
your library’s staff to learn more about DSpace communities.
Licensing and copyright issues
To add content to DSpace, one must have the copyright to the material, or have
permission to submit work for which one does not have copyright. One should be
willing and able to grant the institute library the right to preserve and distribute the
work in DSpace.
Many publishers offer a “self-archiving” clause in publication contracts, which allows
one to archive a copy of work. If the publisher doesn’t offer such a clause, one can
negotiate to include one.
Each institute sets its own licensing requirements for DSpace.
Preserving data for grants
DSpace provides a means to preserve and distribute data and research, as is
required in many grants.
Reference Sites
• Central Plantation Crops Research Institute, Kasargod
• GB Pant University of Agriculture & Technology, Pant Nagar
• Indira Gandhi Institute for Development Research, Mumbai
• INFLIBNET, Ahemdabad
• Indian Institute of Astrophysics, Bangalore
• Indian Institute of Management, Kozhikode
• Indian Institute of Science, Bangalore
• Indian Institute of Technology, Bombay
• Indian Institute of Technology, New Delhi
• Indian Institute of Technology, Kharagpur
• Indian National Science Academy, New Delhi
• Indian Statistical Institute, Bangalore
• LDL: Librarians' Digital Library, DRTC
• National Centre for Radio Astrophysics, Pune
• National Chemical Laboratory (NCL), Pune
• National Institute of Oceanography, Goa
Page-7
8. HP Proposal for Digital Repository
May 11, 2009
• National Institute of Technology, Rourkela
• Raman Research Institute, Bangalore
• Sri Venkateswara University, Tirupati
• University of Hyderabad, Hyderbad
DSpace Features
The major features of Dspace are:
• Institutional Repository
• DSpace is a digital library system to capture, store, index, preserve, and
redistribute the intellectual output of a university’s research faculty in digital
formats.
• DSpace is organized to accommodate the multidisciplinary and organizational
needs of a large institution.
• DSpace provides access to the digital work of the whole institution through
one interface.
• DSpace is organized into Communities and Collections, each of which
retains its identity within the repository.
• Customization for DSpace communities and collections allows for flexibility in
determining policies and workflow.
Supported Formats and Content Types
DSpace accepts any type of digital content, including:
• Text
• Images
• Audio
• Video
Some examples of items that DSpace can accommodate are:
• Documents such as articles, preprints, working papers, technical reports,
conference papers
• Books
• Theses
• Data sets
• Computer programs
• Visual simulations and models
Each institution that implements DSpace can determine its own list of supported
formats and content types, based on its needs and resources.
Page-8
9. HP Proposal for Digital Repository
May 11, 2009
Digital Preservation
One of the primary goals of DSpace is to preserve digital information.
• DSpace provides long-term physical storage and management of digital items
in a secure, professionally managed repository including standard operating
procedures such as backup, mirroring, refreshing media, and disaster
recovery.
• DSpace assigns a persistent identifier to each contributed item to ensure its
retrievability far into the future.
• DSpace provides a mechanism for advising content contributors of the
preservation support levels they can expect for the files they submit.
Access Control
DSpace allows contributors to limit access to items in DSpace, at both the collection
and the individual item level.
Versioning
New versions of previously submitted DSpace items can be added and linked to each
other, with or without withdrawal of the older item.
Multiple formats of the same content item can be submitted to DSpace, for example,
a TIFF file and a GIF file of the same image.
Search and Retrieval
The DSpace submission process allows for the description of each item using a
qualified version of the “Dublin Core metadata” schema. These descriptions are
entered into a relational database, which is used by the search engine to retrieve
items.
Architecture Overview
The DSpace system is organized into three layers, each of which consists of a
number of components.
Page-9
10. HP Proposal for Digital Repository
May 11, 2009
DSpace System Architecture
The storage layer is responsible for physical storage of metadata and content.
The business logic layer deals with managing the content of the archive, users of
the archive (e-people), authorization, and workflow. The application layer
contains components that communicate with the world outside of the individual
DSpace installation, for example the Web user interface and the Open Archives
Initiative protocol for metadata harvesting service.
Each layer only invokes the layer below it; the application layer may not used
the storage layer directly, for example. Each component in the storage and
business logic layers has a defined public API. The union of the APIs of those
components is referred to as the Storage API (in the case of the storage layer)
and the DSpace Public API (in the case of the business logic layer). These APIs
are in-process Java classes, objects and methods.
It is important to note that each layer is trusted. Although the logic for
authorizing actions is in the business logic layer, the system relies on individual
applications in the application layer to correctly and securely authenticate e-
people. If a 'hostile' or insecure application were allowed to invoke the Public API
directly, it could very easily perform actions as any e-person in the system.
The reason for this design choice is that authentication methods will vary widely
between different applications, so it makes sense to leave the logic and
responsibility for that in these applications.
The source code is organized to cohere very strictly to this three-layer
architecture. Also, only methods in a component's public API are given the
public access level. This means that the Java compiler helps ensure that the
source code conforms to the architecture.
Creating Content with Dspace
Adding content to DSpace is very easy to do. This section illustrates the basic steps:
• Choose a collection
• Describe your content item by adding metadata and keywords
• Upload the file(s)
• Verify the submitted item
• Accept the DSpace license
• Find your submitted items in a workflow
Page-10
11. HP Proposal for Digital Repository
May 11, 2009
Deliverables
HP Deliverables are as detailed below:
1. Installation of OS and layered components: Installation of Operating System and
layered products like Apache Web Servers, SendMail System, Samba Services,
and infrastructure services like DNS client services
2. Prerequisites Software Installation: Installation of prerequisite software for
Dspace, which includes various Java patches and layered Java products, like java
activation services, mail api layers etc.
3. Installation of Apache: Installation and configuration of secured (SSL) Apache for
use with Dspace. Involves configuration of OpenSSL and Apache with dynamic
loadable modules like APR, mod_php and mod_ldap.
4. Installation of Tomcat: Installation and configuration of Tomcat for use with
Dspace. Involves integration of Apache with tomcat with Apache using apache
portable runtime and mod_jk2 connector (a connector which serves servlet and
jsp requests to apache from the backend and lets apache serve the web
requests).
5. Creating sample Users and setting permissions: Create sample users with varying
access rights for participation in Dspace digital library.
6. Install Dspace: Install and configure Dspace. This is base configuration.
7. Configure PostgreSQL: Configure postgresql and initialize persistant objects for
Dpsace.
8. Configure Dspace: Configure Dspace for usage by user community
9. Initialize (run scripts): Automate startup and shutdown procedure of dspace
10. Scan the sample articles provided by the Institution and upload the same along
with the Metadata into Dspace.
11. Training on Administration and usage: Administration of Dspace. Demonstrate
key concepts in Dspace to user groups
Duration
The services are expected to be completed in 6 Months.
Page-11
12. HP Proposal for Digital Repository
May 11, 2009
3. Commercials
Hardware and Software
Sl No Description Value
1 DSPACE Hardware and Software 9,99,999/-
2 Implementation and Support
Assumptions
1
Additional Servers, OS and Networking equipment may be
required if not already available
2 LAN and Networking equipment such Switches and Cabling for
the LAN are assumed to be available
3 Internet Connectivity is assumed to be available
4 Adequate Capacity UPS for the Infrastructure is assumed to be
available
5 LCD for Training is assumed to be available
6 High Speed Scanner is available with the Colleges.
Terms and Conditions
1. All Prices are inclusive of taxes. Prevailing rates of taxes are 2% CST,4%
VAT and 12.5% Service TAX
2. The Infrastructure suggested is based on certain assumptions listed herein.
3. Books are assumed to be in Bound conditions. It is assumed that they can
be unbound, scanned, rebound and returned.
4. It is assumed that the college will provide necessary resources for scanning
like content and man power.
5. Purchase order may be placed with our Business Partner (Name and
Address)
Page-12