5. Learn. Connect. Collaborate.
How Do We Get Our Data?
• How many documents are in Alfresco?
– ?q=DOC_TYPE:Node
• What is the total number of documents by site?
– limit(alfrescoFacets(facet(alfresco,q="*",buckets=SITE,bucketSorts="index
asc",sum(cm_content.size),bucketSizeLimit=1250)),1000)
7. Learn. Connect. Collaborate.
SQL Was Designed To Be…
“More accessible to users without formal
training in mathematics or computer
programming.”
Donald Chamberlin and Raymond Boyce, 1974
8. Learn. Connect. Collaborate.
How Do We Get Our Data?
• How many documents are in Alfresco?
– ?q=DOC_TYPE:Node
– select count(*) from alfresco
• What is the total number of documents by site?
– limit(alfrescoFacets(facet(alfresco,q="*",buckets=SITE,bucketSorts="index
asc",sum(cm_content.size),bucketSizeLimit=1250)),1000)
– SELECT SITE, sum(`cm_content.size`) from alfresco group by SITE
9. Learn. Connect. Collaborate.
The Benefits Of SQL Integration In Solr
• SQL skills are much more common and widely used
• It hides Solr’s complexity
• Ability to express in another way
• Existing reporting tools work with JDBC drivers
10. Learn. Connect. Collaborate.
How Does It Work
• It treats a Solr collection as a table
• It translates SQL statement to streaming expression
• Uses the Solr streaming expression API
V1 was very hard to setup, had many moving parts: additional db, active mq etc.
Moving data into analytics db was a challenge.
------------- solution-----------
Focus on what alfresco does best, platform that provide service (focus our efforts on api/jdbc).
Use existing stack
Simplify the process
Make it easy to use, make it user friendly
How do we get the data out, lack of solr query knowledge?
Our solution
The SQL design statement taken from the IBM research paper in 1974.
Combining the power of Full text search with SQL
Demonstrate how the jdbc works with db visualiser and Zeppelin with alfresco.