Soumettre la recherche
Mettre en ligne
Non-Relational Databases: This hurts. I like it.
•
Télécharger en tant que ODP, PDF
•
3 j'aime
•
614 vues
Onyxfish
Suivre
Delivered at San Luis Obispo .NET Users Group on October 13th, 2009.
Lire moins
Lire la suite
Technologie
Signaler
Partager
Signaler
Partager
1 sur 27
Télécharger maintenant
Recommandé
"Einstürzenden Neudaten: Building an Analytics Engine from Scratch", Tobias J...
"Einstürzenden Neudaten: Building an Analytics Engine from Scratch", Tobias J...
Dataconomy Media
Google Dremel. Concept and Implementations.
Google Dremel. Concept and Implementations.
Vicente Orjales
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Spark Summit
Ajug april 2011
Ajug april 2011
Christopher Curtin
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
Dataiku
Databases benoitg 2009-03-10
Databases benoitg 2009-03-10
benoitg
Big Data Day LA 2016/ Big Data Track - Twitter Heron @ Scale - Karthik Ramasa...
Big Data Day LA 2016/ Big Data Track - Twitter Heron @ Scale - Karthik Ramasa...
Data Con LA
Nosql East October 2009
Nosql East October 2009
Christopher Curtin
Recommandé
"Einstürzenden Neudaten: Building an Analytics Engine from Scratch", Tobias J...
"Einstürzenden Neudaten: Building an Analytics Engine from Scratch", Tobias J...
Dataconomy Media
Google Dremel. Concept and Implementations.
Google Dremel. Concept and Implementations.
Vicente Orjales
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Realtime Risk Management Using Kafka, Python, and Spark Streaming by Nick Evans
Spark Summit
Ajug april 2011
Ajug april 2011
Christopher Curtin
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
Lambda Architecture - Storm, Trident, SummingBird ... - Architecture and Over...
Dataiku
Databases benoitg 2009-03-10
Databases benoitg 2009-03-10
benoitg
Big Data Day LA 2016/ Big Data Track - Twitter Heron @ Scale - Karthik Ramasa...
Big Data Day LA 2016/ Big Data Track - Twitter Heron @ Scale - Karthik Ramasa...
Data Con LA
Nosql East October 2009
Nosql East October 2009
Christopher Curtin
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
MLconf
RasterFrames + STAC
RasterFrames + STAC
Simeon Fitch
Hadoop and Cascading At AJUG July 2009
Hadoop and Cascading At AJUG July 2009
Christopher Curtin
How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data Team
Traveloka
Big data real time architectures
Big data real time architectures
Daniel Marcous
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
Srinath Perera
Yahoo! Mail antispam - Bay area Hadoop user group
Yahoo! Mail antispam - Bay area Hadoop user group
Hadoop User Group
Karmasphere hadoop-productivity-tools
Karmasphere hadoop-productivity-tools
Hadoop User Group
Real-Time Anomoly Detection with Spark MLib, Akka and Cassandra by Natalino Busa
Real-Time Anomoly Detection with Spark MLib, Akka and Cassandra by Natalino Busa
Spark Summit
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Spark Summit
Streaming computing: architectures, and tchnologies
Streaming computing: architectures, and tchnologies
Natalino Busa
An excursion into Graph Analytics with Apache Spark GraphX
An excursion into Graph Analytics with Apache Spark GraphX
Krishna Sankar
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
guest5b1607
Emphemeral hadoop clusters in the cloud
Emphemeral hadoop clusters in the cloud
gfodor
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku
Case study- Real-time OLAP Cubes
Case study- Real-time OLAP Cubes
Ziemowit Jankowski
A Hadoop Primer
A Hadoop Primer
sogrady
Hadoop - Simple. Scalable.
Hadoop - Simple. Scalable.
elliando dias
Getting Started on Hadoop
Getting Started on Hadoop
Paco Nathan
Realtime Data Analysis Patterns
Realtime Data Analysis Patterns
Mikio L. Braun
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
Reynold Xin
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
Contenu connexe
Tendances
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
MLconf
RasterFrames + STAC
RasterFrames + STAC
Simeon Fitch
Hadoop and Cascading At AJUG July 2009
Hadoop and Cascading At AJUG July 2009
Christopher Curtin
How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data Team
Traveloka
Big data real time architectures
Big data real time architectures
Daniel Marcous
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
Srinath Perera
Yahoo! Mail antispam - Bay area Hadoop user group
Yahoo! Mail antispam - Bay area Hadoop user group
Hadoop User Group
Karmasphere hadoop-productivity-tools
Karmasphere hadoop-productivity-tools
Hadoop User Group
Real-Time Anomoly Detection with Spark MLib, Akka and Cassandra by Natalino Busa
Real-Time Anomoly Detection with Spark MLib, Akka and Cassandra by Natalino Busa
Spark Summit
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Spark Summit
Streaming computing: architectures, and tchnologies
Streaming computing: architectures, and tchnologies
Natalino Busa
An excursion into Graph Analytics with Apache Spark GraphX
An excursion into Graph Analytics with Apache Spark GraphX
Krishna Sankar
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
guest5b1607
Emphemeral hadoop clusters in the cloud
Emphemeral hadoop clusters in the cloud
gfodor
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku
Case study- Real-time OLAP Cubes
Case study- Real-time OLAP Cubes
Ziemowit Jankowski
A Hadoop Primer
A Hadoop Primer
sogrady
Hadoop - Simple. Scalable.
Hadoop - Simple. Scalable.
elliando dias
Getting Started on Hadoop
Getting Started on Hadoop
Paco Nathan
Realtime Data Analysis Patterns
Realtime Data Analysis Patterns
Mikio L. Braun
Tendances
(20)
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
Parikshit Ram – Senior Machine Learning Scientist, Skytree at MLconf ATL
RasterFrames + STAC
RasterFrames + STAC
Hadoop and Cascading At AJUG July 2009
Hadoop and Cascading At AJUG July 2009
How to Feed a Data Hungry Organization – by Traveloka Data Team
How to Feed a Data Hungry Organization – by Traveloka Data Team
Big data real time architectures
Big data real time architectures
Big Data Analysis : Deciphering the haystack
Big Data Analysis : Deciphering the haystack
Yahoo! Mail antispam - Bay area Hadoop user group
Yahoo! Mail antispam - Bay area Hadoop user group
Karmasphere hadoop-productivity-tools
Karmasphere hadoop-productivity-tools
Real-Time Anomoly Detection with Spark MLib, Akka and Cassandra by Natalino Busa
Real-Time Anomoly Detection with Spark MLib, Akka and Cassandra by Natalino Busa
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Unified Framework for Real Time, Near Real Time and Offline Analysis of Video...
Streaming computing: architectures, and tchnologies
Streaming computing: architectures, and tchnologies
An excursion into Graph Analytics with Apache Spark GraphX
An excursion into Graph Analytics with Apache Spark GraphX
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Text Analytics Summit 2009 - Roddy Lindsay - "Social Media, Happiness, Petaby...
Emphemeral hadoop clusters in the cloud
Emphemeral hadoop clusters in the cloud
Dataiku Flow and dctc - Berlin Buzzwords
Dataiku Flow and dctc - Berlin Buzzwords
Case study- Real-time OLAP Cubes
Case study- Real-time OLAP Cubes
A Hadoop Primer
A Hadoop Primer
Hadoop - Simple. Scalable.
Hadoop - Simple. Scalable.
Getting Started on Hadoop
Getting Started on Hadoop
Realtime Data Analysis Patterns
Realtime Data Analysis Patterns
Similaire à Non-Relational Databases: This hurts. I like it.
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
Reynold Xin
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
Christopher Curtin
Big Data - An Overview
Big Data - An Overview
Arvind Kalyan
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
javier ramirez
A look under the hood at Apache Spark's API and engine evolutions
A look under the hood at Apache Spark's API and engine evolutions
Databricks
NO SQL: What, Why, How
NO SQL: What, Why, How
Igor Moochnick
Azure Cosmos DB - Technical Deep Dive
Azure Cosmos DB - Technical Deep Dive
Andre Essing
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
Dataiku
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
SoftServe
Big data vahidamiri-tabriz-13960226-datastack.ir
Big data vahidamiri-tabriz-13960226-datastack.ir
datastack
Intro to Spark development
Intro to Spark development
Spark Summit
PyData 2015 Keynote: "A Systems View of Machine Learning"
PyData 2015 Keynote: "A Systems View of Machine Learning"
Joshua Bloom
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
BigDataEverywhere
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Josef A. Habdank
Brandon
Brandon
Brandon Smith
Azure Cosmos DB - NoSQL Strikes Back (An introduction to the dark side of you...
Azure Cosmos DB - NoSQL Strikes Back (An introduction to the dark side of you...
Andre Essing
Why Spark Is the Next Top (Compute) Model
Why Spark Is the Next Top (Compute) Model
Dean Wampler
Introduction to Spark Training
Introduction to Spark Training
Spark Summit
Not only SQL
Not only SQL
Niklas Gustavsson
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Alluxio, Inc.
Similaire à Non-Relational Databases: This hurts. I like it.
(20)
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
(Berkeley CS186 guest lecture) Big Data Analytics Systems: What Goes Around C...
UnConference for Georgia Southern Computer Science March 31, 2015
UnConference for Georgia Southern Computer Science March 31, 2015
Big Data - An Overview
Big Data - An Overview
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
Ingesting Over Four Million Rows Per Second With QuestDB Timeseries Database ...
A look under the hood at Apache Spark's API and engine evolutions
A look under the hood at Apache Spark's API and engine evolutions
NO SQL: What, Why, How
NO SQL: What, Why, How
Azure Cosmos DB - Technical Deep Dive
Azure Cosmos DB - Technical Deep Dive
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
Dataiku - hadoop ecosystem - @Epitech Paris - janvier 2014
Essential Data Engineering for Data Scientist
Essential Data Engineering for Data Scientist
Big data vahidamiri-tabriz-13960226-datastack.ir
Big data vahidamiri-tabriz-13960226-datastack.ir
Intro to Spark development
Intro to Spark development
PyData 2015 Keynote: "A Systems View of Machine Learning"
PyData 2015 Keynote: "A Systems View of Machine Learning"
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Big Data Everywhere Chicago: Apache Spark Plus Many Other Frameworks -- How S...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Extreme Apache Spark: how in 3 months we created a pipeline that can process ...
Brandon
Brandon
Azure Cosmos DB - NoSQL Strikes Back (An introduction to the dark side of you...
Azure Cosmos DB - NoSQL Strikes Back (An introduction to the dark side of you...
Why Spark Is the Next Top (Compute) Model
Why Spark Is the Next Top (Compute) Model
Introduction to Spark Training
Introduction to Spark Training
Not only SQL
Not only SQL
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Ultra Fast Deep Learning in Hybrid Cloud using Intel Analytics Zoo & Alluxio
Dernier
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
wesley chun
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Khushali Kathiriya
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Edi Saputra
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
apidays
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
Igalia
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Principled Technologies
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Neo4j
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
The Digital Insurer
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
RTylerCroy
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
Gabriella Davis
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
jfdjdjcjdnsjd
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
wesley chun
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Safe Software
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
sammart93
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
panagenda
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
Rafal Los
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Anna Loughnan Colquhoun
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
Remote DBA Services
Dernier
(20)
Automating Google Workspace (GWS) & more with Apps Script
Automating Google Workspace (GWS) & more with Apps Script
Artificial Intelligence Chap.5 : Uncertainty
Artificial Intelligence Chap.5 : Uncertainty
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Repurposing LNG terminals for Hydrogen Ammonia: Feasibility and Cost Saving
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
Apidays New York 2024 - Scaling API-first by Ian Reasor and Radu Cotescu, Adobe
A Year of the Servo Reboot: Where Are We Now?
A Year of the Servo Reboot: Where Are We Now?
Boost PC performance: How more available memory can improve productivity
Boost PC performance: How more available memory can improve productivity
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
Workshop - Best of Both Worlds_ Combine KG and Vector search for enhanced R...
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
TrustArc Webinar - Unlock the Power of AI-Driven Data Discovery
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
Bajaj Allianz Life Insurance Company - Insurer Innovation Award 2024
🐬 The future of MySQL is Postgres 🐘
🐬 The future of MySQL is Postgres 🐘
A Domino Admins Adventures (Engage 2024)
A Domino Admins Adventures (Engage 2024)
presentation ICT roal in 21st century education
presentation ICT roal in 21st century education
Powerful Google developer tools for immediate impact! (2023-24 C)
Powerful Google developer tools for immediate impact! (2023-24 C)
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
From Event to Action: Accelerate Your Decision Making with Real-Time Automation
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Strategize a Smooth Tenant-to-tenant Migration and Copilot Takeoff
Why Teams call analytics are critical to your entire business
Why Teams call analytics are critical to your entire business
The 7 Things I Know About Cyber Security After 25 Years | April 2024
The 7 Things I Know About Cyber Security After 25 Years | April 2024
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
TrustArc Webinar - Stay Ahead of US State Data Privacy Law Developments
Data Cloud, More than a CDP by Matt Robison
Data Cloud, More than a CDP by Matt Robison
Strategies for Landing an Oracle DBA Job as a Fresher
Strategies for Landing an Oracle DBA Job as a Fresher
Non-Relational Databases: This hurts. I like it.
1.
Non-Relational Databases: This
hurts. I like it. Christopher Groskopf / bouvard / @onyxfish
2.
3.
First! A Hypothetical
4.
I want to
query space.
5.
6.
100,000 stars
7.
3.5 years of
constant observation
8.
Sensitive measurements
9.
How would you
store this data so that your researchers can analyze it effectively?
10.
(Hint: It is
probably not sqlite on a thumb drive.)
11.
The Relational Model
12.
13.
Enforces data integrity
14.
Minimizes repetition
15.
Proven
16.
17.
18.
Joins rapidly become
a bottleneck
19.
Difficult to scale
up
20.
Gets in the
way of parallelization
21.
Optimization may mitigate
the benefits of normalization
22.
The Non-Relational Model
23.
24.
Master ↔ Master
replication
25.
Scales well
26.
Map/Reduce means everything
runs in parallel
27.
28.
Integrity-enforcement migrates to
code
29.
Limited ORM tooling
30.
Significant learning curve
31.
Proven only in
a subset of cases
32.
Second! Platforms
33.
34.
Often, they offer
Master ↔ Master replication
35.
In most cases
they store schema-less data
36.
Typically they scale
by “automatic” sharding
37.
Sometimes they offer
“eventual consistency”
38.
For the most
part they are fast
39.
Generally they are
targeted at web applications
40.
Frequently we can't
define what they are
41.
42.
Imagine if Memcache
was your database
43.
That is more
or less what an NRDB is
44.
Except that everything
is permanently “cached” to disk
45.
And only the
most common result sets are in held in RAM (it could be all of them)
46.
In most cases
this is faster than computing fresh results based on indices (that is, SQL)
47.
48.
Berkeley DB
49.
BigTable
50.
Cassandra
51.
CouchDB
52.
HyperTable
53.
MongoDB
54.
Project Voldemort
55.
SimpleDB
56.
Tokyo Cabinet
57.
58.
Berkeley DB ->
59.
BigTable ->
60.
Cassandra ->
61.
CouchDB ->
62.
HyperTable ->
63.
MongoDB ->
64.
Project Voldemort ->
65.
SimpleDB ->
66.
67.
This is not
a fad.
68.
69.
Unstructured data
70.
Massive datasets (broad
> deep)
71.
Fuzzy and/or fault
tolerant data
72.
Versioned data
73.
Logging
74.
When eventual consistency
is good enough
75.
If you are
storing a JSON or XML string in your SQL database: I Have Your Medicine
76.
77.
Deeply hierarchical datasets
78.
Data integrity that
must be enforced by a DBA
79.
High security applications
where the database must enforce that security (LAN/WAN facing)
80.
Transactional data (banking,
analytics, etc.)
81.
Usage is highly
unpredictable, combinatorial, or likely to change suddenly
82.
Third! Voter's Daily
and CouchDB
83.
84.
85.
86.
Understood by the
Gov2.0 community
87.
Reusable / Educational
/ Transparent
88.
89.
“Speaks” JSON
90.
“Thinks” Javascript (optionally,
Python)
91.
RESTful API
92.
Pre-collates Views (on
insert) for fast reads
93.
Supports Master ↔
Master replication
94.
“Futon” management interface
95.
Written in Erlang
96.
An Example JSON
Document { " _id ": "2006-12-06T00:00:00Z - C-SPAN House Ways and Means Committee Schedule Scraper" , " _rev ": "1-2ca577e0a4a25ad2704fdf5a20161f9f" , " datetime ": "2006-12-06T00:00:00Z" , " end_datetime ": null , " title ": "Hearing on Patient Safety and Quality Issues in End Stage Renal Disease Treatment" , " description ": null , " branch ": "Legislative" , " entity ": "House of Representatives" , " source_url ": "http://www3.capwiz.com/c-span/dbq/officials/schedule.dbq?committee= hways&command=committee_schedules&chambername=House&chamber=H& period=" , " source_text ": "<span class=amp;quot;cwnormalboldamp;quot;>DECEMBER 06, 2006<br /></span> 000a0009<span class=amp;quot;cwnormalamp;quot;>Hearing on Patient Safety and Quality Issues in End Stage Renal Disease Treatment<br /></span>" , " access_datetime ": "2009-09-28T04:19:02Z" , " parser_name ": "C-SPAN House Ways and Means Committee Schedule Scraper" , " parser_version ": "0.1" }
97.
98.
99.
100.
Harnessing “high availability”
requires a large up-front investment of development time
101.
Map/Reduce and SQL
shouldn't even be used in the same sentence (GQL is a stupid name)
102.
Schema-less data is
fantastic
103.
Integrity checking in
code is not so bad (that is what abstraction is for)
104.
Doing Joins in
code is actually very liberating
105.
106.
But you ought
to learn one anyway
107.
It's not just
for Twitter and bleeding edge startups
108.
Amazon, Facebook, Google,
IBM, and Microsoft all get this
109.
Sometimes it is
simply the right tool for the job
110.
111.
CouchDB & Map/Reduce
Emulator: http://labs.mudynamics.com/wp-content/uploads/2009/04/icouch.html
112.
NASA's Kepler Mission:
http://kepler.nasa.gov/
113.
ReadWriteWeb on NRDBs:
http://www.readwriteweb.com/enterprise/2009/02/is-the-relational-database-doomed.php
114.
Voter's Daily:
http://github.com/bouvard/votersdaily
Télécharger maintenant