Call Girls in Sarai Kale Khan Delhi 💯 Call Us 🔝9205541914 🔝( Delhi) Escorts S...
Asking Why
1. Asking “Why?”
A lesson for Data Scientists and
those who manage them
Adapted from a post by Mike Stringer &
Dean Malmgren, founders of Datascope
???
2. The other day we had a
conversation with a
bespectacled senior data
scientist at another
organization (named X to
protect the innocent).
3.
4. Many of us have had similar
conversations with people like
X, and many of us have been
X before.
5. Data scientists, being curious
individuals, are often drawn to projects
because:
☑ they’re interesting
☑ they’re fun
☑ they’re technically challenging
☑ their boss heard about “big data” in
the Wall Street Journal
6. These reasons are all
distinctly different from trying
to solve an important
problem.
8. …and there are established
experts already working on
them.
Operations Product
Development
Strategy
Human
Resources Marketing
IT R&DSales
9. Yet these roles increasingly have
an opportunity to use data in
innovative ways, to make dents
in long-standing problems where
quantitative approaches have
previously been impossible.
Operations Product
Development
Strategy
Human
Resources Marketing
IT R&DSales
10. To tap this abundant resource
of useful problems to solve,
data scientists must:
11. 1. learn from business domain
experts about real problems
13. 3. focus on problems that
actually improve the business.
14. Going in any different order is a
recipe for disillusionment about big
data’s true potential.
Starting with a real problem instead
of starting with some interesting
dataset often leads data scientists
down a completely different—and
much more fruitful—path.
16. In 2010, Brian Uzzi introduced
us to Daegis, an e-discovery
services provider
17. When a company gets sued, they
have to provide all documents
relevant to the case.
E-discovery companies like Daegis
use a combination of technology and
lawyers to help sued companies
provide these documents, without
providing anything they don’t need to.
18. Early conversations circled
around “social network
analysis”.
Daegis’ client datasets
contained millions of emails we
could parse, study and visualize!
20. But we caught ourselves, and
asked one important question.
Why?
21.
22. Instead of social networks, we
made the first phase of our
project building a quick
prototype using data from the
Text Retrieval Conference
(TREC).
23. We demonstrated that our
transductive learning
algorithms could reduce the
number of documents that
needed to be reviewed by
80-99%.
24. This was huge!
We were going to help Daegis gain
a tremendous advantage and
Daegis’ clients would be able to
defend themselves from frivolous
lawsuits.
+1 for the good guys. Right?
25.
26. There’s that “why” again.
Had we asked about this at the
beginning of the project we
would’ve known the
importance of defensibility.
27. After more design iterations
(see our Strata presentation or
slides if you’re interested), we
arrived at some insights: what
we developed needed to be
educational, transparent, and
understandable.
28. By the end, if you had to summarize
the project, it would be closer to
“educating attorneys about
information retrieval” than “social
network analysis.”
The final result is a product that
Daegis sells under the name
Acumen.
31. But beware.
The answers to this deceptively
simple question may surprise you,
take you into challenging uncharted
territory, and inspire you to think
about problems in completely
different ways.
32. Learn more about us at http://datasco.pe
Thanks for your attention.