SlideShare une entreprise Scribd logo
1  sur  73
Télécharger pour lire hors ligne
DBMS Research:
                        First 50 Years,
                        Next 50 Years
                      Jeffrey F. Naughton




Jeffrey F. Naughton
Reactions	
  to	
  This	
  Keynote	
  From	
  
                 Colleagues…	
  
                                                                                                  Jeffrey	
  F.	
  Naughton	
  


Assistant	
  	
               “Cool,	
  I	
  look	
  forward	
  to	
  it.”	
  
Professor	
  	
       ?	
  
                              (upon	
  hearing	
  what	
  I	
  proposed	
  to	
  talk	
  about):	
  	
  
 Associate	
                  “But	
  how	
  are	
  you	
  going	
  to	
  make	
  that	
  
 Professor	
  	
  
    #1	
  	
  	
  
                      ?	
     interesting?”	
  
                              “Well,	
  you	
  have	
  reached	
  the	
  age	
  where	
  
  Associate	
  
  Professor	
  	
     ?	
     you	
  can	
  scratch	
  your	
  butt	
  in	
  public,	
  go	
  
     #2	
                     ahead.”	
  
  Emeritus	
                   “Don’t	
  do	
  it.	
  Giving	
  a	
  keynote	
  means	
  
  Professor	
         ?	
      you	
  are	
  a	
  washed-­‐up	
  has-­‐been.”	
  
                                                                                                                  2	
  
My	
  Goals…	
  
          Talk	
  about	
  why	
  I	
  think	
  DBMS	
  research	
  has	
  been	
  so	
  
          rewarding	
  over	
  the	
  past	
  50	
  years	
  

                       Some	
  personal	
  anecdotes	
  from	
  a	
  “washed-­‐up	
  
                       has-­‐been”	
  about	
  fun	
  work	
  that	
  would	
  be	
  
                       discouraged	
  today	
  

                                 Discuss	
  why	
  the	
  next	
  50	
  years	
  are	
  at	
  
                                 risk	
  and	
  what	
  we	
  can	
  do	
  about	
  it	
  
                                 	
  
  Avoid	
  
scratching	
  

                                        Make	
  this	
  “interesting”	
  
Long,	
  Long	
  Ago…	
  
William	
  McGee	
  published	
  a	
  J.	
  ACM	
  
paper	
  “Generalization: Key to
Successful Electronic Data
Processing.”
[Jan. ’59]
	
  
Great	
  first	
  sentence:	
  	
  
“The wholesale acceptance of
electronic data processing
machines […] is ample proof
that management is beginning
to recognize the potential of
this new tool.”
The	
  More	
  Things	
  Change…	
  

Another	
  sentence	
  from	
  
McGee’s	
  intro:	
  
	
  
“management is
increasingly concerned over
the continued high cost of
installing and maintaining
data processing systems.”
	
  
Context:	
  McGee’s	
  Machine	
  
•  IBM	
  702	
  




                                            6	
  
More	
  On	
  The	
  702	
  

•  Memory	
  (cathode	
  ray	
  tubes):	
  20K	
  bytes.	
  
•  Disk	
  (magnetic	
  drum):	
  60K	
  bytes.	
  
•  CPU:	
  10MIPH	
  	
  
    –  MIPH	
  =	
  Million	
  Instructions	
  Per	
  Hour	
  
    –  For	
  those	
  of	
  you	
  without	
  calculators,	
  that	
  is	
  about	
  
       0.000278	
  MIPS	
  
    –  Or,	
  if	
  you	
  like,	
  a	
  0.000000278	
  GHz	
  processor.	
  
                                                                                         7	
  
A	
  Note	
  About	
  This	
  Early	
  History…	
  


           The	
  commercial,	
  technical,	
  
           and	
  intellectual	
  seeds	
  that	
  
            grew	
  into	
  our	
  community	
  
             were	
  planted	
  long	
  ago	
  




                                                      8	
  
Problem:	
  
          File-­‐Dependent	
  Programming	
  

•  Write	
  a	
  sort	
  program	
  for	
  the	
  
   payroll	
  file	
  
•  Write	
  another	
  sort	
  program	
  for	
  
   the	
  accounts	
  receivable	
  file	
  
•  No	
  sharing	
  	
  
    –  between	
  files	
  
    –  between	
  applications	
  
    –  between	
  institutions	
  


   Solution:	
  “generalized	
  programming”	
  
                                                     9	
  
What	
  Do	
  You	
  Need	
  for	
  Generalized	
  
                 Programming?	
  
 •  A	
  description	
  of	
  the	
  record	
  layouts	
  in	
  the	
  file	
  
 •  Ideally,	
  in	
  some	
  place	
  should	
  also	
  capture	
  
    elements	
  in	
  common	
  to	
  multiple	
  files	
  
        Schemas	
  

•  Programs	
  that	
  	
  
    –  Interpret	
  these	
  descriptions	
  and	
  	
  
    –  Make	
  it	
  easy	
  to	
  express	
  generally	
  useful	
  operations	
  
       (sorting,	
  reporting)	
  on	
  the	
  files	
  

          Data	
  Manipulation	
  Languages	
                                    10	
  
Another	
  Problem:	
  Updates	
  
•  How	
  do	
  you	
  undo	
  
   mistakes?	
  	
  
•  How	
  do	
  you	
  handle	
  
   failures	
  in	
  middle	
  of	
  
   processing?	
  
  Solution:	
  in	
  a	
  separate	
  file,	
  
  write	
  before/after	
  images	
  
  of	
  updated	
  records.	
  
  (Logging)	
  
                                                 11	
  
So	
  What’s	
  the	
  Point?	
  



1940S              1950s        …      1970s                1990s               2000s        … TODAY


                        The	
  seeds	
  of	
  a	
  rich	
  and	
  vibrant	
  database	
  
                        management	
  research	
  community	
  were	
  planted	
  

 In	
  particular,	
  
         •  Strong	
  commercial	
  interest	
  with	
  $$$	
  behind	
  it	
  
         •  Common	
  non-­‐trivial	
  technical	
  challenges	
  related	
  to	
  data	
  management	
  
         •  A	
  set	
  of	
  problems	
  amenable	
  to	
  abstraction	
  and	
  generalization	
  
Fast	
  Forward	
  to	
  Today…	
  
 1940S            1950s      …     1970s             1990s            2000s   … TODAY




•  These	
  three	
  key	
  factors	
  	
  
     –  commercial	
  interest	
  
     –  common	
  data	
  management	
  challenges	
  	
  
     –  attractive	
  problems	
  	
  
	
  	
  	
  	
  	
  	
  are	
  present	
  now	
  more	
  than	
  ever	
  

                     That	
  is	
  the	
  good	
  news!	
  
                                                                                   13	
  
What	
  About	
  the	
  Research	
  Community?	
  
•  Maybe	
  it	
  is	
  in	
  good	
  shape	
  
     –  But	
  I	
  am	
  not	
  so	
  sure	
  

•  Maybe	
  it	
  is	
  headed	
  in	
  a	
  good	
  direction	
  
     –  But	
  I	
  am	
  less	
  sure	
                              Modern DB Research


•  We	
  worry	
  a	
  lot	
  about	
  what	
  to	
  work	
  on	
  
     –  I	
  am	
  not	
  worried	
  about	
  this	
  

•  We	
  are	
  increasingly	
  valuing	
  and	
  
   rewarding	
  the	
  wrong	
  things	
  
     –  I	
  am	
  worried	
  about	
  this	
  
                                                                                      14	
  
The	
  Problem	
  I	
  Worry	
  About	
  
The	
  combination	
  of:	
  
    –  Pressure	
  to	
  publish	
  lots	
  of	
  papers	
  +	
  
    –  Low	
  acceptance	
  rates	
  +	
  
    –  Bad	
  reviewing	
  
is	
  sucking	
  the	
  air	
  out	
  of	
  our	
  community	
  
                                      Modern	
  Database	
  
                                        Researcher	
  




                                                                    15	
  
Doomsday	
  Scenario	
  

Being	
  a	
  database	
  researcher	
  means	
  a	
  life	
  
  filled	
  with	
  all	
  the	
  joy,	
  meaning,	
  and	
  
  creativity	
  of	
  studying	
  for	
  and	
  taking	
  
              college	
  entrance	
  exams	
  




                                                            16	
  
Doomsday	
  is	
  Here	
  

•  We	
  behave	
  as	
  if	
  
     –    	
  Researchers	
  are	
  “students”	
  
     –    	
  PC	
  members	
  are	
  “graders”	
  
     –    	
  Publications	
  pose	
  “exam	
  questions”	
  	
  
     –    	
  New	
  submissions	
  are	
  “answers”	
  



•  These	
  “exams”	
  are	
  
   perceived	
  as	
  the	
  
   only	
  path	
  to	
  success	
  
   in	
  the	
  community	
  
                                                                    17	
  
Some	
  More	
  on	
  the	
  Analogy…	
  
•  The	
  “students”	
  are	
  not	
  that	
  interested	
  in	
  the	
  questions	
  



•  The	
  “graders”	
  are	
  even	
  less	
  interested	
  in	
  the	
  answers	
  



•  No	
  one	
  else	
  is	
  interested	
  in	
  either,	
  caring	
  only	
  about	
  the	
  
   scores	
  

              Everyone	
                                           what’s the score?
                                                                                           18	
  
So	
  What	
  is	
  Wrong	
  With	
  This?	
  
                                                    What	
  kind	
  of	
  people	
  will	
  
Who	
  wants	
  to	
  spend	
  their	
  life	
     this	
  scenario	
  attract	
  to	
  our	
  
taking	
  meaningless	
  exams?	
                                field?	
  



                                                                   What	
  kind	
  of	
  
                                                                    work	
  will	
  be	
  

           +”	
  
                                                                  common	
  in	
  this	
  
        “A                                                         environment?	
  




                                                                                            19	
  
My	
  Take…	
  
The	
  problem	
  isn’t	
  	
  
    	
  “Researchers	
  today!	
  They	
  need	
  to	
  be	
  more	
  like	
  
    	
  	
  	
  the	
  researchers	
  in	
  the	
  good	
  old	
  days!”	
  


Rather,	
  it	
  is	
  more	
  	
  
    	
  “we	
  need	
  to	
  re-­‐think	
  the	
  environment	
  we	
  have	
  	
  
    	
  	
  	
  created	
  and	
  what	
  it	
  encourages”	
  


   More	
  on	
  this	
  later,	
  first	
  some	
  “old	
  war	
  stories”	
  
                                                                                      20	
  
…about	
  work	
  I	
  found	
  rewarding	
  and	
  fun	
  
                                                              21	
  
War	
  Story	
  #1:	
  The	
  Sorting	
  Record	
  
 1940S                  1950s           …       1970s                 1980s               2000s        … TODAY


                                                                              DeWitt	
  had	
  a	
  
                                                                 parallel	
  DBMS	
  project	
  (Gamma)	
  

•  Very	
  fortunately	
  for	
  me,	
  he	
  let	
  me	
  join	
  
•  One	
  of	
  the	
  things	
  we	
  worked	
  on:	
  	
  
       –  parallel	
  sorting	
  on	
  an	
  Intel	
  iPSC-­‐2	
  
          Hypercube	
  
       –  (Donovan	
  Schneider	
  was	
  key	
  to	
  this	
  
          project.)	
  

•  Our	
  main	
  idea:	
  
       –  Use	
  parallel	
  sampling	
  to	
  
          probabilistically	
  pick	
  range	
  partitioning	
  
          cutoffs	
  
       –  Repartition	
  the	
  data	
  
       –  Sort	
  locally	
  
                                                                                                              22	
  
In	
  Those	
  Days…	
  
•  It	
  still	
  made	
  (a	
  little)	
  sense	
  to	
  try	
  the	
  original	
  sort	
  
   benchmark:	
  	
  
    –  1M	
  100	
  byte	
  records	
  (now	
  perhaps	
  known	
  as	
  
          wristwatch	
  sort)	
  

•  I	
  think	
  our	
  time	
  was	
  something	
  like	
  30	
  seconds	
  


•  We	
  mentioned	
  it	
  to	
  Jim	
  Gray	
  when	
  he	
  
   visited	
  Wisconsin.	
  
    –  (it	
  seemed	
  a	
  safe	
  enough	
  thing	
  to	
  do)	
  


                                                                                               23	
  
Maybe	
  6	
  Months	
  Later…	
  
•  Gray	
  called	
  me	
  up	
  and	
  invited	
  me	
  to	
  visit	
  the	
  
   DEC	
  Bay	
  Area	
  Research	
  Center	
  

•  Said	
  they	
  wanted	
  to	
  hear	
  about	
  my	
  work	
  (that	
  
   alone	
  should	
  have	
  made	
  me	
  suspicious)	
  




                                                                                  24	
  
With	
  Some	
  Trepidation	
  I	
  Flew	
  Out	
  	
  
    (folks	
  from	
  Wisconsin	
  don’t	
  require	
  too	
  much	
  reason	
  	
  
                       to	
  fly	
  to	
  CA	
  in	
  the	
  winter)	
  




 CA	
     WI	
  




                                                                                       25	
  
At	
  the	
  Talk…	
  
•  Oddly,	
  quite	
  a	
  few	
  people	
  were	
  
   there	
  
•  At	
  the	
  end,	
  they	
  asked	
  	
  
   	
  “are	
  you	
  finished?”	
  

•  When	
  I	
  said	
  yes,	
  	
  
      –  A	
  lot	
  of	
  cameras	
  appeared.	
  
      –  Also	
  a	
  large	
  trophy	
                          World’s
                                                                Record for
                                                                 Sorting
•  They	
  handed	
  it	
  to	
  me	
  and	
  said	
  
   we	
  had	
  set	
  the	
  world	
  record	
  for	
  
   sorting!	
  

                                                                             26	
  
Continuing	
  With	
  the	
  Story…	
  
•  I	
  basked	
  in	
  the	
  glory	
  for	
  
   about	
  5	
  seconds	
  

•  Then	
  they	
  announced:	
  
     –  “We	
  have	
  beaten	
  your	
  
        record.”	
  
     –  “Please	
  hand	
  the	
  trophy	
  to	
  
        Chris,	
  we	
  want	
  a	
  picture	
  of	
  
        the	
  hand-­‐over.”	
  



                                                         27	
  
Would	
  You	
  Do	
  This	
  Today?	
  
  •  It	
  was	
  perhaps	
  6	
  months	
  work	
  for	
  three	
  researchers	
  	
  
  •  Yielded	
  one	
  paper	
  in	
  a	
  minor	
  conference	
  
                Today	
  it	
  would	
  be	
  zero	
  papers!	
  	
  

I	
  can	
  see	
  the	
  reviews:	
  
       “Reject:	
  Not	
  enough	
  math,	
  can’t	
  be	
  deep.”	
  
     “Reject:	
  Superficial	
  performance	
  evaluation.”	
  
     “Reject:	
  Their	
  sorting	
  program	
  cannot	
  converse	
  in	
  
     English	
  like	
  the	
  Star	
  Trek	
  computer.”	
  

                                                                                      28	
  
Today	
  We	
  Probably	
  Couldn’t	
  
 Spare	
  the	
  Time	
  to	
  do	
  This	
  

 •  A	
  team	
  of	
  halfway	
  decent	
  researchers:	
  	
  
     –  should	
  (must?)	
  write	
  more	
  than	
  one	
  paper	
  in	
  six	
  
        months	
  
     –  should	
  do	
  much	
  more	
  reviewer-­‐proof	
  work	
  than	
  
        implementing	
  and	
  running	
  a	
  sorting	
  algorithm	
  




                                                                                 29	
  
Was	
  it	
  Worth	
  Doing?	
  

Yes!	
  
    –  Taught	
  me	
  a	
  ton	
  about	
  sampling	
  in	
  
       parallel	
  DBMS	
  

    –  Convinced	
  me	
  of	
  the	
  importance	
  of	
  
       processor	
  architecture	
  in	
  performance	
  	
  
           •  the	
  BARC	
  guys	
  smoked	
  us	
  with	
  a	
  
              uniprocessor	
  implementation	
  by	
  making	
  
              clever	
  use	
  of	
  the	
  cache	
  
                                                                     YOU BET!
    –  It	
  was	
  really	
  fun	
  and	
  kept	
  me	
  
       interested	
  and	
  connected	
  with	
  the	
  
       field	
  
                                                                                30	
  
War	
  Story	
  #2:	
  007	
  Benchmark	
  
Carey,	
  DeWitt,	
  and	
  I	
  decided	
  to	
  benchmark	
  object	
  oriented	
  DBMS	
  


                   +	
              +	
                  =	
  
•  We	
  	
  
     –  designed	
  the	
  benchmark	
  
     –  negotiated	
  with	
  four	
  vendors	
  to	
  
        get	
  them	
  involved	
  
     –  implemented	
  it	
  on	
  five	
  systems	
  	
  
           •  four	
  commercial	
  plus	
  our	
  own	
  
     –  started	
  running	
  tests	
  
                  This	
  was	
  a	
  huge	
  amount	
  of	
  work!	
  
                                                                                         31	
  
After	
  Months	
  of	
  Work…	
  

•  We	
  received	
  a	
  fax	
  from	
  a	
  law	
  firm	
  
   representing	
  one	
  of	
  the	
  companies	
  

•  The	
  fax	
  ordered	
  us	
  	
                               FAX	
  
     –  to	
  stop	
  benchmarking	
  their	
  system	
  	
  
     –  and	
  to	
  destroy	
  all	
  copies	
  of	
  their	
  
        system	
  that	
  we	
  might	
  have	
  


   We	
  reacted	
  extremely	
  maturely	
  as	
  usual…	
  
                                                                             32	
  
The	
  Next	
  Morning…	
  

•  I	
  woke	
  up	
  early	
  
•  Made	
  a	
  big	
  pot	
  of	
  coffee	
  
•  Drank	
  most	
  of	
  it	
  

•  Fired	
  off	
  an	
  angry	
  fax	
  	
  
    –  demanding	
  that	
  the	
  company	
  destroy	
  all	
  copies	
  
       of	
  our	
  benchmark	
  implementation	
  

    We	
  knew	
  they	
  were	
  using	
  it	
  for	
  internal	
  testing.	
  	
  
                           We	
  were	
  pissed.	
                                   33	
  
15	
  Minutes	
  After	
  Sending	
  the	
  Fax	
  
•  I	
  received	
  a	
  phone	
  call	
  

•  The	
  following	
  conversation	
  ensued:	
  
    –  Confused Woman: “Professor Naughton?”
    –  Indignant Professor: “Yes?”
    –  Confused Woman: “This is First National Bank, we
       received a fax from you about destroying a
       benchmark…”
    –  Sheepish Professor: “OK, please comply, thanks.”


                                                          34	
  
Would	
  This	
  Project	
  be	
  Smart	
  Now?	
  

•  Probably	
  not	
                                                        #1:	
  	
  
                                                                 ie wer	
  
    –  Three	
  profs,	
  18+	
  months,	
  one	
  paper?	
   Rev
    –  Again,	
  today	
  would	
  be	
  zero	
  papers.	
  
                                                                                     Reject.
                                                                      No new algorithms!



                                                                           Rej
•  Reviews	
  today	
  would	
  be:	
                          Sho
                                                                     uld    ect.
                                                                         co
                                                                     SPE mpare
                                                                          Cma    with
                                                                             rk!


                                                                     Accept.
                                                                Lots of graphs!
                                                                       Reviewer	
  #3:	
  	
  

                                                                                                 35	
  
Worth	
  It?	
  
•  We	
  certainly	
  learned	
  a	
  lot	
  about:	
  
    –  OODBMS	
  technology	
  
    –  Stresses	
  companies	
  (and	
  researchers)	
  
       are	
  under	
  with	
  respect	
  to	
  
       benchmarking	
  
    –  Legal	
  issues	
  
    –  Pitfalls	
  in	
  designing	
  benchmarks	
  
    –  Interaction	
  with	
  popular	
  tech	
  press	
  



             Positive	
  for	
  all	
  of	
  us,	
  but	
  would	
  be	
  	
  
            discouraged	
  by	
  today’s	
  environment	
  
                                                                                 36	
  
Point	
  of	
  the	
  War	
  Stories…	
  
•  At	
  least	
  for	
  me,	
  some	
  of	
  the	
  most	
  rewarding	
  
   work	
  did	
  not	
  have	
  a	
  publication	
  as	
  its	
  goal	
  
                           Goal	
  

•  At	
  least	
  for	
  me,	
  some	
  of	
  the	
  most	
  rewarding	
  
   work	
  did	
  not	
  result	
  in	
  many	
  publications	
  
                         Result	
  

•  At	
  least	
  for	
  me,	
  some	
  of	
  the	
  most	
  rewarding	
  
   work	
  did	
  not	
  result	
  in	
  any	
  papers	
  that	
  would	
  
   pass	
  today’s	
  program	
  committees	
  
                          Result	
          Accept.
                                                                              37	
  
These	
  days	
  I	
  fear	
  we	
  are	
  
•  Discouraging	
  work	
  motivated	
  by	
  asking:	
  
   –  Can	
  we	
  learn	
  something?	
  
   –  Can	
  we	
  build	
  something?	
  
   –  Can	
  we	
  prove	
  something?	
  
   –  Can	
  we	
  improve	
  something?	
  


•  Encouraging	
  work	
  motivated	
  by	
  asking:	
  
   –  Can	
  I	
  write	
  a	
  paper	
  before	
  the	
  deadline?	
  

                                                                          38	
  
Today…	
  
•  We	
  are	
  so	
  frenetically	
  pursuing	
  the	
  next	
  
   conference	
  deadline	
  	
  
   	
  …under	
  the	
  watchful	
  glare	
  of	
  bad	
  reviewing,	
  
   	
  …that	
  the	
  freedom	
  required	
  to	
  do	
  exploratory	
  work	
  is	
  
disappearing.	
  




                                                                                          39	
  
Three	
  Factors	
  Causing	
  Problems	
  

               Low	
  acceptance	
  rates


               Emphasis	
  on	
  paper	
  count


               Bad	
  reviewing


      Let’s	
  consider	
  them	
  in	
  order	
  
The	
  Problem	
  With	
  	
  
             Low	
  Acceptance	
  Rates	
  
•  Very	
  discouraging	
  



•  Reduces	
  tolerance	
  for	
  errors	
  in	
  reviewing	
  
•  Enforces	
  evaluation	
  of	
  work	
  by	
  three	
  randomly	
  
   chosen	
  reviewers	
  rather	
  than	
  the	
  community	
  
•  Makes	
  major	
  event	
  in	
  the	
  course	
  of	
  research	
  
   acceptance/rejection,	
  not	
  scientific	
  or	
  
   engineering	
  progress	
                                       41	
  
How	
  to	
  Fix	
  Low	
  Acceptance	
  Rates	
  
•  Increase	
  acceptance	
  rates	
  (duh!)	
  
•  Mandate	
  a	
  target	
  
•  Hoped	
  for	
  effects:	
  
   –  Fewer	
  rejections	
  
   –  Fewer	
  papers	
  sloshing	
  from	
  conference	
  to	
  
      conference	
  
   –  Reduce	
  the	
  damage	
  done	
  by	
  bad	
  reviewing	
  



                                                                      42	
  
Turning	
  to	
  Paper	
  Count	
  
•  Paper	
  count	
  inflation	
                                        paper
                                                                             count

     –  To	
  be	
  a	
  success,	
  people	
  feel	
  one	
  
        has	
  to	
  publish	
  a	
  lot	
  of	
  papers	
  
     –  To	
  publish	
  a	
  lot	
  of	
  papers,	
  one	
  has	
  
        to	
  get	
  past	
  a	
  lot	
  of	
  program	
  
        committees	
  

•  To	
  do	
  this	
  is	
  a	
  more	
  than	
  full	
               [No Time to Explore]
   time	
  job	
  
•  Leaves	
  too	
  little	
  room	
  for	
  
   activities	
  not	
  focused	
  on	
  
   generating	
  publications	
  
                                                                                              43	
  
Tough	
  to	
  Fix,	
  but…	
  
Let	
  people	
  in	
  on	
  a	
  secret:	
  
      –  In	
  general	
  paper	
  count	
  is	
  much	
  less	
  important	
  in	
  
         evaluaAons	
  than	
  you	
  might	
  think.	
  
      –  Stars	
  are	
  never	
  evaluated	
  by	
  paper	
  count	
  
It	
  is	
  OK	
  to	
  write	
  a	
  lot	
  of	
  papers.	
  
      –  Just	
  don’t	
  make	
  it	
  the	
  primary	
  metric	
  moAvaAng	
  
         researchers.	
  
      –  Don’t	
  let	
  it	
  block	
  those	
  who	
  have	
  a	
  different	
  “research	
  
         style.”	
  

                                            Paper	
  Count	
  
                                                   	
  =	
  	
  
                                           Quality	
  Measure	
  


                                                                                                 44	
  
Acceptance	
  Rate	
  Again	
  
•  Hypothesis:	
  emphasis	
  on	
  paper	
  count	
  can	
  be	
  
   somewhat	
  ameliorated	
  by	
  increasing	
  acceptance	
  
   rate	
  
   –  If	
  it	
  is	
  easier	
  to	
  publish	
  papers,	
  publishing	
  lots	
  of	
  them	
  
      will	
  be	
  perceived	
  as	
  less	
  impressive	
  
   –  ShiR	
  the	
  focus	
  from	
  paper	
  count	
  to	
  paper	
  quality	
  




                                                                                                45	
  
Third	
  Issue:	
  Bad	
  Reviewing	
  
•  Very	
  hard	
  to	
  fix	
  
•  Very	
  important	
  to	
  fix	
  
•  Extremely	
  important	
  to	
  discuss	
  because	
  it	
  gives	
  
   me	
  a	
  chance	
  to	
  vent	
  

                           Reviewing	
  System	
         Reviewer	
  



            Paper	
  




         Reviewer	
                                  Reviewer	
            46	
  
Caveat!	
  
•  I	
  have	
  received	
  extremely	
  helpful	
  while	
  
   ulAmately	
  negaAve	
  reviews.	
  
•  These	
  reviews	
  have	
  dramaAcally	
  helped	
  me	
  
   and	
  my	
  co-­‐authors	
  with	
  the	
  presentaAon	
  and	
  
   content	
  of	
  the	
  work.	
  
•  I	
  am	
  very	
  grateful	
  to	
  those	
  reviewers.	
  
•  These	
  “helpful	
  but	
  negaAve”	
  reviews	
  are	
  not	
  
   the	
  same	
  as	
  the	
  “bad”	
  reviews	
  I	
  will	
  discuss	
  
   next!	
  
                                                                          47	
  
One	
  problem:	
  
Reviewers	
  Hate	
  EVERYTHING!	
  

                     •  One	
  anecdote:	
  	
  
                          	
  SIGMOD	
  2010	
  
 Anonymous	
  	
     •  	
  350	
  submissions	
  
  Reviewer	
             –  Number	
  of	
  papers	
  with	
  all	
  
                            reviews	
  “accept”	
  or	
  higher:	
  1	
  
                         –  Number	
  of	
  papers	
  with	
  
                            average	
  “accept”	
  or	
  higher:	
  4	
  

                              Either	
  we	
  all	
  suck	
  or	
  
                             something	
  is	
  broken!	
  
                                                                      48	
  
PC	
  Reviewing	
  is	
  Important	
  
•  This	
  is	
  the	
  primary	
  place	
  most	
  researchers	
  get	
  most	
  of	
  
   their	
  outside	
  feedback	
  

•  This	
  feedback	
  trains	
  most	
  researchers	
  in:	
  
     –  What	
  to	
  do	
  for	
  their	
  next	
  submission	
  
     –  How	
  to	
  evaluate	
  others	
  

           Feedback	
                                                Feedback	
  

           Feedback	
                                                Feedback	
  




                      Receiving	
  dysfunctional	
  reviews	
  	
  
                    begets	
  writing	
  dysfunctional	
  reviews	
                        49	
  
Why	
  is	
  This	
  Bad?	
  

•  Discouraging	
  for	
  authors	
  
•  Devastating	
  when	
  it	
  occurs	
  
   in	
  grant	
  proposal	
  
   reviewing 	
  	
  
    –  Funding	
  agencies	
  believe	
  us	
  
       when	
  we	
  say	
  we	
  suck	
  
•  The	
  absolute	
  score	
  can	
  be	
  
   fixed	
  by	
  enforced	
  scaling	
  

  More	
  fundamental	
  problem:	
  papers	
  are	
  being	
  
            rejected	
  for	
  the	
  wrong	
  reasons	
          50	
  
What	
  is	
  Modern	
  Reviewing	
  Most	
  Like?	
  

                                          Your	
  paper	
  is	
  not	
  bullet	
  proof.	
  
•  Today	
  reviewing	
  is	
  
   like	
  grading	
  
•  Perhaps	
  because	
  so	
  
   many	
  reviewers	
  are	
  
   professsors	
  or	
                                      JECT	
  
                                                         RE
   students	
  or	
  former	
  
   students?	
  
                                  OK,	
  I	
  will	
  focus	
  all	
  my	
  energy	
  on	
  
                                  fixing	
  that	
  before	
  the	
  next	
  deadline.	
  

                                                                                               51	
  
But	
  Reviewing	
  is	
  Not	
  Grading	
  

•  When	
  grading	
  exams,	
  zero	
  credit	
  goes	
  	
  for	
  
   thinking	
  of	
  the	
  question	
  
    –  (good)	
  reviewing:	
  acknowledges	
  that	
  the	
  
       question	
  can	
  be	
  the	
  major	
  contribution	
  

•  When	
  grading	
  exams,	
  zero	
  credit	
  goes	
  for	
  a	
  
   novel	
  approach	
  to	
  solution.	
  
    –  (good)	
  reviewing:	
  acknowledges	
  that	
  a	
  novel	
  
       approach	
  can	
  be	
  more	
  important	
  than	
  the	
  
       existence	
  of	
  the	
  solution	
  
                                                                         52	
  
Bad	
  Reviewing:	
  	
  
         Rejection	
  Checklist	
  Reviewing	
  
•  Rejection	
  checklist:	
  
   –  Is	
  it	
  “difficult”?	
  	
  
   –  Is	
  it	
  “complete”?	
  
   –  Can	
  I	
  find	
  any	
  flaw?	
  
   –  Can	
  I	
  kill	
  it	
  quickly?	
  
        •  Writing	
  negative	
  reviews	
  à	
  you	
  are	
  intelligent	
  and	
  
           have	
  high	
  standards	
  
        •  Finding	
  the	
  positive	
  in	
  a	
  flawed	
  paper	
  à	
  you	
  are	
  soft,	
  
           stupid,	
  and	
  have	
  low	
  standards	
  

                                                                                                      53	
  
Bad	
  Reviewing	
  2	
  
•  The	
  “dog	
  and	
  bone”	
  model	
  of	
  reviewing	
  
     –    The	
  reviewer	
  is	
  a	
  dog	
  
     –    The	
  reviewer’s	
  previous	
  work	
  is	
  a	
  bone	
  
     –    The	
  author	
  of	
  the	
  new	
  paper	
  is	
  another	
  dog	
  trying	
  to	
  steal	
  the	
  bone	
  
     –    Response	
  by	
  the	
  reviewer:	
  growl	
  like	
  hell	
  

                                                                                             Reviewer’s	
  response	
  

                                                                                                  Grrrrrrrrrrrrrr…	
  

                  Reviewer	
  



             Reviewer’s	
  	
  
           previous	
  work	
  


Systems	
  folks	
  seem	
  much	
  much	
  worse	
  about	
  this	
  than	
  theoreticians	
  
                                                                                                                           54	
  
Bad	
  Reviewing	
  3	
  
•  The	
  ignorant	
  but	
                         I’ll	
  fake	
  it	
  
                                                          Ull	
  I	
  
   confident	
  reviewer	
                            make	
  it	
  

•  Doesn’t	
  know	
  what	
  is	
  
   going	
  on	
  but	
  would	
  rather	
  
   fake	
  it	
  than	
  admit	
  it	
  
•  Seems	
  to	
  feel	
  admission	
  of	
  
   lack	
  of	
  knowledge	
  would	
  
   reveal	
  a	
  weakness	
  
    –  The	
  exact	
  opposite	
  is	
  true	
  

                                                                             55	
  
Why	
  Are	
  Things	
  Bad?	
  
•  One	
  explanaUon:	
  reviewers	
  are	
  bad	
  people	
  
•  More	
  likely:	
  reviewers	
  are	
  being	
  poorly	
  trained	
  
•  In	
  prehistoric	
  Ames	
  
    –  We	
  had	
  face-­‐to-­‐face	
  PC	
  meeAngs	
  
    –  There	
  was	
  a	
  lot	
  of	
  accountability	
  pressure	
  	
  
    –  There	
  was	
  a	
  lot	
  of	
  coaching	
  and	
  mentoring	
  




                                                            Experience	
  
                       Experience	
  

                                        Exp	
  




                                                  Exp	
  




                                                                              56	
  
What	
  Is	
  the	
  Training	
  Today?	
  
•  Reviewers	
  are	
  trained	
  by	
  receiving	
  bad	
  
     reviews	
  from	
  other	
  reviewers	
  who	
  have	
  
     received	
  bad	
  reviews	
  in	
  the	
  past	
  
	
  




                                                                57	
  
Can	
  We	
  Change	
  This?	
  

•  Here	
  are	
  some	
  ideas,	
  with	
  the	
  
   goal:	
  
    –  Encourage	
  discussion	
  
    –  Argue	
  that	
  change	
  is	
  possible	
  
    –  Agitate	
  for	
  change	
  
    –  Some	
  are	
  deliberately	
  far	
  
       fetched!	
  




                                                       58	
  
Idea	
  #1:	
  Disarm	
  the	
  Killers	
  
•  A	
  large	
  part	
  of	
  “success”	
  in	
  
   conference	
  submissions	
  is	
  
   getting	
  a	
  lucky	
  assignment	
  of	
  
   reviewers	
  
    –  If	
  you	
  get	
  a	
  “killer”	
  reviewer,	
  you	
  
       are	
  dead	
  
    –  So	
  you	
  resubmit	
  until	
  you	
  have	
  
       three	
  non-­‐killers	
  
•  Possible	
  solution:	
  	
  
    –  Mandate	
  a	
  certain	
  percentage	
  of	
  
       “accepts”	
  from	
  each	
  reviewer	
  

                                                                   59	
  
Idea	
  #2:	
  Shine	
  Light	
  on	
  the	
  Process	
  
•  Publish	
  reviews	
  
    –  Might	
  already	
  encourage	
  
       more	
  care	
  in	
  reviewing	
  
    –  At	
  the	
  very	
  least	
  it	
  would	
  
       be	
  cathartic	
  

•  Allow	
  voting	
  for	
  best	
  
   reviews	
  
    –  Somehow	
  reward	
  best	
  
       reviewers?	
  
•  Allow	
  voting	
  for	
  worst	
  
   reviews	
  
    –  And?	
  
                                                         60	
  
Idea	
  #3:	
  Return	
  to	
  the	
  Past	
  
•  Require	
  face-­‐to-­‐face	
  PC	
  meetings	
  
•  Perhaps	
  have	
  partitioned	
  committees	
  to	
  make	
  
   this	
  tractable	
  
•  Restore	
  accountability	
  for	
  reviews	
  
•  Create	
  opportunities	
  for	
  mentorship	
  


                           Mentorship	
  
                         Accountability	
  




                                                                    61	
  
Proposal	
  #4:	
  Single-­‐Blind	
  Reviewing	
  
•  But	
  reverse	
  it:	
  
     –  Reviewer	
  doesn’t	
  know	
  author	
  
     –  Author	
  knows	
  reviewer	
  

•  So	
  
     –  You	
  know	
  who	
  is	
  criticizing	
  you,	
  but	
  not	
  who	
  you	
  are	
  criticizing	
  
     –  Would	
  certainly	
  encourage	
  more	
  thoughtful	
  reviewing	
  




                                              Reviewers	
  


                                              Authors	
  
                                                                                                            62	
  
Proposal	
  #5:	
  Eliminate	
  the	
  Problem	
  
•    No	
  reviewing!	
  
•    Accept	
  everything.	
  
•    Let	
  the	
  community	
  sort	
  things	
  out	
  over	
  time.	
  
•    Why	
  are	
  we	
  still	
  behaving	
  as	
  if	
  our	
  
     proceedings	
  are	
  printed	
  by	
  overworked	
  
     monks	
  on	
  a	
  Gutenberg	
  press?	
  




                                                                         63	
  
Wrapping	
  Up	
  the	
  Talk	
  
•  There	
  is	
  a	
  lot	
  of	
  angst	
  in	
  the	
  field	
  
    –  Where	
  are	
  the	
  big	
  new	
  ideas?	
  
    –  What	
  defines	
  us	
  as	
  a	
  community?	
  
    –  How	
  did	
  we	
  miss	
  the	
  web?	
  
    –  How	
  long	
  is	
  this	
  guy	
  going	
  to	
  talk	
  in	
  this	
  keynote?	
  

•  These	
  are	
  all	
  great	
  questions,	
  worthy	
  of	
  
   discussion	
  

      But	
  I	
  don’t	
  think	
  our	
  success	
  in	
  the	
  next	
  
       50	
  years	
  depends	
  on	
  answering	
  them	
                                64	
  
Looking	
  Forward	
  
•  My	
  crystal	
  ball	
  cannot	
  see	
  
   specific	
  technical	
  trends	
  
   50	
  years	
  out	
  
•  It	
  does	
  predict	
  that	
  the	
  
   three	
  drivers	
  	
  
     –  commercial	
  interest	
  
     –  common	
  data	
  
        management	
  challenges	
  	
  
     –  attractive	
  problems	
  	
  
will	
  exist	
  in	
  for	
  another	
  50	
  
years	
  
                                                  65	
  
So	
  what	
  do	
  we	
  need?	
  
•  We	
  will	
  be	
  OK	
  if	
  we:	
  
    –  Periodically	
  reconnect	
  to	
  these	
  three	
  drivers	
  
    –  Create	
  an	
  environment	
  that	
  attracts	
  good	
  people	
  
       and	
  gives	
  them	
  the	
  freedom	
  and	
  incentive	
  to	
  do	
  
       good	
  work	
  
•  Our	
  success	
  depends	
  on	
  both	
  of	
  these.	
  




                                                                                66	
  
This	
  is	
  Important	
  
•  As	
  a	
  research	
  community,	
  despite	
  commercial	
  
   interest	
  and	
  great	
  problems	
  to	
  work	
  on,	
  we	
  
   will	
  not	
  thrive	
  if	
  we	
  create	
  a	
  stifling,	
  
   depressing	
  environment	
  that	
  discourages	
  a	
  
   diversity	
  of	
  work.	
  




                                                                    67	
  
Who	
  is	
  Going	
  to	
  Fix	
  This?	
  
•  This	
  is	
  not	
  “us	
  vs.	
  them”	
  
•  There	
  is	
  only	
  us.	
  
•  Don’t	
  wait	
  for	
  “them”	
  to	
  change	
  things.	
  
•  We	
  are	
  “them.”	
  
	
  




                                                                   68	
  
Are	
  Things	
  Really	
  So	
  Bad?	
  
•  Not	
  entirely,	
  not	
  yet.	
  
    –  Somehow	
  good	
  work	
  still	
  gets	
  done.	
  
    –  Somehow	
  great	
  papers	
  still	
  get	
  written.	
  
    –  Somehow	
  great	
  new	
  people	
  still	
  join	
  the	
  community.	
  
    –  The	
  community	
  is	
  beginning	
  to	
  respond	
  with	
  great	
  
       initiatives	
  at	
  various	
  conferences.	
  
•  But	
  the	
  overall	
  trend	
  is	
  not	
  good.	
  
•  If	
  we	
  don’t	
  address	
  this,	
  innovative	
  data	
  
   management	
  research	
  will	
  get	
  done,	
  but	
  
   probably	
  not	
  by	
  us.	
  
                                                                                 69	
  
The	
  Next	
  Big	
  Research	
  Idea	
  
•  Maybe	
  the	
  next	
  big	
  idea	
  is	
  not	
  a	
  new	
  data	
  model.	
  
•  Maybe	
  it	
  is	
  a	
  new	
  community	
  model.	
  
    –  How	
  we	
  create,	
  disseminate,	
  and	
  evaluate	
  research.	
  
    –  How	
  we	
  attract,	
  evaluate,	
  and	
  motivate	
  researchers.	
  
•  The	
  ideas	
  in	
  this	
  keynote	
  are	
  incremental.	
  
•  Can	
  some	
  brilliant	
  person	
  come	
  up	
  with	
  a	
  
   paradigm-­‐changing	
  idea	
  for	
  the	
  community?	
  



                                                                                   70	
  
Acknowledgements	
  
•  I’d	
  like	
  to	
  thank	
  the	
  many	
  colleagues	
  near	
  and	
  
   far	
  whose	
  ideas	
  I	
  have	
  used	
  in	
  this	
  talk.	
  
•  I’d	
  like	
  to	
  apologize	
  to	
  the	
  many	
  colleagues	
  
   near	
  and	
  far	
  whose	
  ideas	
  I	
  should	
  have	
  used	
  in	
  
   this	
  talk	
  but	
  didn’t.	
  
•  I’d	
  like	
  to	
  thank	
  an	
  anonymous	
  colleague	
  for	
  
   heroic	
  work	
  making	
  these	
  slides	
  much	
  less	
  
   visually	
  boring	
  than	
  is	
  typical	
  for	
  a	
  Naughton	
  
   talk.	
  

                                                                              71	
  
Closing	
  Thought:	
  	
  
        We	
  Are	
  In	
  It	
  for	
  the	
  Long	
  Haul	
  
 1959       …       1980        …        2009       …        2040         …         2060

                     Past	
                                  Future	
  
•  The	
  McGee	
  paper	
  I	
  discussed	
  was	
  from	
  1959	
  
•  His	
  most	
  recent	
  publication	
  was	
  in	
  2009	
  
•  So	
  note	
  to	
  youngsters	
  first	
  publishing	
  in	
  this	
  
   conference:	
  you	
  should	
  still	
  be	
  publishing	
  in	
  
   2060!	
  

    So	
  it	
  is	
  REALLY	
  in	
  your	
  interest	
  to	
  decide	
  how	
  
     you	
  would	
  like	
  to	
  spend	
  the	
  next	
  50	
  years…	
  
                                                                                       72	
  
73	
  

Contenu connexe

Similaire à ICDE2010: DBMS: Lessons from the First 50 Years, Speculations for the Next 50

Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...
Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...
Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...
Paul Culmsee
 
Forty Years of Crisis, Ten Years of Agile, Now What?
Forty Years of Crisis, Ten Years of Agile, Now What?Forty Years of Crisis, Ten Years of Agile, Now What?
Forty Years of Crisis, Ten Years of Agile, Now What?
Morendil
 
Altman pitt 2013_v3
Altman pitt 2013_v3Altman pitt 2013_v3
Altman pitt 2013_v3
Micah Altman
 
Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!
CIT, NUS
 
Tablet apps, or the future of Digital Scholarly Editions?
Tablet apps, or the future of Digital Scholarly Editions?Tablet apps, or the future of Digital Scholarly Editions?
Tablet apps, or the future of Digital Scholarly Editions?
Elena Pierazzo
 

Similaire à ICDE2010: DBMS: Lessons from the First 50 Years, Speculations for the Next 50 (20)

How2research
How2researchHow2research
How2research
 
Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...
Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...
Escaping the Knowledge Management Black Hole: New Approaches to Leveraging Or...
 
Ml pluss ejan2013
Ml pluss ejan2013Ml pluss ejan2013
Ml pluss ejan2013
 
Career introduction of Engineering Student SSVIT rizwan
Career introduction of Engineering Student SSVIT rizwanCareer introduction of Engineering Student SSVIT rizwan
Career introduction of Engineering Student SSVIT rizwan
 
Research industry panel review
Research industry panel reviewResearch industry panel review
Research industry panel review
 
10 Observations from 10+ years in the Corporate UX Trenches
10 Observations from 10+ years in the Corporate UX Trenches10 Observations from 10+ years in the Corporate UX Trenches
10 Observations from 10+ years in the Corporate UX Trenches
 
Forty Years of Crisis, Ten Years of Agile, Now What?
Forty Years of Crisis, Ten Years of Agile, Now What?Forty Years of Crisis, Ten Years of Agile, Now What?
Forty Years of Crisis, Ten Years of Agile, Now What?
 
Social media & sentiment analysis splunk conf2012
Social media & sentiment analysis   splunk conf2012Social media & sentiment analysis   splunk conf2012
Social media & sentiment analysis splunk conf2012
 
Making Social Data Sing
Making Social Data SingMaking Social Data Sing
Making Social Data Sing
 
Orlando Techkeynote Post
Orlando Techkeynote PostOrlando Techkeynote Post
Orlando Techkeynote Post
 
Lyn's knowledge journeyv4
Lyn's knowledge journeyv4Lyn's knowledge journeyv4
Lyn's knowledge journeyv4
 
Waiting for Exascale
Waiting for ExascaleWaiting for Exascale
Waiting for Exascale
 
SharePoint Governance. Stop features thinking,
SharePoint Governance. Stop features thinking, SharePoint Governance. Stop features thinking,
SharePoint Governance. Stop features thinking,
 
S.P.A.C.E. Exploration for Software Engineering
 S.P.A.C.E. Exploration for Software Engineering S.P.A.C.E. Exploration for Software Engineering
S.P.A.C.E. Exploration for Software Engineering
 
Promise notes
Promise notesPromise notes
Promise notes
 
Altman pitt 2013_v3
Altman pitt 2013_v3Altman pitt 2013_v3
Altman pitt 2013_v3
 
Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!Search, citation and plagiarism: skills for a digital age have to be taught!
Search, citation and plagiarism: skills for a digital age have to be taught!
 
Tablet apps, or the future of Digital Scholarly Editions?
Tablet apps, or the future of Digital Scholarly Editions?Tablet apps, or the future of Digital Scholarly Editions?
Tablet apps, or the future of Digital Scholarly Editions?
 
Introduction to ai
Introduction to aiIntroduction to ai
Introduction to ai
 
Classrooms for the gifted
Classrooms for the gifted Classrooms for the gifted
Classrooms for the gifted
 

Plus de zukun

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
zukun
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
zukun
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
zukun
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
zukun
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
zukun
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
zukun
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
zukun
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
zukun
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
zukun
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
zukun
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
zukun
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
zukun
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
zukun
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
zukun
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
zukun
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
zukun
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
zukun
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
zukun
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
zukun
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
zukun
 

Plus de zukun (20)

My lyn tutorial 2009
My lyn tutorial 2009My lyn tutorial 2009
My lyn tutorial 2009
 
ETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCVETHZ CV2012: Tutorial openCV
ETHZ CV2012: Tutorial openCV
 
ETHZ CV2012: Information
ETHZ CV2012: InformationETHZ CV2012: Information
ETHZ CV2012: Information
 
Siwei lyu: natural image statistics
Siwei lyu: natural image statisticsSiwei lyu: natural image statistics
Siwei lyu: natural image statistics
 
Lecture9 camera calibration
Lecture9 camera calibrationLecture9 camera calibration
Lecture9 camera calibration
 
Brunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer visionBrunelli 2008: template matching techniques in computer vision
Brunelli 2008: template matching techniques in computer vision
 
Modern features-part-4-evaluation
Modern features-part-4-evaluationModern features-part-4-evaluation
Modern features-part-4-evaluation
 
Modern features-part-3-software
Modern features-part-3-softwareModern features-part-3-software
Modern features-part-3-software
 
Modern features-part-2-descriptors
Modern features-part-2-descriptorsModern features-part-2-descriptors
Modern features-part-2-descriptors
 
Modern features-part-1-detectors
Modern features-part-1-detectorsModern features-part-1-detectors
Modern features-part-1-detectors
 
Modern features-part-0-intro
Modern features-part-0-introModern features-part-0-intro
Modern features-part-0-intro
 
Lecture 02 internet video search
Lecture 02 internet video searchLecture 02 internet video search
Lecture 02 internet video search
 
Lecture 01 internet video search
Lecture 01 internet video searchLecture 01 internet video search
Lecture 01 internet video search
 
Lecture 03 internet video search
Lecture 03 internet video searchLecture 03 internet video search
Lecture 03 internet video search
 
Icml2012 tutorial representation_learning
Icml2012 tutorial representation_learningIcml2012 tutorial representation_learning
Icml2012 tutorial representation_learning
 
Advances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer visionAdvances in discrete energy minimisation for computer vision
Advances in discrete energy minimisation for computer vision
 
Gephi tutorial: quick start
Gephi tutorial: quick startGephi tutorial: quick start
Gephi tutorial: quick start
 
EM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysisEM algorithm and its application in probabilistic latent semantic analysis
EM algorithm and its application in probabilistic latent semantic analysis
 
Object recognition with pictorial structures
Object recognition with pictorial structuresObject recognition with pictorial structures
Object recognition with pictorial structures
 
Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities Iccv2011 learning spatiotemporal graphs of human activities
Iccv2011 learning spatiotemporal graphs of human activities
 

Dernier

Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
KarakKing
 

Dernier (20)

Micro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdfMicro-Scholarship, What it is, How can it help me.pdf
Micro-Scholarship, What it is, How can it help me.pdf
 
This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.This PowerPoint helps students to consider the concept of infinity.
This PowerPoint helps students to consider the concept of infinity.
 
ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.ICT role in 21st century education and it's challenges.
ICT role in 21st century education and it's challenges.
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Plant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptxPlant propagation: Sexual and Asexual propapagation.pptx
Plant propagation: Sexual and Asexual propapagation.pptx
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
SOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning PresentationSOC 101 Demonstration of Learning Presentation
SOC 101 Demonstration of Learning Presentation
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Google Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptxGoogle Gemini An AI Revolution in Education.pptx
Google Gemini An AI Revolution in Education.pptx
 
Salient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functionsSalient Features of India constitution especially power and functions
Salient Features of India constitution especially power and functions
 
ICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptxICT Role in 21st Century Education & its Challenges.pptx
ICT Role in 21st Century Education & its Challenges.pptx
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
Towards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptxTowards a code of practice for AI in AT.pptx
Towards a code of practice for AI in AT.pptx
 

ICDE2010: DBMS: Lessons from the First 50 Years, Speculations for the Next 50

  • 1. DBMS Research: First 50 Years, Next 50 Years Jeffrey F. Naughton Jeffrey F. Naughton
  • 2. Reactions  to  This  Keynote  From   Colleagues…   Jeffrey  F.  Naughton   Assistant     “Cool,  I  look  forward  to  it.”   Professor     ?   (upon  hearing  what  I  proposed  to  talk  about):     Associate   “But  how  are  you  going  to  make  that   Professor     #1       ?   interesting?”   “Well,  you  have  reached  the  age  where   Associate   Professor     ?   you  can  scratch  your  butt  in  public,  go   #2   ahead.”   Emeritus   “Don’t  do  it.  Giving  a  keynote  means   Professor   ?   you  are  a  washed-­‐up  has-­‐been.”   2  
  • 3. My  Goals…   Talk  about  why  I  think  DBMS  research  has  been  so   rewarding  over  the  past  50  years   Some  personal  anecdotes  from  a  “washed-­‐up   has-­‐been”  about  fun  work  that  would  be   discouraged  today   Discuss  why  the  next  50  years  are  at   risk  and  what  we  can  do  about  it     Avoid   scratching   Make  this  “interesting”  
  • 4. Long,  Long  Ago…   William  McGee  published  a  J.  ACM   paper  “Generalization: Key to Successful Electronic Data Processing.” [Jan. ’59]   Great  first  sentence:     “The wholesale acceptance of electronic data processing machines […] is ample proof that management is beginning to recognize the potential of this new tool.”
  • 5. The  More  Things  Change…   Another  sentence  from   McGee’s  intro:     “management is increasingly concerned over the continued high cost of installing and maintaining data processing systems.”  
  • 6. Context:  McGee’s  Machine   •  IBM  702   6  
  • 7. More  On  The  702   •  Memory  (cathode  ray  tubes):  20K  bytes.   •  Disk  (magnetic  drum):  60K  bytes.   •  CPU:  10MIPH     –  MIPH  =  Million  Instructions  Per  Hour   –  For  those  of  you  without  calculators,  that  is  about   0.000278  MIPS   –  Or,  if  you  like,  a  0.000000278  GHz  processor.   7  
  • 8. A  Note  About  This  Early  History…   The  commercial,  technical,   and  intellectual  seeds  that   grew  into  our  community   were  planted  long  ago   8  
  • 9. Problem:   File-­‐Dependent  Programming   •  Write  a  sort  program  for  the   payroll  file   •  Write  another  sort  program  for   the  accounts  receivable  file   •  No  sharing     –  between  files   –  between  applications   –  between  institutions   Solution:  “generalized  programming”   9  
  • 10. What  Do  You  Need  for  Generalized   Programming?   •  A  description  of  the  record  layouts  in  the  file   •  Ideally,  in  some  place  should  also  capture   elements  in  common  to  multiple  files   Schemas   •  Programs  that     –  Interpret  these  descriptions  and     –  Make  it  easy  to  express  generally  useful  operations   (sorting,  reporting)  on  the  files   Data  Manipulation  Languages   10  
  • 11. Another  Problem:  Updates   •  How  do  you  undo   mistakes?     •  How  do  you  handle   failures  in  middle  of   processing?   Solution:  in  a  separate  file,   write  before/after  images   of  updated  records.   (Logging)   11  
  • 12. So  What’s  the  Point?   1940S 1950s … 1970s 1990s 2000s … TODAY The  seeds  of  a  rich  and  vibrant  database   management  research  community  were  planted   In  particular,   •  Strong  commercial  interest  with  $$$  behind  it   •  Common  non-­‐trivial  technical  challenges  related  to  data  management   •  A  set  of  problems  amenable  to  abstraction  and  generalization  
  • 13. Fast  Forward  to  Today…   1940S 1950s … 1970s 1990s 2000s … TODAY •  These  three  key  factors     –  commercial  interest   –  common  data  management  challenges     –  attractive  problems                are  present  now  more  than  ever   That  is  the  good  news!   13  
  • 14. What  About  the  Research  Community?   •  Maybe  it  is  in  good  shape   –  But  I  am  not  so  sure   •  Maybe  it  is  headed  in  a  good  direction   –  But  I  am  less  sure   Modern DB Research •  We  worry  a  lot  about  what  to  work  on   –  I  am  not  worried  about  this   •  We  are  increasingly  valuing  and   rewarding  the  wrong  things   –  I  am  worried  about  this   14  
  • 15. The  Problem  I  Worry  About   The  combination  of:   –  Pressure  to  publish  lots  of  papers  +   –  Low  acceptance  rates  +   –  Bad  reviewing   is  sucking  the  air  out  of  our  community   Modern  Database   Researcher   15  
  • 16. Doomsday  Scenario   Being  a  database  researcher  means  a  life   filled  with  all  the  joy,  meaning,  and   creativity  of  studying  for  and  taking   college  entrance  exams   16  
  • 17. Doomsday  is  Here   •  We  behave  as  if   –   Researchers  are  “students”   –   PC  members  are  “graders”   –   Publications  pose  “exam  questions”     –   New  submissions  are  “answers”   •  These  “exams”  are   perceived  as  the   only  path  to  success   in  the  community   17  
  • 18. Some  More  on  the  Analogy…   •  The  “students”  are  not  that  interested  in  the  questions   •  The  “graders”  are  even  less  interested  in  the  answers   •  No  one  else  is  interested  in  either,  caring  only  about  the   scores   Everyone   what’s the score? 18  
  • 19. So  What  is  Wrong  With  This?   What  kind  of  people  will   Who  wants  to  spend  their  life   this  scenario  attract  to  our   taking  meaningless  exams?   field?   What  kind  of   work  will  be   +”   common  in  this   “A environment?   19  
  • 20. My  Take…   The  problem  isn’t      “Researchers  today!  They  need  to  be  more  like        the  researchers  in  the  good  old  days!”   Rather,  it  is  more      “we  need  to  re-­‐think  the  environment  we  have          created  and  what  it  encourages”   More  on  this  later,  first  some  “old  war  stories”   20  
  • 21. …about  work  I  found  rewarding  and  fun   21  
  • 22. War  Story  #1:  The  Sorting  Record   1940S 1950s … 1970s 1980s 2000s … TODAY DeWitt  had  a   parallel  DBMS  project  (Gamma)   •  Very  fortunately  for  me,  he  let  me  join   •  One  of  the  things  we  worked  on:     –  parallel  sorting  on  an  Intel  iPSC-­‐2   Hypercube   –  (Donovan  Schneider  was  key  to  this   project.)   •  Our  main  idea:   –  Use  parallel  sampling  to   probabilistically  pick  range  partitioning   cutoffs   –  Repartition  the  data   –  Sort  locally   22  
  • 23. In  Those  Days…   •  It  still  made  (a  little)  sense  to  try  the  original  sort   benchmark:     –  1M  100  byte  records  (now  perhaps  known  as   wristwatch  sort)   •  I  think  our  time  was  something  like  30  seconds   •  We  mentioned  it  to  Jim  Gray  when  he   visited  Wisconsin.   –  (it  seemed  a  safe  enough  thing  to  do)   23  
  • 24. Maybe  6  Months  Later…   •  Gray  called  me  up  and  invited  me  to  visit  the   DEC  Bay  Area  Research  Center   •  Said  they  wanted  to  hear  about  my  work  (that   alone  should  have  made  me  suspicious)   24  
  • 25. With  Some  Trepidation  I  Flew  Out     (folks  from  Wisconsin  don’t  require  too  much  reason     to  fly  to  CA  in  the  winter)   CA   WI   25  
  • 26. At  the  Talk…   •  Oddly,  quite  a  few  people  were   there   •  At  the  end,  they  asked      “are  you  finished?”   •  When  I  said  yes,     –  A  lot  of  cameras  appeared.   –  Also  a  large  trophy   World’s Record for Sorting •  They  handed  it  to  me  and  said   we  had  set  the  world  record  for   sorting!   26  
  • 27. Continuing  With  the  Story…   •  I  basked  in  the  glory  for   about  5  seconds   •  Then  they  announced:   –  “We  have  beaten  your   record.”   –  “Please  hand  the  trophy  to   Chris,  we  want  a  picture  of   the  hand-­‐over.”   27  
  • 28. Would  You  Do  This  Today?   •  It  was  perhaps  6  months  work  for  three  researchers     •  Yielded  one  paper  in  a  minor  conference   Today  it  would  be  zero  papers!     I  can  see  the  reviews:   “Reject:  Not  enough  math,  can’t  be  deep.”   “Reject:  Superficial  performance  evaluation.”   “Reject:  Their  sorting  program  cannot  converse  in   English  like  the  Star  Trek  computer.”   28  
  • 29. Today  We  Probably  Couldn’t   Spare  the  Time  to  do  This   •  A  team  of  halfway  decent  researchers:     –  should  (must?)  write  more  than  one  paper  in  six   months   –  should  do  much  more  reviewer-­‐proof  work  than   implementing  and  running  a  sorting  algorithm   29  
  • 30. Was  it  Worth  Doing?   Yes!   –  Taught  me  a  ton  about  sampling  in   parallel  DBMS   –  Convinced  me  of  the  importance  of   processor  architecture  in  performance     •  the  BARC  guys  smoked  us  with  a   uniprocessor  implementation  by  making   clever  use  of  the  cache   YOU BET! –  It  was  really  fun  and  kept  me   interested  and  connected  with  the   field   30  
  • 31. War  Story  #2:  007  Benchmark   Carey,  DeWitt,  and  I  decided  to  benchmark  object  oriented  DBMS   +   +   =   •  We     –  designed  the  benchmark   –  negotiated  with  four  vendors  to   get  them  involved   –  implemented  it  on  five  systems     •  four  commercial  plus  our  own   –  started  running  tests   This  was  a  huge  amount  of  work!   31  
  • 32. After  Months  of  Work…   •  We  received  a  fax  from  a  law  firm   representing  one  of  the  companies   •  The  fax  ordered  us     FAX   –  to  stop  benchmarking  their  system     –  and  to  destroy  all  copies  of  their   system  that  we  might  have   We  reacted  extremely  maturely  as  usual…   32  
  • 33. The  Next  Morning…   •  I  woke  up  early   •  Made  a  big  pot  of  coffee   •  Drank  most  of  it   •  Fired  off  an  angry  fax     –  demanding  that  the  company  destroy  all  copies   of  our  benchmark  implementation   We  knew  they  were  using  it  for  internal  testing.     We  were  pissed.   33  
  • 34. 15  Minutes  After  Sending  the  Fax   •  I  received  a  phone  call   •  The  following  conversation  ensued:   –  Confused Woman: “Professor Naughton?” –  Indignant Professor: “Yes?” –  Confused Woman: “This is First National Bank, we received a fax from you about destroying a benchmark…” –  Sheepish Professor: “OK, please comply, thanks.” 34  
  • 35. Would  This  Project  be  Smart  Now?   •  Probably  not   #1:     ie wer   –  Three  profs,  18+  months,  one  paper?   Rev –  Again,  today  would  be  zero  papers.   Reject. No new algorithms! Rej •  Reviews  today  would  be:   Sho uld ect. co SPE mpare Cma with rk! Accept. Lots of graphs! Reviewer  #3:     35  
  • 36. Worth  It?   •  We  certainly  learned  a  lot  about:   –  OODBMS  technology   –  Stresses  companies  (and  researchers)   are  under  with  respect  to   benchmarking   –  Legal  issues   –  Pitfalls  in  designing  benchmarks   –  Interaction  with  popular  tech  press   Positive  for  all  of  us,  but  would  be     discouraged  by  today’s  environment   36  
  • 37. Point  of  the  War  Stories…   •  At  least  for  me,  some  of  the  most  rewarding   work  did  not  have  a  publication  as  its  goal   Goal   •  At  least  for  me,  some  of  the  most  rewarding   work  did  not  result  in  many  publications   Result   •  At  least  for  me,  some  of  the  most  rewarding   work  did  not  result  in  any  papers  that  would   pass  today’s  program  committees   Result   Accept. 37  
  • 38. These  days  I  fear  we  are   •  Discouraging  work  motivated  by  asking:   –  Can  we  learn  something?   –  Can  we  build  something?   –  Can  we  prove  something?   –  Can  we  improve  something?   •  Encouraging  work  motivated  by  asking:   –  Can  I  write  a  paper  before  the  deadline?   38  
  • 39. Today…   •  We  are  so  frenetically  pursuing  the  next   conference  deadline      …under  the  watchful  glare  of  bad  reviewing,    …that  the  freedom  required  to  do  exploratory  work  is   disappearing.   39  
  • 40. Three  Factors  Causing  Problems   Low  acceptance  rates Emphasis  on  paper  count Bad  reviewing Let’s  consider  them  in  order  
  • 41. The  Problem  With     Low  Acceptance  Rates   •  Very  discouraging   •  Reduces  tolerance  for  errors  in  reviewing   •  Enforces  evaluation  of  work  by  three  randomly   chosen  reviewers  rather  than  the  community   •  Makes  major  event  in  the  course  of  research   acceptance/rejection,  not  scientific  or   engineering  progress   41  
  • 42. How  to  Fix  Low  Acceptance  Rates   •  Increase  acceptance  rates  (duh!)   •  Mandate  a  target   •  Hoped  for  effects:   –  Fewer  rejections   –  Fewer  papers  sloshing  from  conference  to   conference   –  Reduce  the  damage  done  by  bad  reviewing   42  
  • 43. Turning  to  Paper  Count   •  Paper  count  inflation   paper count –  To  be  a  success,  people  feel  one   has  to  publish  a  lot  of  papers   –  To  publish  a  lot  of  papers,  one  has   to  get  past  a  lot  of  program   committees   •  To  do  this  is  a  more  than  full   [No Time to Explore] time  job   •  Leaves  too  little  room  for   activities  not  focused  on   generating  publications   43  
  • 44. Tough  to  Fix,  but…   Let  people  in  on  a  secret:   –  In  general  paper  count  is  much  less  important  in   evaluaAons  than  you  might  think.   –  Stars  are  never  evaluated  by  paper  count   It  is  OK  to  write  a  lot  of  papers.   –  Just  don’t  make  it  the  primary  metric  moAvaAng   researchers.   –  Don’t  let  it  block  those  who  have  a  different  “research   style.”   Paper  Count    =     Quality  Measure   44  
  • 45. Acceptance  Rate  Again   •  Hypothesis:  emphasis  on  paper  count  can  be   somewhat  ameliorated  by  increasing  acceptance   rate   –  If  it  is  easier  to  publish  papers,  publishing  lots  of  them   will  be  perceived  as  less  impressive   –  ShiR  the  focus  from  paper  count  to  paper  quality   45  
  • 46. Third  Issue:  Bad  Reviewing   •  Very  hard  to  fix   •  Very  important  to  fix   •  Extremely  important  to  discuss  because  it  gives   me  a  chance  to  vent   Reviewing  System   Reviewer   Paper   Reviewer   Reviewer   46  
  • 47. Caveat!   •  I  have  received  extremely  helpful  while   ulAmately  negaAve  reviews.   •  These  reviews  have  dramaAcally  helped  me   and  my  co-­‐authors  with  the  presentaAon  and   content  of  the  work.   •  I  am  very  grateful  to  those  reviewers.   •  These  “helpful  but  negaAve”  reviews  are  not   the  same  as  the  “bad”  reviews  I  will  discuss   next!   47  
  • 48. One  problem:   Reviewers  Hate  EVERYTHING!   •  One  anecdote:      SIGMOD  2010   Anonymous     •   350  submissions   Reviewer   –  Number  of  papers  with  all   reviews  “accept”  or  higher:  1   –  Number  of  papers  with   average  “accept”  or  higher:  4   Either  we  all  suck  or   something  is  broken!   48  
  • 49. PC  Reviewing  is  Important   •  This  is  the  primary  place  most  researchers  get  most  of   their  outside  feedback   •  This  feedback  trains  most  researchers  in:   –  What  to  do  for  their  next  submission   –  How  to  evaluate  others   Feedback   Feedback   Feedback   Feedback   Receiving  dysfunctional  reviews     begets  writing  dysfunctional  reviews   49  
  • 50. Why  is  This  Bad?   •  Discouraging  for  authors   •  Devastating  when  it  occurs   in  grant  proposal   reviewing     –  Funding  agencies  believe  us   when  we  say  we  suck   •  The  absolute  score  can  be   fixed  by  enforced  scaling   More  fundamental  problem:  papers  are  being   rejected  for  the  wrong  reasons   50  
  • 51. What  is  Modern  Reviewing  Most  Like?   Your  paper  is  not  bullet  proof.   •  Today  reviewing  is   like  grading   •  Perhaps  because  so   many  reviewers  are   professsors  or   JECT   RE students  or  former   students?   OK,  I  will  focus  all  my  energy  on   fixing  that  before  the  next  deadline.   51  
  • 52. But  Reviewing  is  Not  Grading   •  When  grading  exams,  zero  credit  goes    for   thinking  of  the  question   –  (good)  reviewing:  acknowledges  that  the   question  can  be  the  major  contribution   •  When  grading  exams,  zero  credit  goes  for  a   novel  approach  to  solution.   –  (good)  reviewing:  acknowledges  that  a  novel   approach  can  be  more  important  than  the   existence  of  the  solution   52  
  • 53. Bad  Reviewing:     Rejection  Checklist  Reviewing   •  Rejection  checklist:   –  Is  it  “difficult”?     –  Is  it  “complete”?   –  Can  I  find  any  flaw?   –  Can  I  kill  it  quickly?   •  Writing  negative  reviews  à  you  are  intelligent  and   have  high  standards   •  Finding  the  positive  in  a  flawed  paper  à  you  are  soft,   stupid,  and  have  low  standards   53  
  • 54. Bad  Reviewing  2   •  The  “dog  and  bone”  model  of  reviewing   –  The  reviewer  is  a  dog   –  The  reviewer’s  previous  work  is  a  bone   –  The  author  of  the  new  paper  is  another  dog  trying  to  steal  the  bone   –  Response  by  the  reviewer:  growl  like  hell   Reviewer’s  response   Grrrrrrrrrrrrrr…   Reviewer   Reviewer’s     previous  work   Systems  folks  seem  much  much  worse  about  this  than  theoreticians   54  
  • 55. Bad  Reviewing  3   •  The  ignorant  but   I’ll  fake  it   Ull  I   confident  reviewer   make  it   •  Doesn’t  know  what  is   going  on  but  would  rather   fake  it  than  admit  it   •  Seems  to  feel  admission  of   lack  of  knowledge  would   reveal  a  weakness   –  The  exact  opposite  is  true   55  
  • 56. Why  Are  Things  Bad?   •  One  explanaUon:  reviewers  are  bad  people   •  More  likely:  reviewers  are  being  poorly  trained   •  In  prehistoric  Ames   –  We  had  face-­‐to-­‐face  PC  meeAngs   –  There  was  a  lot  of  accountability  pressure     –  There  was  a  lot  of  coaching  and  mentoring   Experience   Experience   Exp   Exp   56  
  • 57. What  Is  the  Training  Today?   •  Reviewers  are  trained  by  receiving  bad   reviews  from  other  reviewers  who  have   received  bad  reviews  in  the  past     57  
  • 58. Can  We  Change  This?   •  Here  are  some  ideas,  with  the   goal:   –  Encourage  discussion   –  Argue  that  change  is  possible   –  Agitate  for  change   –  Some  are  deliberately  far   fetched!   58  
  • 59. Idea  #1:  Disarm  the  Killers   •  A  large  part  of  “success”  in   conference  submissions  is   getting  a  lucky  assignment  of   reviewers   –  If  you  get  a  “killer”  reviewer,  you   are  dead   –  So  you  resubmit  until  you  have   three  non-­‐killers   •  Possible  solution:     –  Mandate  a  certain  percentage  of   “accepts”  from  each  reviewer   59  
  • 60. Idea  #2:  Shine  Light  on  the  Process   •  Publish  reviews   –  Might  already  encourage   more  care  in  reviewing   –  At  the  very  least  it  would   be  cathartic   •  Allow  voting  for  best   reviews   –  Somehow  reward  best   reviewers?   •  Allow  voting  for  worst   reviews   –  And?   60  
  • 61. Idea  #3:  Return  to  the  Past   •  Require  face-­‐to-­‐face  PC  meetings   •  Perhaps  have  partitioned  committees  to  make   this  tractable   •  Restore  accountability  for  reviews   •  Create  opportunities  for  mentorship   Mentorship   Accountability   61  
  • 62. Proposal  #4:  Single-­‐Blind  Reviewing   •  But  reverse  it:   –  Reviewer  doesn’t  know  author   –  Author  knows  reviewer   •  So   –  You  know  who  is  criticizing  you,  but  not  who  you  are  criticizing   –  Would  certainly  encourage  more  thoughtful  reviewing   Reviewers   Authors   62  
  • 63. Proposal  #5:  Eliminate  the  Problem   •  No  reviewing!   •  Accept  everything.   •  Let  the  community  sort  things  out  over  time.   •  Why  are  we  still  behaving  as  if  our   proceedings  are  printed  by  overworked   monks  on  a  Gutenberg  press?   63  
  • 64. Wrapping  Up  the  Talk   •  There  is  a  lot  of  angst  in  the  field   –  Where  are  the  big  new  ideas?   –  What  defines  us  as  a  community?   –  How  did  we  miss  the  web?   –  How  long  is  this  guy  going  to  talk  in  this  keynote?   •  These  are  all  great  questions,  worthy  of   discussion   But  I  don’t  think  our  success  in  the  next   50  years  depends  on  answering  them   64  
  • 65. Looking  Forward   •  My  crystal  ball  cannot  see   specific  technical  trends   50  years  out   •  It  does  predict  that  the   three  drivers     –  commercial  interest   –  common  data   management  challenges     –  attractive  problems     will  exist  in  for  another  50   years   65  
  • 66. So  what  do  we  need?   •  We  will  be  OK  if  we:   –  Periodically  reconnect  to  these  three  drivers   –  Create  an  environment  that  attracts  good  people   and  gives  them  the  freedom  and  incentive  to  do   good  work   •  Our  success  depends  on  both  of  these.   66  
  • 67. This  is  Important   •  As  a  research  community,  despite  commercial   interest  and  great  problems  to  work  on,  we   will  not  thrive  if  we  create  a  stifling,   depressing  environment  that  discourages  a   diversity  of  work.   67  
  • 68. Who  is  Going  to  Fix  This?   •  This  is  not  “us  vs.  them”   •  There  is  only  us.   •  Don’t  wait  for  “them”  to  change  things.   •  We  are  “them.”     68  
  • 69. Are  Things  Really  So  Bad?   •  Not  entirely,  not  yet.   –  Somehow  good  work  still  gets  done.   –  Somehow  great  papers  still  get  written.   –  Somehow  great  new  people  still  join  the  community.   –  The  community  is  beginning  to  respond  with  great   initiatives  at  various  conferences.   •  But  the  overall  trend  is  not  good.   •  If  we  don’t  address  this,  innovative  data   management  research  will  get  done,  but   probably  not  by  us.   69  
  • 70. The  Next  Big  Research  Idea   •  Maybe  the  next  big  idea  is  not  a  new  data  model.   •  Maybe  it  is  a  new  community  model.   –  How  we  create,  disseminate,  and  evaluate  research.   –  How  we  attract,  evaluate,  and  motivate  researchers.   •  The  ideas  in  this  keynote  are  incremental.   •  Can  some  brilliant  person  come  up  with  a   paradigm-­‐changing  idea  for  the  community?   70  
  • 71. Acknowledgements   •  I’d  like  to  thank  the  many  colleagues  near  and   far  whose  ideas  I  have  used  in  this  talk.   •  I’d  like  to  apologize  to  the  many  colleagues   near  and  far  whose  ideas  I  should  have  used  in   this  talk  but  didn’t.   •  I’d  like  to  thank  an  anonymous  colleague  for   heroic  work  making  these  slides  much  less   visually  boring  than  is  typical  for  a  Naughton   talk.   71  
  • 72. Closing  Thought:     We  Are  In  It  for  the  Long  Haul   1959 … 1980 … 2009 … 2040 … 2060 Past   Future   •  The  McGee  paper  I  discussed  was  from  1959   •  His  most  recent  publication  was  in  2009   •  So  note  to  youngsters  first  publishing  in  this   conference:  you  should  still  be  publishing  in   2060!   So  it  is  REALLY  in  your  interest  to  decide  how   you  would  like  to  spend  the  next  50  years…   72  
  • 73. 73