DataUp: Helping manage & archive data




From Flickr by kaniths
Carly Strasser
California Digital Library
USGS CDI
13 March 2013
From Flickr by DW0825
From Flickr by Flickmor




From Flickr by deltaMike
Digital data




www.woodrow.org
C. Strasser




Courtesy of WHOI
From Flickr by US Army Environmental Command
Digital data + Complex workflows




From Calisphere via San Jose Public Library
2 tables
Random notes

       C:\Documents and Settings\hampton\My Documents\NCEAS Distributed Graduate Seminars\[Wash Cres Lake Dec 15 Dont_Use.xls]Sheet1
                          Stable Isotope Data Sheet
                     Sampling Site / Identifier: Wash Cresc Lake                                                                                                               Peter's lab     Don't use - old data
                                Sample Type: Algal                                                                                                                             Washed Rocks
                                         Date: Dec. 16
                       Tray ID and Sequence: Tray 004

                            Reference statistics: SD for delta 13C = 0.07                              SD for delta 15N = 0.15


                 Position        SampleID         Weight (mg)           %C       delta 13C   delta 13C_ca         %N               delta 15N   delta 15N_ca   Spec. No.
                A1                            ref    0.98              38.27      -25.05         -24.59           1.96                4.12          3.47       25354
                A2                            ref    0.98              39.78      -25.00         -24.54           2.03                4.01          3.36       25356
                A3                            ref    0.98              40.37      -24.99         -24.53           2.04                4.09          3.44       25358
                A4                            ref    1.01              42.23      -25.06         -24.60           2.17                4.20          3.55       25360           Shore           Avg Con
                A5          ALG01                    3.05              1.88       -24.34         -23.88           0.17               -1.65         -2.30       25362      c        -1.26          -27.22
                A6          Lk Outlet Alg            3.06              31.55      -30.17         -29.71           0.92                0.87          0.22       25364                1.26            0.32
                A7          ALG03                    2.91              6.85       -21.11         -20.65           0.48               -0.97         -1.62       25366      c
                A8          ALG05                    2.91              35.56      -28.05         -27.59           2.30                0.59         -0.06       25368
                A9          ALG07                    3.04              33.49      -29.56         -29.10           1.68                0.79          0.14       25370
                A10         ALG06                    2.95              41.17      -27.32         -26.86           1.97                2.71          2.06       25372
                B1          ALG04                    3.01              43.74      -27.50         -27.04           1.36                0.99          0.34       25374      c
                B2          ALG02                      3               4.51       -22.68         -22.22           0.34                4.31          3.66       25376
                B3          ALG01                    2.99              1.59       -24.58         -24.12           0.15               -1.69         -2.34       25378      c
                B4          ALG03                    2.92              4.37       -21.06         -20.60           0.34               -1.52         -2.17       25380      c
                B5          ALG07                     2.9              33.58      -29.44         -28.98           1.74                0.62         -0.03       25382
                B6                            ref    1.01              44.94      -25.00         -24.54           2.59                3.96          3.31       25384
                B7                            ref    0.99              42.28      -24.87         -24.41           2.37                4.33          3.68       25386
                B8          Lk Outlet Alg            3.04              31.43      -29.69         -29.23           1.07                0.95          0.30       25388
                B9          ALG06                    3.09              35.57      -27.26         -26.80           1.96                2.79          2.14       25390
                B10         ALG02                    3.05              5.52       -22.31         -21.85           0.45                4.72          4.07       25392
                C1          ALG04                    2.98              37.90      -27.42         -26.96           1.36                1.21          0.56       25394      c
                C2          ALG05                    3.04              31.74      -27.93         -27.47           2.40                0.73          0.08       25396
                C3                            ref    0.99              38.46      -25.09         -24.63           2.40                4.37          3.72       25398
                                                                       23.78                                      1.17




From Stephanie Hampton (2010), ESA Workshop on Best Practices
Wash Cres Lake Dec 15 Dont_Use.xls




From Stephanie Hampton (2010), ESA Workshop on Best Practices
Random stats output


[The same Stable Isotope Data Sheet as above, with this regression output pasted beside the data rows]

SUMMARY OUTPUT

Regression Statistics
Multiple R           0.283158
R Square             0.080178
Adjusted R Square   -0.022024
Standard Error       1.906378
Observations         11

ANOVA
              df    SS         MS         F          Significance F
Regression     1    2.851116   2.851116   0.784507   0.398813
Residual       9    32.7085    3.634278
Total         10    35.55962

               Coefficients   Standard Error   t Stat      P-value    Lower 95%   Upper 95%   Lower 95.0%   Upper 95.0%
Intercept      -4.297428      4.671099         -0.920003   0.381568   -14.8642    6.269341    -14.8642      6.269341
X Variable 1   -0.158022      0.17841          -0.885724   0.398813   -0.561612   0.245569    -0.561612     0.245569
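The figures in the regression summary above are tied together by standard identities for a simple linear regression, so they can be checked against each other. The sketch below assumes the output comes from an ordinary least-squares fit with one predictor; the variable names are chosen here for illustration, and the slides do not say which columns were regressed.

```python
# Values copied from the SUMMARY OUTPUT above.
n_observations = 11
r_square = 0.080178
ss_regression = 2.851116
ss_residual = 32.7085
df_regression = 1
df_residual = 9

# Total sum of squares is the sum of the two components.
ss_total = ss_regression + ss_residual

# Mean squares are sums of squares divided by their degrees of freedom.
ms_regression = ss_regression / df_regression
ms_residual = ss_residual / df_residual

# F is the ratio of the mean squares.
f_statistic = ms_regression / ms_residual

# Adjusted R-squared penalises R-squared for the number of predictors.
adjusted_r_square = 1 - (1 - r_square) * (n_observations - 1) / (
    n_observations - df_regression - 1
)

# The regression standard error is the square root of the residual mean square.
standard_error = ms_residual ** 0.5

print(round(ss_total, 5), round(f_statistic, 6))
print(round(adjusted_r_square, 6), round(standard_error, 6))
```

A negative adjusted R-squared, as here, indicates that the single predictor explains less variance than would be expected by chance for this sample size.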




From Stephanie Hampton
C:Documents and SettingshamptonMy DocumentsNCEAS Distributed Graduate Seminars[Wash Cres Lake Dec 15 Dont_Use.xls]Sheet1
                          Stable Isotope Data Sheet
                     Sampling Site / Identifier: Wash Cresc Lake                                                                                                          Peter's lab          Don't use - old data
                                Sample Type: Algal                                                                                                                        Washed Rocks
                                         Date: Dec. 16
                       Tray ID and Sequence: Tray 004

                                                                 13                                                      15
                            Reference statistics: SD for delta        C = 0.07                            SD for delta        N = 0.15


                 Position        SampleID         Weight (mg)           %C       delta 13C delta 13C_ca        %N                delta 15N delta 15N_ca   Spec. No.
                A1                            ref    0.98              38.27      -25.05       -24.59         1.96                  4.12        3.47       25354
                A2                            ref    0.98              39.78      -25.00       -24.54         2.03                  4.01        3.36       25356
                A3                            ref    0.98              40.37      -24.99       -24.53         2.04                  4.09        3.44       25358
                A4                            ref    1.01              42.23      -25.06       -24.60         2.17                  4.20        3.55       25360          Shore                Avg Con
                A5          ALG01                    3.05              1.88       -24.34       -23.88         0.17                 -1.65       -2.30       25362 c            -1.26               -27.22
                A6          Lk Outlet Alg            3.06              31.55      -30.17       -29.71         0.92                  0.87        0.22       25364               1.26                 0.32
                A7          ALG03                    2.91              6.85       -21.11       -20.65         0.48                 -0.97       -1.62       25366 c
                A8          ALG05                    2.91              35.56      -28.05       -27.59         2.30                  0.59       -0.06       25368
                A9          ALG07                    3.04              33.49      -29.56       -29.10         1.68                  0.79        0.14       25370
                A10         ALG06                    2.95              41.17      -27.32       -26.86         1.97                  2.71        2.06       25372
                B1          ALG04                    3.01              43.74      -27.50       -27.04         1.36                  0.99        0.34       25374 c                    SUMMARY OUTPUT
                B2          ALG02                      3               4.51            SampleID
                                                                                  -22.68       -22.22        ALG03
                                                                                                              0.34               ALG05
                                                                                                                                    4.31        3.66         ALG07
                                                                                                                                                           25376           ALG06            ALG04            ALG02                ALG01                  ALG03           ALG07
                B3          ALG01                    2.99              1.59       -24.58       -24.12         0.15                 -1.69       -2.34       25378 c                 Regression Statistics
                B4          ALG03                    2.92              4.37       -21.06       -20.60         0.34                 -1.52       -2.17       25380 c                Multiple R 0.283158
                B5          ALG07                     2.9              33.58         Weight (mg)
                                                                                  -29.44       -28.98          2.91
                                                                                                              1.74                  0.62    2.91
                                                                                                                                               -0.03       25382 3.04          2.95 Square 0.080178
                                                                                                                                                                                  R            3.01                     3                  2.99               2.92                  2.9
                B6                            ref    1.01              44.94      -25.00       -24.54         2.59                  3.96        3.31       25384                  Adjusted R Square
                                                                                                                                                                                              -0.022024
                B7                            ref    0.99              42.28      -24.87       -24.41         2.37                  4.33        3.68       25386                  Standard Error
                                                                                                                                                                                               1.906378
                B8          Lk Outlet Alg            3.04              31.43      -29.69 %C-29.23              6.85
                                                                                                              1.07                  0.95   35.560.30       25388 33.49        41.17
                                                                                                                                                                                  Observations43.74    11              4.51                1.59              4.37               33.58
                B9          ALG06                    3.09              35.57      -27.26       -26.80         1.96                  2.79        2.14       25390
                B10         ALG02                    3.05              5.52       -22.31
                                                                                        delta 13C
                                                                                               -21.85
                                                                                                              -21.11
                                                                                                              0.45                  4.72
                                                                                                                                          -28.054.07       25392
                                                                                                                                                                 -29.56       -27.32
                                                                                                                                                                                  ANOVA
                                                                                                                                                                                        -27.50                        -22.68             -24.58             -21.06             -29.44
                C1          ALG04                    2.98              37.90         delta 13C_ca
                                                                                  -27.42       -26.96         -20.65
                                                                                                              1.36                  1.21  -27.590.56       25394 -29.10
                                                                                                                                                                    c         -26.86    -27.04
                                                                                                                                                                                           df              SS         -22.22
                                                                                                                                                                                                                         MS  F           -24.12
                                                                                                                                                                                                                                      Significance F        -20.60             -28.98
                C2          ALG05                    3.04              31.74      -27.93       -27.47         2.40                  0.73        0.08       25396                  Regression          1 2.851116 2.851116 0.784507 0.398813
                C3                            ref    0.99              38.46      -25.09       -24.63         2.40                  4.37        3.72       25398                  Residual            9 32.7085 3.634278
                                                                       23.78             %N                    0.48
                                                                                                              1.17                          2.30                 1.68          1.97
                                                                                                                                                                                  Total          1.3610 35.55962 0.34                0.15                     0.34                  1.74
delta 15N       -0.97    0.59    0.79    2.71    0.99    4.31   -1.69   -1.52    0.62
delta 15N_ca    -1.62   -0.06    0.14    2.06    0.34    3.66   -2.34   -2.17   -0.03

                Coefficients   Standard Error   t Stat      P-value    Lower 95%   Upper 95%   Lower 95.0%   Upper 95.0%
Intercept       -4.297428      4.671099         -0.920003   0.381568   -14.8642    6.269341    -14.8642      6.269341
X Variable 1    -0.158022      0.17841          -0.885724   0.398813   -0.561612   0.245569    -0.561612     0.245569




[Embedded chart, single series labeled "Series1"; x-axis from -35.00 to 0.00, y-axis from -3.00 to 4.00]
From Stephanie Hampton

Who cares?

From Flickr by AJC1
From Flickr by Redden-McAllister

The Fallout

Data Reuse
Data Sharing
Data Management

Hurdles to Data Stewardship

•  Cost
•  Confusion about standards
•  Disparate datasets
•  Lack of training
•  Fear of lost rights or benefits
•  No incentives

From Flickr by iowa_spirit_walker

The Fallout

?
Data Reuse
Data Sharing
Data Management

Intercept researchers where they already work

Facilitate

Data management & organization
Archiving
Data Sharing
Publishing
Reuse & Reproducibility

$$ and advice

$$ and developers

Requirements gathering
Project management
Outreach

What do scientists need?

Asked ~200 scientists

How do you use Excel?
What is your workflow?
How do you capture metadata?
Plans for saving & sharing data?

Scientist Responses

[Chart: "What are they using Excel for?" Categories: organizing data, visualizing data, statistics, sharing data; vertical axis from 0 to 100]
[Chart: "How often are they using Excel?" Categories: rarely, moderately, every day or almost every day]

Scientist Responses

•  No data preservation
   – Unaware of archives
   – Resistant to sharing
•  Poor data documentation
•  90% use Excel w/ other programs

Requirements

Features
Best practices check
Generate metadata (EML)
Generate identifier + citation
Post data to repository

Open Source Tool
Add-in & Web Application
Earth, environmental, ecological researchers
?

Add-in
•  Software you download & install
•  Appears as “ribbon” in Excel
•  Works for Windows Excel 2007+

Web-based application
•  Website that does something with user’s files
•  Any platform
•  But… new user interface

DataUp Web App
Web App: Best Practices Check
Web App: Metadata
Web App: Citation
Web App: Posting to repository

DataUp Add-In
Add-in: Ribbon
Add-in: Metadata tab

Requirements

Features
Best practices check
Generate metadata (EML)
Generate identifier + citation
Post data to repository
?

Data Repository for Anyone | Anywhere

NSF funded DataNet Project
Office of Cyberinfrastructure
www.dataone.org

[Animated diagram: nodes labeled A, B, and C, later joined by nodes D and E]

Main site: dataup.cdlib.org
Code site: bitbucket.org/dataup/main

Establish Partnerships
Engage Developers
Build Community

From animationresources.org

Website         dataup.cdlib.org
Twitter feed    @DataUpCDL
Facebook        facebook.com/DataUpCDL
Code site       bitbucket.org/dataup/main

My website      carlystrasser.net
Email me        carlystrasser@gmail.com
Tweet me        @carlystrasser
My slides       slideshare.net/carlystrasser
CDL Blog        datapub.cdlib.org
