SlideShare une entreprise Scribd logo
1  sur  24
Télécharger pour lire hors ligne
Accepted for publication in the Astrophysical Journal on 2012 Dec, 15 - Submitted on 2012 Oct, 16
                                               Preprint typeset using L TEX style emulateapj v. 5/2/11
                                                                      A

                                                                                                                     +


                                                            THE FALSE POSITIVE RATE OF KEPLER AND THE OCCURRENCE OF PLANETS
                                                 Francois Fressin1 , Guillermo Torres1 , David Charbonneau1 , Stephen T. Bryson2 , Jessie Christiansen2 ,
                                                     ¸
                                                     Courtney D. Dressing1 , Jon M. Jenkins2 , Lucianne M. Walkowicz3 and Natalie M. Batalha2
                                                                Accepted for publication in the Astrophysical Journal on 2012 Dec, 15 - Submitted on 2012 Oct, 16

                                                                                                     ABSTRACT
                                                        The Kepler Mission is uniquely suited to study the frequencies of extrasolar planets. This goal
                                                      requires knowledge of the incidence of false positives such as eclipsing binaries in the background of
arXiv:1301.0842v1 [astro-ph.EP] 4 Jan 2013




                                                      the targets, or physically bound to them, which can mimic the photometric signal of a transiting planet.
                                                      We perform numerical simulations of the Kepler targets and of physical companions or stars in the
                                                      background to predict the occurrence of astrophysical false positives detectable by the Mission. Using
                                                      real noise level estimates, we compute the number and characteristics of detectable eclipsing pairs
                                                      involving main sequence stars and non-main sequence stars or planets, and we quantify the fraction
                                                      of those that would pass the Kepler candidate vetting procedure. By comparing their distribution
                                                      with that of the Kepler Objects of Interest (KOIs) detected during the first six quarters of operation
                                                      of the spacecraft, we infer the false positive rate of Kepler and study its dependence on spectral type,
                                                      candidate planet size, and orbital period. We find that the global false positive rate of Kepler is 9.4%,
                                                      peaking for giant planets (6–22 R⊕ ) at 17.7%, reaching a low of 6.7% for small Neptunes (2–4 R⊕ ),
                                                      and increasing again for Earth-size planets (0.8–1.25 R⊕ ) to 12.3%.
                                                        Most importantly, we also quantify and characterize the distribution and rate of occurrence of
                                                      planets down to Earth size with no prior assumptions on their frequency, by subtracting from the
                                                      population of actual Kepler candidates our simulated population of astrophysical false positives. We
                                                      find that 16.5 ± 3.6% of main-sequence FGK stars have at least one planet between 0.8 and 1.25 R⊕
                                                      with orbital periods up to 85 days. This result is a significant step towards the determination of
                                                      eta-earth, the occurrence of Earth-like planets in the habitable zone of their parent stars. There is no
                                                      significant dependence of the rates of planet occurrence between 0.8 and 4 Earth radii with spectral
                                                      type. In the process, we derive also a prescription for the signal recovery rate of Kepler that enables
                                                      a good match to both the KOI size and orbital period distribution, as well as their signal-to-noise
                                                      distribution.

                                                                   1. INTRODUCTION                                    Many of the most interesting candidate transiting plan-
                                               In February 2011, the Kepler Mission produced a cat-                ets identified by the Kepler Mission cannot be confirmed
                                             alog of more than 1200 candidate transiting planets, re-              with current spectroscopic capabilities, that is, by the
                                             ferred to as Kepler Objects of Interest (KOIs). These                 detection of the reflex motion of the star due to the
                                             candidates were identified during the first four months                 gravitational pull from the planet. Included in this cat-
                                             of operation of the spacecraft (Borucki et al. 2011) in its           egory are essentially all Earth-size planets, as well as
                                             quest to determine the frequency of Earth-size planets                most super-Earths that are in the habitable zone of Sun-
                                             around Sun-like stars. This unprecedented sample of po-               like stars, which have Doppler signals too small to de-
                                             tential exoplanets has become an invaluable resource for              tect. Faced with this difficulty, the approach adopted
                                             all manner of statistical investigations of the properties            for such objects is statistical in nature and consists in
                                             and distributions of planets around main-sequence stars.              demonstrating that the likelihood of a planet is much
                                             The most recent Kepler release expanded the sample to                 greater than that of a false positive, a process referred
                                             more than 2300 candidates (Batalha et al. 2012), based                to as “validation”. The Kepler team has made extensive
                                             on 16 months of observation. The analysis of this infor-              use of a technique referred to as BLENDER (Torres et al.
                                             mation must contend, however, with the fact that not all              2004, 2011; Fressin et al. 2011) to validate a number of
                                             photometric signals are caused by planets. Indeed, false              KOIs, including the majority of the smallest known exo-
                                             positive contamination is typically the main concern in               planets discovered to date. This technique requires an
                                             transit surveys (see, e.g., Brown 2003), including Kepler,            accurate knowledge of the target star usually derived
                                             because there is a large array of astrophysical phenomena             from spectroscopy or asteroseismology, and makes use
                                             that can produce small periodic dimmings in the light of              of other follow-up observations for the candidate includ-
                                             a star that can be virtually indistinguishable from those             ing high spatial resolution imaging, Spitzer observations
                                             due to a true planetary transit. A common example is                  when available, and high-resolution spectroscopy. The
                                             a background eclipsing binary falling within the photo-               telescope facilities required to gather such observations
                                             metric aperture of a Kepler target.                                   are typically scarce, so it is generally not possible to have
                                                                                                                   these constraints for thousands of KOIs such as those in
                                                1 Harvard-Smithsonian Center for Astrophysics, Cambridge,          the recent list by Batalha et al. (2012). For this reason
                                               MA 02138, USA, ffressin@cfa.harvard.edu                              the Kepler team has concentrated the BLENDER efforts
                                                2 NASA Ames Research Center, Moffett Field, CA 94035, USA           only on the most interesting and challenging cases.
                                                3 Princeton University, Princeton, NJ 08544, USA
                                                                                                                      Based on the previous list of Kepler candi-
2                                                       Fressin et al.

  dates by Borucki et al. (2011) available to them,                 troid of the target in and out of transit based the Kepler
  Morton & Johnson (2011) (hereafter MJ11) investigated             images themselves, which can exclude all chance align-
  the false positive rate for KOIs, and its dependence on           ments with background eclipsing binaries beyond a cer-
  some of the candidate-specific properties such as bright-          tain angular separation. They adopted for this angular
  ness and transit depth. They concluded that the false             separation a standard value of 2′′ based on the single
  positive rate is smaller than 10% for most of the KOIs,           example of Kepler-10 b (Batalha et al. 2011), and then
  and less than 5% for more than half of them. This conclu-         scaled this result for other KOIs assuming certain in-
  sion differs significantly from the experience of all previ-        tuitive dependencies with target brightness and transit
  ous transit surveys, where the rates were typically up to         depth. As it turns out, those dependencies are not borne
  an order of magnitude larger (e.g., ∼80% for the HAT-             out by actual multi-quarter centroid analyses performed
  Net survey; Latham et al. 2009). The realization that             since, and reported by Batalha et al. (2012);
  the Kepler rate is much lower than for the ground-based
  surveys allowed the community to proceed with statisti-         4. MJ11 did not consider the question of detectability of
  cal studies based on lists of mere candidates, without too         both planets and false positives, and its dependence on
  much concern that false positive contamination might               the noise level for each Kepler target star as well as on
  bias the results (e.g., Howard et al. 2012).                       the period and duration of the transit signals;
    In their analysis MJ11 made a number of simplifying
  assumptions that allowed them to provide these first es-         5. The overall frequency and distribution of eclipsing bina-
  timates of the false positive rate for Kepler, but that            ries in the Kepler field, which enters the calculation of
  are possibly not quite realistic enough for the most in-           the frequency of background blends, has been measured
  teresting smaller signals with lower signal-to-noise ratios        directly by the Kepler Mission itself (Prˇa et al. 2011;
                                                                                                                s
  (SNRs). A first motivation for the present work is thus to          Slawson et al. 2011), and is likely more accurate at pre-
  improve upon those assumptions and at the same time to             dicting the occurrence of eclipsing binaries in the solar
  approach the problem of false positive rate determination          neighborhood than inferences from the survey of non-
  in a global way, by numerically simulating the population          eclipsing binary stars by Raghavan et al. (2010), used by
  of blends in greater detail for the entire sample of Kepler        MJ11.
  targets. A second motivation of this paper is to use our
  improved estimates of the false positive rate to extract             The above details have the potential to bias the esti-
  the true frequencies of planets of different sizes (down           mates of the false positive rate by factors of several, espe-
  to Earth-size) as well as their distributions in terms of         cially for the smaller candidates with low signal-to-noise
  host star spectral types and orbital characteristics, a goal      ratios. Indeed, recent observational evidence suggests
  to which the Kepler Mission is especially suited to con-          the false positive rate may be considerably higher than
  tribute.                                                          claimed by MJ11. In one example, Demory & Seager
    Regarding our first objective —the determination of              (2011) reported that a significant fraction (14%) of the
  the false positive rate of Kepler — the assumptions by            115 hot Jupiter candidates they examined, with radii
  MJ11 we see as potentially having the greatest impact             in the 8–22 R⊕ range, show secondary eclipses inconsis-
  on their results are the following:                               tent with the planetary interpretation. The false posi-
                                                                    tive rate implied by the MJ11 results for the same ra-
1. The MJ11 false positive rate is based on an assumed              dius range is only 4%. Santerne et al. (2012) recently
   20% planet occurrence, with a power law distribution             conducted an extensive spectroscopic follow-up campaign
   of planet sizes between 0.5 and 20 R⊕ (peaking at small          using the HARPS and SOPHIE spectrographs, and ob-
   planets) independently of the orbital period or stellar          served a number of hot Jupiters with periods under 25
   host characteristics. This is a rather critical hypothesis,      days, transit depths greater than 0.4%, and host star
   as the frequency of the smallest or longest-period objects       magnitudes Kp < 14.7 in the Kepler bandpass. They
   in the KOI sample is essentially unknown. The adoption           reported finding a false positive rate of 34.8 ± 6.5%. In
   of a given planet frequency to infer the false positive rate     contrast, the MJ11 results imply an average value of 2.7%
   in poorly understood regions of parameter space is risky,        for the same period, depth, and magnitude ranges, which
   and may constitute circular reasoning;                           is a full order of magnitude smaller.
2. The scenarios MJ11 included as possible blends feature              We point out that Morton (2012) recently published
   both background eclipsing binaries and eclipsing bina-           an automated validation procedure for exoplanet tran-
   ries physically associated with the target. However,             sit candidates based on a similar approach as the MJ11
   other configurations that can also mimic transit signals          work, with the improvement that it now considers back-
   were not considered, such as those involving larger plan-        ground stars transited by planets as an additional source
   ets transiting an unseen physical companion or a back-           of potential blends (item #2 above). However, the
   ground star. These configurations are more difficult                Morton (2012) work does not quantify the global false
   to rule out with follow-up observations, and BLENDER             positive rate of Kepler, but focuses instead on false
   studies have shown that such scenarios are often the             alarm probabilities for individual transit candidates. The
   most common blend configuration for candidates that               methodology makes use of candidate-specific information
   have been carefully vetted (see, e.g., Batalha et al. 2011;      such as the transit shape that was not explicitly consid-
   Cochran et al. 2011; Ballard et al. 2011; Fressin et al.         ered in the actual vetting procedure used by the Kepler
   2012; Gautier et al. 2012; Borucki et al. 2012);                 team to generate the KOI list. Here we have chosen to
                                                                    emulate the Kepler procedures to the extent possible so
3. A key ingredient in the MJ11 study is the strong con-            that we may make a consistent use of the KOI list as
   straint offered by the analysis of the motion of the cen-         published.
Kepler FPR and Occurrence of Planets                                                  3

   Regarding the second goal of our work —to determine          of the target, rather than being systems comprised of true
the rate of occurrence of planets of different sizes— a          Earth-size planets.
main concern that such statistical studies must neces-             Our two objectives, the determination of the false pos-
sarily deal with is the issue of detectability. Only a          itive rate for Kepler and the determination of the occur-
fraction of the planets with the smallest sizes and the         rence rate of planets in different size ranges, are in fact
longest periods have passed the detection threshold of          interdependent, as described below. We have organized
Kepler. The studies of Catanzarite & Shao (2011) and            the paper as follows. In Sect. 2 we summarize the gen-
Traub (2012), based on the KOI catalog of Borucki et al.        eral approach we follow. The details of how we simulate
(2011), assumed the sample of Neptune-size and Jupiter-         false positives and quantify their frequency are given in
size planets to be both complete (i.e., that all transiting     Sect. 3, separately for each type of blend scenario includ-
planets have been detected by Kepler ) and essentially          ing background or physically associated stars eclipsed by
free of contamination (i.e., with a negligible false posi-      another star or by a planet. At the end of this section we
tive rate). They then extrapolated to infer the occur-          provide an example of the calculation for one of those sce-
rence rate of Earth-size planets assuming it follows the        narios. Sect. 4 is a summary of the false positive rates for
occurrence vs. size trend of larger planets. The more           planets of different sizes in the Kepler sample as a whole.
detailed study of Howard et al. (2012) focused on the           In Sect. 5 we describe the study of the Kepler detection
sub-sample of KOIs for which Kepler is closer to com-           rate, and how we estimate the frequencies of planets. We
pleteness, considering only planets larger than 2 R⊕ and        derive the occurrence of planets in different size ranges in
with periods shorter than 50 days. They tackled the issue       Sect. 6, where we examine also the dependence of the fre-
of detectability by making use of the Combined Differ-           quencies on the spectral type (mass) of the host star and
ential Photometric Precision (CDPP) estimates for each          other properties. In Sect. 7, we study the distribution of
KOI to determine the completeness of the sample. The            the transit durations of our simulated planet population.
CDPP is designed to be an estimator of the noise level of       We discuss the assumptions and possible improvements
Kepler light curves on the time scale of planetary tran-        of our study in Sect. 8, and conclude by listing our main
sits, and is available for each star Kepler has observed.       results in Sect. 9. Finally, an Appendix describes how we
Howard et al. (2012) found a rapid increase in planet oc-       model the detection process of Kepler and how we infer
currence with decreasing planet size that agrees with the       the exclusion limit from the centroid motion analysis for
predictions of the core-accretion formation scenario, but       each individual target.
disagrees with population synthesis models that predict                           2. GENERAL APPROACH
a dearth of objects with super-Earth and Neptune sizes
for close-in orbits. They also reported that the occur-                       2.1. False positive simulation
rence of planets between 2 and 4 R⊕ in the Kepler field             The first part of our analysis is the calculation of
increases linearly with decreasing effective temperature         the false positive rate for Kepler targets. We consider
of the host star. Their results rely to some degree on          here all Kepler targets that have been observed in at
the conclusions of MJ11 regarding the rate of false pos-        least one out of the first six quarters of operation of
itives of the sub-sample of KOIs they studied, and it is        the spacecraft (Q1–Q6)4 , to be consistent with the se-
unclear how the issues enumerated above might affect             lection imposed on the Batalha et al. (2012) list, which
them. In an independent study Youdin (2011) devel-              comes to a total of 156,453 stars. For each of these
oped a method to infer the underlying planetary distri-         stars, identified by a unique Kepler Input Catalog (KIC)
bution from that of the Kepler candidates published by          number (Brown et al. 2011), we have available their es-
Borucki et al. (2011). He investigated the occurrence of        timated mass, surface gravity log g, and brightness in
planets down to 0.5 R⊕ , but without considering false          the Kepler bandpass (Kp magnitude) as listed in the
positives, and relied on the simplified Kepler detection         KIC5 . Additionally we have their CDPP values on dif-
efficiency model from Howard et al. (2012). He reported           ferent timescales and for different quarters, provided in
a significant difference in the size distributions of shorter     the Mikulski Archive for Space Telescopes (MAST) at
(P < 7 days) and longer period planets.                         STScI.6 The CDPP allows us to estimate the detectabil-
   Aside from the question of detectability, we emphasize       ity of each type of false positive scenario.
that the occurrence rate of Earth-size planets is still effec-      We perform Monte Carlo simulations specific to each
tively unknown. Kepler is the only survey that has pro-         Kepler target to compute the number of blends of differ-
duced a list of Earth-size planet candidates, and because       ent kinds that we expect. The calculations are based on
of the issues raised above, strictly speaking we cannot         realistic assumptions about the properties of objects act-
yet rule out that the false positive rate is much higher for    ing as blends, informed estimates about the frequencies
such challenging objects than it is for larger ones, even       of such objects either in the background or physically
as high as 90%. Indeed, MJ11 predicted false positive           associated with the target, the detectability of the sig-
rates with the assumption of an overall 20% planet fre-         nals produced by these blends for the particular star in
quency, peaking precisely for the smallest planets. The         question based on its CDPP, and also on the ability to re-
question of the exact rate of occurrence of small planets       ject blends based on constraints provided by the centroid
has implications for other conclusions from Kepler. For
                                                                  4 A quarter corresponds to a period of observation of about
example, Lissauer et al. (2012) have recently shown that
most KOIs with multiple candidates (‘multis’) are bona-         three months between 90◦ spacecraft rolls designed to keep the
                                                                solar panels properly illuminated.
fide planets. However, if Earth-size planets are in fact           5 A number of biases are known to exist in the properties listed
rare, then most multis involving Earth-size candidates          in the KIC, the implications of which will be discussed later in
would likely correspond to larger objects transiting (to-       Section 8.
                                                                  6 http://stdatu.stsci.edu/kepler/
gether) the same unseen star in the photometric aperture
4                                                        Fressin et al.

motion analysis or the presence of significant secondary          than 2 R⊕ , in which they claimed that the smallest plan-
eclipses. Blended stars beyond a certain angular distance        ets (small Neptunes) are more common among later-type
from the target can be detected in the standard vetting          stars, but did not find the same trend for larger plan-
procedures carried out by the Kepler team because they           ets. Our detailed modeling of the planet detectability
cause changes in the flux-weighted centroid position, de-         for each KOI, along with our new estimates of the false
pending on the properties of the signal. This constraint,        positive rates among small planets, enables us to extend
and how we estimate it for each star, is described more          the study to two smaller classes of planets, and to inves-
fully in the Appendix. The inclusion of detectability and        tigate whether the claims by Howard et al. (2012) hold
the constraint from centroid motion analysis are included        for objects as small as the Earth.
in order to emulate the vetting process of Kepler to a
reasonable degree.                                                                 3. FALSE POSITIVES
                                                                   There exists a wide diversity of astrophysical phenom-
            2.2. Planet occurrence simulation                    ena that can lead to periodic dimming in the light curve
  The second part of our analysis seeks to determine the         of a Kepler target star and might be interpreted as a sig-
rate of occurrence of planets of different sizes. For this        nal from a transiting planet. These phenomena involve
we compare the frequencies and distributions of simu-            other stars that fall within the photometric aperture of
lated false positives from the previous step with those of       the target and contribute light. One possibility is a back-
the real sample of KOIs given by Batalha et al. (2012).          ground star (either main-sequence or giant) eclipsed by
We interpret the differences between these two distribu-          a smaller object; another is a main-sequence star physi-
tions as due to true detectable planets. In order to obtain      cally associated with the target and eclipsed by a smaller
the actual rate of occurrence of planets it is necessary to      object. In both cases the eclipsing body may be either a
correct for the fact that the KOI list includes only planets     smaller star or a planet. There are thus four main cat-
with orbital orientations such that they transit, and for        egories of false positives, which we consider separately
the fact that some fraction of transiting planets are not        below. We discuss also the possibility that the eclipsing
detectable by Kepler. We determine these corrections             objects may be brown dwarfs or white dwarfs.
by means of Monte-Carlo simulations of planets orbit-              We make here the initial assumption that all KOIs that
ing the Kepler targets, assuming initial distributions of        have passed the nominal detection threshold of Kepler
planets as a function of planet size and orbital period          are due to astrophysical causes, as opposed to instru-
(Howard et al. 2012). We then compare our simulated              mental causes such as statistical noise or aliases from
population of detectable transiting planets with that of         transient events in the Kepler light curve. The expected
KOIs minus our simulated false positives population, and         detection rate (Dr ) based on a matched filter method
we adjust our initial assumptions for the planetary distri-      used in the Kepler pipeline is discussed by Jenkins et al.
butions. We proceed iteratively to obtain the occurrence         (1996). It is given by the following formula, as a function
of planets that provides the best match to the KOI list.         of the signal-to-noise ratio (SNR) of the signal and a cer-
This analysis permits us to also study the dependence            tain threshold, assuming that the (possibly non-white)
of the occurrence rates on properties such as the stellar        observational noise is Gaussian:
mass.                                                                                                           √
  For the purpose of interpreting both the false posi-                  Dr (SNR) = 0.5 + 0.5 erf (SNR − 7.1)/ 2 . (1)
tive rates and the frequencies of planets, we have chosen
to separate planets into five different size (radius Rp )          In this expression erf is the standard error function, and
ranges:                                                          the adopted 7.1σ threshold was chosen so that no more
                                                                 than one false positive will occur over the course of the
    • Giant planets: 6 R⊕ < Rp ≤ 22 R⊕                           Mission due to random fluctuations (see Jenkins et al.
                                                                 2010). The formula corresponds to the cumulative prob-
    • Large Neptunes: 4 R⊕ < Rp ≤ 6 R⊕                           ability density function of a zero mean, unit variance
                                                                 Gaussian variable, evaluated at a point representing the
    • Small Neptunes: 2 R⊕ < Rp ≤ 4 R⊕                           distance between the threshold and the SNR. The con-
    • Super-Earths: 1.25 R⊕ < Rp ≤ 2 R⊕                          struction of the matched filter used in the Kepler pipeline
                                                                 is designed to yield detection statistics drawn from this
    • Earths: 0.8 R⊕ < Rp ≤ 1.25 R⊕                              distribution. With this in mind, only 50% of transits
                                                                 with an SNR of 7.1 will be detected. The detection rates
  An astrophysical false positive that is not ruled out          are 2.3%, 15.9%, 84.1%, 97.7%, and 99.9% for SNRs of
by centroid motion analysis and that would produce a             5.1, 6.1, 8.1 9.1, and 10.1. Below in Sect. 5.1 we will im-
signal detectable by Kepler with a transit depth corre-          prove upon this initial detection model, but we proceed
sponding to a planet in any of the above categories is           for now with this prescription for clarity.
counted as a viable blend, able to mimic a true planet              Our simulation of the vetting process carried out by
in that size range. These five classes were selected as a         the Kepler team includes two tests that the astrophysi-
compromise between the number of KOIs in each group              cal scenarios listed above must pass in order to be con-
and the nomenclature proposed by the Kepler team                 sidered viable false positives (that could be considered a
(Borucki et al. 2011; Batalha et al. 2012). In particular,       planet candidate): the most important is that they must
we have subdivided the 2–6 R⊕ category used by the Ke-           not produce a detectable shift in the flux centroid. Ad-
pler team into “small Neptunes” and “large Neptunes”.            ditionally, for blends involving eclipsing binaries, they
Howard et al. (2012) chose the same separation in their          must not lead to secondary eclipses that are of similar
recent statistical study of the frequencies of planets larger    depth as the primary eclipses (within 3σ).
Kepler FPR and Occurrence of Planets                                                    5

            3.1. Background eclipsing binaries                       from the Slawson et al. (2011) catalog. We then check
   We begin by simulating the most obvious type of false             whether the transit-like signal produced by this compan-
positive involving a background (or foreground) eclips-              ion has the proper depth to mimic a planet with a radius
ing binary in the photometric aperture of the target star,           in the size category under consideration (Sect. 2.2), after
whose eclipses are attenuated by the light of the target             dilution by the light of the Kepler target (which depends
and reduced to a shallow transit-like event. We model                on the known brightness of the target and of the back-
the stellar background of each of the 156,453 KIC targets            ground star). If it does not, we reject it as a possible
using the Besan¸on model of the Galaxy (Robin et al.
                  c                                                  blend. Next we compute the SNR of the signal as de-
                                                                     scribed in the Appendix, and we determine whether this
2003).7 We simulate 1 square degree areas at the center
                                                                     false-positive would be detectable by the Kepler pipeline
of each of the 21 modules comprising the focal plane of
                                                                     as follows. For each KIC target observed by Kepler be-
Kepler (each module containing two CCDs), and record
                                                                     tween Q1 and Q6 we compute the SNR as explained in
all stars (of any luminosity class) down to 27th magni-              the Appendix, and with Eq.[1] we derive Dr . We then
tude in the R band, which is sufficiently close to Kepler ’s           draw a random number between 0 and 1 and compare it
Kp band for our purposes. This limit of R = 27 corre-                with Dr ; if larger, we consider the false positive signal to
sponds to the faintest eclipsing binary that can mimic a             be detectable. We also estimate the fraction of the Nbs
transit by an Earth-size planet, after dilution by the tar-          stars that would be missed in the vetting carried out by
get. We assign to each Kepler target a background star               the Kepler team because they do not induce a measur-
drawn randomly from the simulated area for the module                able centroid motion. This contribution, Fcent , is simply
it is located on, and we assign also a coefficient Nbs to              the fraction of background stars interior to the exclusion
represent the average number of background stars down                limit set by the centroid analysis. As we describe in more
to 27th magnitude in the 1 square degree region around               detail in the Appendix, the size of this exclusion region
the target.                                                          is itself mainly a function of the SNR.
   Only a fraction of these background stars will be                    Based on all of the above, we compute the number Nbeb
eclipsing binaries, and we take this fraction from re-               of background (or foreground) eclipsing binaries in the
sults of the Kepler Mission itself, reported in the work of          Kepler field that could mimic the signal of a transiting
Slawson et al. (2011). The catalog compiled by these au-             planet in a given size range as
thors provides not only the overall rate of occurrence of
eclipsing binaries (1.4%, defined as the number of eclips-                       Ntarg
ing binaries found by Kepler divided by the number of                  Nbeb =           Nbs Feb Cbf Cgeom Tdepth Tdet Tsec Fcent ,
Kepler targets), but also their eclipse depth and period                        i=1
distributions. In practice we adopt a somewhat reduced
frequency of eclipsing binaries Feb = 0.79% that excludes            where F terms represent fractions, C terms represent
contact systems, as these generally cannot mimic a true              corrections to the eclipsing binary frequency, and T terms
planetary transit because the flux varies almost contin-              are either 0 or 1:
uously throughout the orbital cycle. We consider only                   • Ntarg is the number of Kepler targets observed for
detached, semi-detached, and unclassified eclipsing bina-                  at least one quarter during the Q1–Q6 observing
ries from the Slawson et al. (2011) catalog. Furthermore,                 interval considered here;
we apply two additional corrections to this overall fre-
quency that depend on the properties of the background                  • Nbs is the average number of background stars
star. The first factor, Cbf , adjusts the eclipsing binary                 down to magnitude R = 27 in a 1 square degree
frequency according to spectral type, as earlier-type stars               area around the target star;
have been found to have a significantly higher binary
frequency than later-type stars. We interpolate this cor-               • Feb is the frequency of eclipsing binaries, defined as
rection from Figure 12 of Raghavan et al. (2010). The                     the fraction of eclipsing binaries (excluding contact
second factor, Cgeom , accounts for the increased chance                  systems) found by Kepler divided by the number
that a larger star in the background would be eclipsed                    of Kepler targets, and is 0.79% for this work;
by a companion of a given period, from geometrical con-                 • Cbf is a correction to the binary occurrence rate
siderations. This correction factor is unity for a star of                Feb that accounts for the dependence on stellar
1 R⊙ , and increases linearly with radius. We note that                   mass (or spectral type), following Raghavan et al.
the scaling law we have chosen does not significantly im-                  (2010);
pact the overall occurrence of eclipsing binaries estab-
lished by Slawson et al. (2011), as the median radius of                • Cgeom is a correction to the geometric probability
a Kepler target (0.988 R⊙ ) is very close to 1 R⊙ . The size              of eclipse that takes into account the dependence
of each of our simulated background stars is provided by                  on the size of the background star;
the Besan¸on model.
           c
   Next, we assign a companion star to each of the 156,453              • Tdepth is 1 if the signal produced by the background
background stars drawn from the Besan¸on model, with
                                           c                              eclipsing binary has a depth corresponding to a
properties (eclipse depth, orbital period) taken randomly                 planet in the size range under consideration, af-
                                                                          ter accounting for dilution by the Kepler target, or
  7 The accuracy of the stellar densities (number of stars per            0 otherwise;
square degree) per magnitude bin provided by this model has
been tested against actual star counts at the center of the Kepler      • Tdet is 1 if the signal produced by the back-
field. The results have been found to be consistent within 15% (R.         ground eclipsing binary is detectable by the Kepler
Gilliland, priv. comm.).                                                  pipeline, or 0 otherwise;
6                                                       Fressin et al.

    • Tsec is 1 if the background eclipsing binary does not     as:
      show a secondary eclipse detectable by the Kepler                        Ntarg
      pipeline, or 0 otherwise. We note that we have also
      set Tsec to 1 if the simulated secondary eclipses have          Nbtp =           Nbs Ftp Ctpf Cgeom Tdepth Tdet Fcent ,
      a depth within 3σ of that of the primary eclipse, as                     i=1

      this could potentially be accepted by the pipeline        where Ntarg , Nbs , Cgeom , Tdepth, Tdet , and Fcent have
      as a transiting object, with a period that is half of     similar meanings as before, and
      the binary period;
                                                                      • Ftp is the frequency of transiting planets in the size
                                                                        class under consideration, computed as the frac-
    • Fcent is the fraction of the Nbs background stars                 tion of suitable KOIs found by Kepler divided by
      that would show no significant centroid shifts, i.e.,              the number of non-giant Kepler targets (defined
      that are interior to the angular exclusion region                 as those with KIC values of log g > 3.6, following
      that is estimated for each star, as explained in the              Brown et al. 2011)8 ;
      Appendix.
                                                                      • Ctpf is a correction factor to the transiting planet
                                                                        frequency Ftp that accounts for the three biases
       3.2. Background stars transited by planets                       mentioned above: incompleteness, the false posi-
  A second type of astrophysical scenario that can re-                  tive rate among the KOIs of Batalha et al. (2012),
produce the shape of a small transiting planet consists                 and the potential dependence of Ftp on the spectral
of a larger planet transiting a star in the background                  type of the background star.
(or foreground) of the Kepler target. We proceed in
a similar way as for background eclipsing binaries, ex-                     3.3. Companion eclipsing binaries
cept that instead of assigning a stellar companion to each        An eclipsing binary gravitationally bound to the tar-
background star, we assign a random planetary compan-           get star (forming a hierarchical triple system) may also
ion drawn from the actual list of KOIs by Batalha et al.        mimic a transiting planet signal since its eclipses can
(2012). However, there are three factors that prevent           be greatly attenuated by the light of the target. The
us from using this list as an unbiased sample of transit-       treatment of this case is somewhat different from the
ing planets: (1) the list is incomplete, as the shallowest      one of background binaries, though, as the occurrence
and longest period signals are only detectable among the        rate of intruding stars eclipsed by others is independent
Kepler targets in the most favorable cases; (2) the list        of the Galactic stellar population. To simulate these sce-
contains an undetermined number of false positives; and         narios we require knowledge of the frequency of triple
(3) the occurrence of planets of different sizes may be          systems, which we adopt from the results of a volume-
correlated with the spectral type of the host star (see         limited survey of the multiplicity of solar-type stars by
Howard et al. 2012).                                            Raghavan et al. (2010). They found that 8% of their
  While it is fairly straightforward to quantify the in-        sample stars are triples, and another 3% have higher
completeness by modeling the detectability of signals for       multiplicity. We therefore adopt a total frequency of
the individual KOIs (see the Appendix for a description         8% + 3% = 11%, since additional components beyond
of the detection model we use), the biases (2) and (3)          three merely produce extra dilution, which is small in
are more difficult to estimate. We adopt a bootstrap              any case compared to that of the Kepler target itself (as-
approach in which we first determine the false positive          sumed to be the brighter star). Of all possible triple con-
rate and the frequencies for the larger planets among           figurations we consider only those in which the tertiary
the KOIs, and we then proceed to study successively             star eclipses the close secondary, with the primary star
smaller planets. The only false positive sources for the        being farther removed (hierarchical structure). The two
larger planet candidates involve even larger (that is, stel-    other configurations involving a secondary star eclipsing
lar) objects, for which the frequency and distributions of      the primary or the tertiary eclipsing the primary would
the periods and eclipse depths are well known (i.e., from       typically not result in the target being promoted to a
the work of Slawson et al. 2011). Comparing the pop-            KOI, as the eclipses would be either very deep or ‘V’-
ulation of large-planet KOIs (of the ‘giant planet’ class       shaped. Thus we adopt 1/3 of 11% as the relevant fre-
previously defined) to the one of simulated false posi-          quency of triple and higher-multiplicity systems.
tives that can mimic their signals enables us to obtain           We proceed to simulate this type of false positive by
the false positive correction factor (2). This, in turn,        assigning to each Kepler target a companion star (“sec-
allows us to investigate whether the frequency of large         ondary”) with a random mass ratio q (relative to the tar-
planets is correlated with spectral type (see Sect. 6), and     get), eccentricity e, and orbital period P drawn from the
therefore to address bias (3) above. As larger planets          distributions presented by Raghavan et al. (2010), which
transiting background stars are potential false positive        are uniform in q and e and log-normal in P . We further
sources for smaller planets, once we have understood the        infer the radius and brightness of the secondary in the
giant planet population we may proceed to estimate the          Kp band from a representative 3 Gyr solar-metallicity
false positive rates and occurrence of each of the smaller      isochrone from the Padova series of Girardi et al. (2000).
planet classes in order of decreasing size, applying correc-    We then assign to each of these secondaries an eclips-
tions for the biases in the same way as described above.        ing companion, with properties (period, eclipse depth)
  We express the number Nbtp of background stars in the
                                                                  8 We do not consider background giant stars transited by a
Kepler field with larger transiting planets able to mimic
the signal of a true transiting planet in a given size class    planet as a viable blend, as the signal would likely be undetectable.
Kepler FPR and Occurrence of Planets                                                    7

taken from the catalog of Kepler eclipsing binaries by                    binaries slightly diluted by the light of a compan-
Slawson et al. (2011), as we did for background eclipsing                 ion, and we assume here that those configurations
binaries in Sect. 3.1. We additionally assign a mass ratio                would not have passed the Kepler vetting proce-
and eccentricity to the tertiary from the above distribu-                 dure.
tions.
  To determine whether each of these simulated triple                  • Tstab is 1 if the triple system is dynamically stable,
configurations constitutes a viable blend, we perform the                 and 0 otherwise;
following four tests: (1) We check whether the triple sys-             • Tcent is 1 if the triple system does not induce a
tem would be dynamically stable, to first order, using                    significant centroid motion, and 0 otherwise.
the condition for stability given by Holman & Wiegert
(1999); (2) we check whether the resulting depth of the                         3.4. Companion transiting planets
eclipse, after accounting for the light of the target, cor-
responds to the depth of a transiting planet in the size              Planets transiting a physical stellar companion to a
range under consideration; (3) we determine whether the             Kepler target can also produce a signal that will mimic
transit-like signal would be detectable, with the same              that of a smaller planet around the target itself. One
signal-to-noise criterion used earlier; and (4) we check            possible point of view regarding these scenarios is to not
whether the eclipsing binary would be angularly sepa-               consider them a false positive, as a planet is still present
rated enough from the target to induce a centroid shift.            in the system, only not of the size anticipated from the
For this last test we assign a random orbital phase to the          depth of the transit signal. However, since one of the
secondary in its orbit around the target, along with a              goals of the present work is to quantify the rate of occur-
random inclination angle from an isotropic distribution,            rence of planets in specific size categories, a configuration
a random longitude of periastron from a uniform distri-             of this kind would lead to the incorrect classification of
bution, and a random eccentricity drawn from the dis-               the object as belonging to a smaller planet class, biasing
tribution reported by Raghavan et al. (2010). The semi-             the rates of occurrence. For this reason we consider these
major axis is a function of the known masses and the                scenarios as a legitimate false positive.
orbital period. The final ingredient is the distance to                To quantify them we proceed in a similar way as for
the system, which we compute using the apparent mag-                companion eclipsing binaries in the preceding section, as-
nitude of the Kepler target as listed in the KIC and the            signing a bound stellar companion to each target in ac-
absolute magnitudes of the three stars from the Padova              cordance with the frequency and known distributions of
isochrone, ignoring extinction. If the resulting angular            binary properties from Raghavan et al. (2010), and as-
separation is smaller than the radius of the exclusion re-          signing a transiting planet to this bound companion us-
gion from centroid motion analysis, we count this as a              ing the KOI list by Batalha et al. (2012). Corrections to
viable blend.                                                       the rates of occurrence Ftp are as described in Sect. 3.2.
  The number Nceb of companion eclipsing binary con-                The total number of false positives of this kind among
figurations in the Kepler field that could mimic the signal           the Kepler targets is then
of a transiting planet in each size category is given by:                     Ntarg

          Ntarg                                                      Nctp =           Fbin Ftp Cgeom Ctpf Tdepth Tdet Tcent Tstab ,
 Nceb =           Feb/triple Cgeom Tstab Tdepth Tdet Tsec Tcent ,             i=1

          i=1                                                       where Fbin is the frequency of non-single stars in the
in which Ntarg , Cgeom , Tdepth, Tsec and Tdet represent the        solar neighborhood (44%; Raghavan et al. 2010) and the
same quantities as in previous sections, while                      remaining symbols have the same meaning as before.
                                                                      Adding up the contributions from this section and
   • Feb/triple is the frequency of eclipsing binaries in           the preceding three, the total number of false positives
     triple systems (or systems of higher multiplicity)             in the Kepler field due to physically-bound or back-
     in which the secondary star is eclipsed by the                 ground/foreground stars eclipsed by a smaller star or by
     tertiary. As indicated earlier, we adopt a fre-                a planet is Nbeb + Nbtp + Nceb + Nctp .
     quency of eclipsing binaries in the solar neighbor-
     hood of Feb = 0.79%, from the Kepler catalog by                 3.5. Eclipsing pairs involving brown dwarfs and white
     Slawson et al. (2011). The frequency of stars in                                            dwarfs
     binaries and higher-multiplicity systems has been                Because of their small size, brown dwarfs transiting a
     given by Raghavan et al. (2010) as Fbin = 44%                  star can produce signals that are very similar to those
     (where we use the notation ‘bin’ to mean ‘non-                 of giant planets. Therefore, they constitute a potential
     single’). Therefore, the chance of a random pair of            source of false positives, not only when directly orbiting
     stars to be eclipsing is Feb /Fbin = 0.0079/0.44 =             the target but also when eclipsing a star blended with the
     0.018. As mentioned earlier we will assume here                target (physically associated or not). However, because
     that one third of the triple star configurations                of their larger mass and non-negligible luminosity, other
     (with a frequency Ftriple of 11%, according to                 evidence would normally betray the presence of a brown
     Raghavan et al. 2010) have the secondary and ter-              dwarf, such as ellipsoidal variations in the light curve
     tiary as the close pair of the hierarchical system.            (for short periods), secondary eclipses, or even measur-
     Feb/triple is then equal to Feb /Fbin × 1/3 × Ftriple =        able velocity variations induced on the target star. Pre-
     0.018×1/3×0.11 = 0.00066. The two other config-                 vious Doppler searches have shown that the population
     urations in which the secondary or tertiary is eclips-         of brown dwarf companions to solar-type stars is signif-
     ing the primary would merely correspond to target              icantly smaller than that of true Jupiter-mass planets.
8                                                         Fressin et al.

Based on this, we expect the incidence of brown dwarfs            to the median ∼1 R⊙ Kepler target. The corresponding
as false positives to be negligible, and we do not consider       correction factor is then Cgeom = 0.707.
them here.                                                           The correction Ctpf to the rate of occurrence of tran-
  A white dwarf transiting a star can easily mimic the            siting planets acting as blends is the most complicated to
signal of a true Earth-size planet, since their sizes are         estimate, as it relies in part on similar calculations car-
comparable. Their masses, on the other hand, are of               ried out sequentially from the larger categories of planets,
course very different. Some theoretical predictions sug-           starting with the giant planets, as described earlier. For
gest that binary stars consisting of a white dwarf and a          this example we make use of the results of those calcu-
main-sequence star may be as frequent as main-sequence            lations presented in Sect. 5. The Ctpf factor corrects the
binary stars (see, e.g., Farmer & Agol 2003). However,            occurrence rate of planets in the 1.25–2 R⊕ super-Earth
evidence so far from the Kepler Mission seems to indi-            category (since this is the relevant class for the particu-
cate that these predictions may be off by orders of mag-           lar blended planet we have simulated) for three distinct
nitude. For example, none of the Earth-size candidates            biases in the KOI list of Batalha et al. (2012), due to in-
that have been monitored spectroscopically to determine           completeness, false positives, and the possible correlation
their radial velocities have shown any indication that the        of frequency with spectral type.
companion is as massive as a white dwarf. Also, while                For our simulated planet with Rp = 1.81 R⊕ and
transiting white dwarfs with suitable periods would be            P = 13.7 days, the incompleteness contribution to Ctpf
expected to produce many easily detectable gravitational          was estimated by computing the SNR of the signals gen-
lensing events if their frequency were as high as predicted       erated by a planet of this size transiting each individual
by Farmer & Agol (2003), no such magnification events              Kepler target, using the CDPP of each target. We find
compatible with lensing have been detected so far in the          that the transit signal could only be detected around
Kepler photometry (J. Jenkins, priv. comm.). For these            54.1% of the targets, from which the incompleteness
reasons we conclude that the relative number of eclipsing         boost is simply 1/0.541 = 1.848. The contribution to
white dwarfs among the 196 Earth-size KOIs is negligi-            Ctpf from the false positive bias was taken from Table 1
ble in comparison to other contributing sources of false          of Sect. 4 below, which summarizes the results from the
positives.                                                        bootstrap analysis mentioned earlier. That analysis in-
                                                                  dicates that the false positive rate for super-Earths is
        3.6. Example of a false positive calculation              8.8%, i.e., 91.2% of the KOIs in the 1.25–2 R⊕ are tran-
   To illustrate the process of false positive estimation         sited by true planets. Finally, the third contributor to
for planets of Earth size (0.8 R⊕ < Rp ≤ 1.25 R⊕ ), we            Ctpf addresses the possibility that small Neptunes from
present here the calculation for a single case of a false pos-    the KOI list acting as blends may be more common,
itive involving a larger planet transiting a star physically      for example, around stars of later spectral types than
bound to the Kepler target (Sect. 3.4), which happens to          earlier spectral types. Our analysis in Sect. 6.4 in fact
produce a signal corresponding to an Earth-size planet.           suggests that super-Earths have a uniform occurrence
   In this case selected at random, the target star (KIC          as a function of spectral type, and we account for this
3453569) has a mass of 0.73 M⊙ , a brightness of Kp =             absence of correlation by setting a value of 1.0 for the
15.34 in Kepler band, and a 3-hour CDPP of 133.1                  spectral-type dependence correction factor. Combining
ppm. As indicated above, the star has a 44% a priori              the three factors just described, we arrive at a value of
chance of being a multiple system based on the work of            Ctpf = 1.848 × 0.912 × 1.0 = 1.685.
Raghavan et al. (2010). We assign to the companion a                 To ascertain whether this particular false positive could
mass and an orbital period drawn randomly from the                have been detected by Kepler, we compute its signal-to-
corresponding distributions reported by these authors,            noise ratio as explained in the Appendix, in terms of
with values of 0.54 M⊙ and 67.6 yr, corresponding to a            the size of the small Neptune relative to the physically-
semimajor axis of a = 18.0 AU.                                    bound companion, the dilution factor from the target,
   This companion has an a priori chance Ftp of having a          the CDPP of the target, the number of transits observed
transiting planet equal to the number of KOIs (orbiting           (from the duration interval for the particular Kepler tar-
non-giant stars) divided by the number of non-giant stars         get divided by the period of the KOI), and the transit
(Ftp = 2,302/132,756 = 0.017) observed by Kepler in at            duration as reported by Batalha et al. (2012). The re-
least one quarter during the Q1–Q6 interval. We assign            sult, SNR = 9.44 is above the threshold (determined
to this companion star a transiting planet drawn ran-             using the detection recovery rate studied in section 5.1),
domly from the Batalha et al. (2012) list of KOIs, with           so the signal would be detectable and we set Tdet = 1.
a radius of 1.81 R⊕ (a super-Earth) and an orbital pe-               With the above SNR we may also estimate the an-
riod of 13.7 days. The diluted depth of the signal of this        gular size of the exclusion region outside of which the
transiting planet results in a 140 ppm dimming of the             multi-quarter centroid motion analysis would rule out a
combined flux of the two stars during the transit, that            blend. The value we obtain for the present example with
                                                                                                                  ′′
could be incorrectly interpreted as an Earth-size planet          the prescription given in the Appendix is 1. 1. To es-
(0.91 R⊕ ) transiting the target star. We set Tdepth = 1,         tablish whether the companion star is inside or outside
as this example pertains to false positives mimicking             of this region, we compute its angular separation from
transits of Earth-size planets. Because of the smaller ra-        the target as follows. First we estimate the distance to
dius of the stellar companion (Rcomp = 0.707 R⊙ , deter-          the system from the apparent magnitude of the target
mined with the help of our representative 3-Gyr, solar-           (Kp = 15.34) and the absolute magnitudes of the two
metallicity isochrone), that star has a correspondingly           stars, read off from our representative isochrone accord-
smaller chance that the planet we have assigned to or-            ing to their masses of 0.73 and 0.54 M⊙ . Ignoring ex-
bit it will actually transit at the given period, compared        tinction, we obtain a distance of 730 pc. Next we place
Kepler FPR and Occurrence of Planets                                           9

the companion at a random position in its orbit around          29.3 ± 3.1%, in good agreement with their observational
the target, using the known semi-major axis of 18.0 AU          result. This value is significantly larger than our over-
and the random values of other relevant orbital elements        all figure of 17.7% for giant planets because of the cut
(eccentricity, inclination angle, longitude of periastron)      at P < 25 days and the fact that the false positive rate
as reported above. With this, the angular separation            increases somewhat toward shorter periods, according to
    ′′
is 0. 02. This is about a factor of 50 smaller than the         our simulations (see Sect. 6.1).
limit from centroid motion analysis, so we conclude this           For large Neptunes we find that the false positive rate
false positive would not be detected in this way (i.e., it      decreases somewhat to 15.9%. This is due mainly to
remains a viable blend). We therefore set Tcent = 1.            the lower incidence of blends from hierarchical triples,
   Finally, the formalism by Holman & Wiegert (1999)            which can only mimic the transit depth of planets orbit-
indicates that, to first order, a hierarchical triple system     ing the largest stars in the sample, and to the relatively
like the one in this example would be dynamically un-           higher frequency of planets of this size in comparison
stable if the companion star were within 1.06 AU of the         to giant planets. The false positive rate decreases fur-
target. As their actual mean separation is much larger          ther for small Neptunes and super-Earths, and rises again
(a = 18.0 AU), we consider the system to be stable and          for Earth-size planets. The overall false positive rate of
we set Tstab = 1.                                               Kepler we find by combining all categories of planets is
   Given the results above, this case is therefore a viable     9.4%.
false positive that could be interpreted as a transiting           All of these rates depend quite strongly on how well
Earth-size planet. The chance that this would happen            we have emulated the vetting process of Kepler. We
for the particular Kepler target we have selected is given      may assess this as follows. We begin by noting that
by the product Fbin × Ftp × Cgeom × Ctpf × Tdepth × Tdet ×      before the vetting process is applied, the majority of
Tcent ×Tstab = 44%×0.017×0.707×(1.848×0.912×1.0)×               false positives are background eclipsing binaries. To es-
1 × 1 × 1 × 1 = 0.0089. We performed similar simulations        timate their numbers, we tallied all such systems falling
for each of the other Kepler targets observed between Q1        within the photometric aperture of a Kepler target and
and Q6, and found that larger planets transiting unseen         contributing more than 50% of their flux (755 cases).
companions to the targets are the dominant type of blend        Of these, 465 pass our secondary eclipse test (i.e., they
in this size class, and should account for a total of 16.3      present secondary eclipses that are detectable, and that
false positives that might be confused with Earth-size          have a depth differing by more than 3σ from the pri-
planets. Doing the same for all other types of blends that      mary eclipse). After applying the centroid test we find
might mimic Earth-size planets leads to an estimate of          that only 44.7 survive as viable blends. In other words,
24.1 false positives among the Kepler targets.                  our simulated application of the centroid test rules out
                                                                465 − 44.7 ≈ 420 eclipsing binary blends. We may com-
    4. RESULTS FOR THE FALSE POSITIVE RATE OF                   pare this with results from the actual vetting of Kepler
                           KEPLER                               as reported by Batalha et al. (2012), who indicated that
   The output of the simulations described above is sum-        1093 targets from an initial list of 1390 passed the cen-
marized in Table 1, broken down by planet class and             troid test and were included as KOIs, and were added
also by the type of scenario producing the false positive.      to the list of previously vetted KOIs from Borucki et al.
We point out here again that these results are based on         (2011). The difference, 1390 − 1093 = 297, is of the same
a revision of the detection model of Kepler (Sect. 5.1)         order of magnitude as our simulated results (∼420), pro-
rather than the a priori detection rate described in Sec-       viding a sanity check on our background eclipsing binary
tion 3. An interesting general result is that the dominant      occurrence rates as well as our implementation of the
source of false positives for all planet classes involves not   vetting process. This exercise also shows that the cen-
eclipsing binaries, but instead large planets transiting an     troid test is by far the most effective for weeding out
unseen companion to the Kepler target. This type of             blends. Even ignoring the test for secondary eclipses,
scenario is the most difficult to rule out in the vetting         the centroid analysis is able to bring down the number
process performed by the Kepler team.                           of blends involving background eclipsing binaries to only
   For giant planets our simulations project a total of 39.5    83.3 out of the original 755 in our simulations, represent-
false positives among the Kepler targets, or 17.7% of           ing a reduction of almost 90%.
the 223 KOIs that were actually identified in this size             Due to the complex nature of the simulations it is non-
category. This is significantly higher than the estimate         trivial to assign uncertainties to the false positive rates
from MJ11, who predicted a less than 5% false positive          reported in Table 1, and the values listed reflect our best
rate for this kind of objects. The relatively high fre-         knowledge of the various sources that may contribute.
quency of false positives we obtain is explained by the         Many of the ingredients in our simulations rely on counts
inherently low occurrence of giant planets in comparison        based directly on Kepler observations, such as the KOI
to the other astrophysical configurations that can mimic         list and the Kepler eclipsing binary catalog. For those
their signal. Another estimate of the false positive rate       quantities it is reasonable to adopt a Poissonian distri-
for giant planets was made recently by Santerne et al.          bution for the statistical error (ǫstat ). We have also at-
(2012), from a subsample of KOIs they followed up spec-         tempted to include contributions from inputs that do not
troscopically. They reported that 34.8 ± 6.5% of close-in       rely directly on Kepler observations. One is the uncer-
giant planets with periods shorter than 25 days, tran-          tainty in the star counts that we have adopted from the
sit depths greater than 0.4%, and brightness Kp < 14.7          Besan¸on model of Robin et al. (2003). A comparison of
                                                                       c
show radial-velocity signals inconsistent with a planetary      the simulated star densities near the center of the Kepler
interpretation, and are thus false positives. Adopting the      field with actual star counts (R. Gilliland, priv. comm.)
same sample restrictions we obtain a false positive rate of     shows agreement within 15%. We may therefore use this
10                                                     Fressin et al.

as an estimate of the error for false positives involving      mine the occurrence of planets in each of our five planet
background stars (ǫback ). As an additional test we com-       classes and for each of 11 period bins, which comes to 55
pared the Besan¸on results with those from a different
                   c                                           free parameters. We use the rates of occurrence found
Galactic structure model. Using the Trilegal model of          by Howard et al. (2012) for our initial guess (prior), with
Girardi et al. (2005) we found that the stellar densities      extrapolated values for the planet sizes (below 2 R⊕ ) and
from the latter are approximately 10% smaller, while the       periods (longer than 50 days) that they did not consider
distribution of stars in terms of brightness and spectral      in their study. Each star is initially assigned a global
type is similar (background stars using Trilegal are only      chance of hosting a planet equal to the sum of these 55
0.47 mag brighter in the R band, and 100 K hotter, on          occurrence rates. Our baseline assumption is that planet
average). We adopt the larger difference of 15% as our          occurrence is independent of the spectral type of the host
ǫback uncertainty.                                             star, but we later investigate whether this hypothesis is
  We have also considered the additional uncertainty           consistent with the observations (i.e., with the actual dis-
coming from our modeling of the detection level of the         tribution of KOI spectral types; Sect. 6).
Kepler pipeline (ǫdetec ). While we initially adopted a           We have also assumed that the planet sizes are log-
nominal detection threshold corresponding to the ex-           arithmically distributed between the size boundaries of
pected detection rate following Jenkins et al. (2010), in      each planet class, and that their periods are distributed
practice we find that this is somewhat optimistic, and          logarithmically within each period bin. For each of
below we describe a revision of that condition that pro-       our simulated planets we compute the geometric tran-
vides better agreement with the actual performance of          sit probability (which depends on the stellar radius) as
Kepler as represented by the published list of KOIs by         well as the SNR of its combined transit signals (see the
Batalha et al. (2012). Experiments in which we repeated        Appendix for details on the computation of the SNR).
the simulations with several prescriptions for the detec-      We assigned to each planet a random inclination an-
tion limit yielded a typical difference in the results com-     gle, and discarded cases that are not transiting or that
pared to the model we finally adopted (see below) that          would not be detectable by Kepler. We assigned also a
may be used as an estimate of ǫdetec . We note that the        random eccentricity and longitude of periastron, with ec-
difference between the results obtained from all these de-      centricities drawn from a Rayleigh distribution, following
tection prescriptions and those that use the a priori de-      Moorhead et al. (2011). These authors found that such
tection rate is approximately three times ǫdetec . This        a distribution with a mean eccentricity in the range 0.1–
suggests that the a priori detection model may be used         0.25 provides a satisfactory representation of the distri-
to predict reasonable lower limits for the false positive      bution of transit durations for KOIs cooler than 5100 K.
rates, as well as the planet occurrence rates discussed        We chose to adopt an intermediate value for the mean
later.                                                         of the Rayleigh distribution of e = 0.175, and used it for
  Based on the above, we take the total error for our          stars of all spectral types. Allowing for eccentric orbits
false positive population involving background stars to        alters the geometric probability of a transit as well as
be ǫtot =      ǫ2 + ǫ2 + ǫ2 . We adopt a similar
                stat   back   detec
                                                               their duration (and thus their SNR). Finally, we com-
expression for false positives involving stars physically      pare our simulated population of detectable transiting
associated with the target, without the ǫback term.            planets with that of the KOIs minus our simulated false
                                                               positives population, and we correct our initial assump-
      5. COMPUTING THE PLANET OCCURRENCE                       tion for the distribution of planets as a function of size
   The KOI list of Batalha et al. (2012) is composed of        and period.
both true planets and false positives. The true planet            To estimate uncertainties for our simulated true planet
population may be obtained by subtracting our simu-            population we adopt a similar prescription as for the false
lated false positives from the KOIs. However, this dif-        positive rates, and compute ǫtot = ǫ2 + ǫ2 , where
                                                                                                       stat   detec
ference corresponds only to planets in the Kepler field         the two contributions have the same meaning as before.
that both transit their host star and that are detectable      While the two terms in ǫtot have roughly the same aver-
by Kepler. In order to model the actual distribution           age impact on the global uncertainty, the statistical error
of planets in each size class and as a function of their       tends to dominate when the number of KOIs in a specific
orbital period, we must correct for the geometric tran-        size and period bin is small, and the detection error is
sit probabilities and for incompleteness. Our approach,        more important for smaller planets and longer periods.
therefore, is to not only simulate false positives, as de-        Modeling the detection limits of Kepler is a central
scribed earlier, but to also simulate in detail the true       component of the process, as the incompleteness correc-
planet population in such a way that the sum of the two        tions can be fairly large in some regimes (i.e., for small
matches the published catalog of KOIs, after accounting        signals and/or long periods). Exactly how this is done
for the detectability of both planets and false positives.     affects both the false positive rates and the planet oc-
The planet occurrence rates we will derive correspond          currence rates. We have therefore gone to some effort to
strictly to the average number of planets per star.            investigate the accuracy of the nominal detection model
   For our planet simulations we proceed as follows. We        (Sect. 3) according to which 50% of signals are consid-
assign a random planet to each Kepler target that has          ered detected by the Kepler pipeline if their SNR exceeds
been observed between Q1 and Q6, taking the planet oc-         a threshold of 7.1, and 99.9% of signals are detected for
currences per period bin and size class to be adjustable       a SNR over 10.1. We describe this in the following.
variables. We have elected to use the same logarithmic
period bins as adopted by Howard et al. (2012), to ease
comparisons, with additional bins for longer periods than                       5.1. Detection model
they considered (up to ∼400 days). We seek to deter-             Burke et al. (2012) have reported that a significant
Kepler FPR and Occurrence of Planets                                                               11

fraction of the Kepler targets have not actually been
searched for transit signals down to the official SNR
threshold of 7.1. This is due to the fact that spurious
detections with SNR over 7.1 can mask real planet sig-                             A priori detection recovery rate (50 % for SNR = 7.1)
nals with lower SNR in the same light curve. Pont et al.
(2006) have shown that time-correlated noise features in                                               KOI SNR (Batalha et al. 2012)
                                                                                                       KOI computed SNR (based on CDPP)
photometric time-series can produce spurious detections
well over this threshold, and that this has significant im-                                             Simulated False Positives SNR
plications for the yield of transit surveys. We note also                                              Simulated FP + planets SNR
that the vetting procedure that led to the published KOI
list of Batalha et al. (2012) involves human intervention
at various stages, and is likely to have missed some low-
SNR candidates for a variety of reasons. Therefore, the
assumption that most signals with SNR > 7.1 have been
detected and are present in the KOI list is probably op-
timistic. Nevertheless, this hypothesis is useful in that
it sets lower limits for the planet occurrence rates, and
an upper limit for the false positive rates. For example,        Fig. 1.— Comparison of signal-to-noise ratio distributions to
                                                              evaluate the detection model (signal recovery rate) of Kepler. The
the overall false positive rate (all planet classes) we ob-   dotted black curve corresponds to the KOIs from Batalha et al.
tain when following the a priori detection rate is 14.9%,     (2012), with the SNRs from their Table 9 adjusted to correspond
compared to our lower rate of 9.4% when using a more          to Q1–Q6, for consistency with our simulations (see text). There is
accurate model given below.                                   good agreement with the distribution of SNRs computed from the
                                                              CDPP (Christiansen et al. 2012) (red curve). The green curve is
   The clearest evidence that the Kepler team has missed      the SNR distribution of our simulated population of false positives
a significant fraction of the low SNR candidates is seen       and true planets (which by construction matches the size/period
in Figure 1. This figure displays the distribution of the      distribution of actual KOIs). It assumes the nominal Kepler de-
SNRs of the actual KOIs, and compares it with the SNRs        tection model in which 50% of the signals with SNR > 7.1 and
                                                              99.9% of the signals with SNR > 10.1 are detectable (solid black
for our simulated population of false positives and true      lines). The simulated false positives are shown separately by the
planets. To provide for smoother distributions we have        orange curve. The dotted black curve peaks at a larger SNR than
convolved the individual SNRs with a Gaussian with            the green curve, indicating that the nominal detection model is
a width corresponding to 20% of each SNR (a kernel            not accurate: the effective recovery threshold must be significantly
                                                              higher than 7.1.
density estimation technique). The SNRs for the KOIs
presented by Batalha et al. (2012) were originally com-
puted based on observations from Q1 to Q8. For consis-        model requires modification.
tency with our simulations, which only use Q1–Q6, we             It is not possible to adjust the SNR-dependent recov-
have therefore degraded the published SNRs accordingly.       ery rate simultaneously with the occurrence rates of plan-
Also shown in the figure is the SNR distribution for the       ets in our simulations, as the two are highly degenerate.
KOIs computed with the prescription described in the          However, we find that we are able to reach convergence
Appendix based on the CDPP (Christiansen et al. 2012).        if we assume the following:
Several conclusions can be drawn. One is that there is            • the recovery rate is represented by a monotonically
very good agreement between the SNRs computed by                    rising function, rather than a fixed threshold, in-
us from the CDPP (red line) and those presented by                  creasing from zero at some low SNR to 100% for
Batalha et al. (2012) (adjusted to Q1–Q6; black line).              some higher SNR;
Importantly, this validates the CDPP-based procedure
used in this paper to determine the detectability of a sig-       • the model for the recovery rate function should al-
nal. It also suggests that the KOI distribution contains            low for a good match of the distribution of SNRs
useful information on the actual signal recovery rate of            separately for each of our five planet classes.
Kepler. Secondly, we note that a small number of KOIs
(∼70) have SNRs (either computed from the CDPP or             A number of simple models were tried, and we used the
adjusted to Q1–Q6) that are actually below the nominal        Bayesian Information Criterion (BIC) to compare them
threshold of 7.1. Most of these lower SNRs are values we      and make a choice: BIC = χ2 + k ln n. In this expres-
degraded from Q1–Q8 to Q1–Q6, and others are for KOIs         sion χ2 was computed in the usual way by comparing our
that were not originally found by the Kepler pipeline,        simulated SNRs for the population of false positives and
but were instead identified later by further examination       true planets with the one for the KOIs; k is the number
of systems already containing one or more candidates.         of free parameters of the model, and n is the number of
Thirdly and most importantly, the peak of the SNR dis-        bins in the histogram of the SNR distributions. We con-
tribution from our simulations (green line in the figure),     sidered the five planet categories at the same time and
which by construction match the size and period distri-       computed the BIC by summing up the corresponding
butions of the KOIs and use the a priori detection model,     χ2 values, with n therefore being the sum of the num-
is shifted to smaller values than the one for KOIs. This      ber of bins for all classes. The model that provides the
suggests that the a priori detection model described in       best BIC involves a simple linear ramp for the recovery
Sect. 3 in which 50% of the signals with SNR > 7.1 (and       rate between SNRs of 6 and 16, in other words, no tran-
99.9% of the signals SNR > 10.1) have been detected as        sit signal with a SNR below 6 is recovered, and every
KOIs is not quite accurate, and indicates the detection       transit signal is recovered over 16. Figure 2 (to be com-
                                                              pared with Figure 1) shows the much better agreement
12                                                                                 Fressin et al.



            Detection recovery rate - ramp from SNR = 6 (0%) to SNR = 16 (100 %)
                                                                                                                                  KOIs (Batalha et al. 2012)

                                           KOI SNR (Batalha et al. 2012)                                                          Simulated False Positives
                                           KOI computed SNR (based on CDPP)                                                       Simulated FP + planets

                                           Simulated False Positives SNR
                                           Simulated FP + planets SNR




  Fig. 2.— Same as Figure 1 but with a detection model (solid                                Fig. 3.— Histogram of the periods of giant planet KOIs from
black lines) ranging linearly from 0% for candidates with a SNR                            Batalha et al. (2012) (dotted black lines), simulated false positives
of 6 or lower to 100% for candidates above a SNR of 16. With                               (orange), and the sum of simulated false positives and simulated
this model our simulations can match not only the size and period                          planets (green).
distribution of KOIs, but also the distribution of their SNRs.


between the SNR distribution of our simulated popula-                                                      6.1. Giant planets (6–22 R⊕ )
tion of false positives and planets and the SNRs for the                                      Planet occurrence rates and false positive rates are in-
KOIs. We adopt this detection model for the remainder                                      terdependent. As described earlier, the bootstrap ap-
of the paper.                                                                              proach we have adopted to determine those properties
                                                                                           for the different planet classes begins with the giant plan-
           6. PLANET OCCURRENCE RESULTS                                                    ets, as only larger objects (stars) with well understood
  This section presents the results of our joint simulation                                properties can mimic their signals. The process then
of false positives and true planets for each of the planet                                 continues with smaller planets in a sequential fashion.
categories, and compares their distributions with those                                       The frequencies of giant planets per period bin were
of actual KOIs from Batalha et al. (2012). In particular,                                  adjusted until their distribution added to that of false
since we have assumed no correlation between the planet                                    positives reproduces the period spectrum of actual KOIs.
frequencies and the spectral type of the host star (the                                    This is illustrated in Figure 3, where the simulated and
simplest model), here we examine whether there is any                                      actual period distributions (green and dotted black lines)
such dependence, using the stellar mass as a proxy for                                     match on average, though not in detail because of the
spectral type. As in some of the previous figures, we                                       statistical nature of the simulations. The frequency of
display generalized histograms computed with a kernel                                      false positives (orange line) is seen to increase somewhat
density estimator approach in which the stellar masses                                     toward shorter periods, peaking at P ∼ 3–4 days. Sim-
are convolved with a Gaussian function. We adopt a                                         ilar adjustments to the frequencies by period bin have
Gaussian width (σ) of 20%, which we consider to be a                                       been performed successively for large Neptunes, small
realistic estimate of the mass uncertainties in the KIC.                                   Neptunes, super-Earths, and Earth-size planets.
  The total frequencies (average number of planets per                                        We find that the overall frequency of giant planets
star) are reported in Table 2 and Table 3. The first table                                  (planets per star) in orbits with periods up to 418 days is
presents the occurrence of planets of different classes per                                 close to 5.2%. Figure 4 (top panel) displays the distribu-
period bin of equal size on a logarithmic scale. The bin                                   tion of simulated false positives for this class of planets
with the longest period for which we can provide an oc-                                    as a function of the mass of the host star (orange curve).
currence estimate differs for the different planet classes.                                  After adding to these the simulated planets, we obtain
We do not state results for period bins in which the num-                                  the green curve that represents stars that either have a
ber of KOIs is less than 1σ larger than the number of false                                true transiting giant planet or that constitute a false pos-
positives; for these size/period ranges the current list of                                itive mimicking a planet in this category. The compar-
Kepler candidates is not large enough to provide reli-                                     ison with the actual distribution of KOIs (black dotted
able values. Cumulative rates of occurrence of planets as                                  curve) shows significant differences, with a Kolmogorov-
a function of period are presented in Table 3. Another                                     Smirnov (K-S) probability of 0.7%. This suggests a pos-
interesting way to present planet occurrence results is                                    sible correlation between the occurrence of giant planets
to provide the number of stars that have at least one                                      and spectral type (mass), whereas our simulations have
planet in various period ranges. This requires a differ-                                    assumed none. In particular, the simulations produce an
ent treatment of KOIs with multiple planet candidates                                      excess of giant planets around late-type stars, implying
for the same star. To compute these numbers, shown in                                      that in reality there may be a deficit for M dwarfs. The
Table 4, we repeated our simulations by removing from                                      opposite seems to be true for G and K stars. Doppler
the KOI list the planet candidates beyond the inner one                                    surveys have also shown a dependence of the rates of
in the considered size range, for KOIs with multiple can-                                  occurrence of giant planets with spectral type. For ex-
didates.                                                                                   ample, Johnson et al. (2010) reported a roughly linear
Kepler FPR and Occurrence of Planets                                                       13

increase as a function of stellar mass, with an estimate
of about 3% for M dwarfs. We find a similar figure of
3.6 ± 1.7%. As is the case for the RV surveys, our fre-                                           Giant planets (6 REarth -2 RJup)
quency increases for G and K stars to 6.1 ± 0.9%, but




                                                               (per 0.1 M -bin)
then reverses for F (and warmer) stars to 4.3 ± 1.0%,                                             223 KOIs, FPR = 17.7 %
while the Doppler results suggest a higher frequency ap-
proaching 10%. We point out, however, that there are                                                 KOIs (Batalha et al. 2012)
rather important differences between the two samples:                                                 Simulated False Positives
(1) the estimates from radial velocity surveys extend out                                            Simulated FP + planets
to orbital separations of 2.5 AU, while our study is only
reasonably complete to periods of ∼400 days (1.06 AU for
a solar-type star); (2) the samples are based on two very
different characteristics: the planet mass for RV surveys,
and planet radius for Kepler ; and (3) there may well be
a significant correlation between the period distribution
of planets and their host star spectral type. Indeed, the
results from the study of planets orbiting A-type stars
by Bowler et al. (2010) provides an explanation for the
apparent discrepancy between RV and Kepler results for
the occurrence of giant planets orbiting hot stars: Fig-                                          KS prob = 0.7 %
ure 1 of the above paper shows that no Doppler planets
have been discovered with semimajor axes under 0.6 AU
for stellar masses over 1.5,M⊙ , creating a ‘planet desert’
in that region. Thus, the Doppler surveys find the more
common longer-period planets around the hotter stars,
and Kepler has found the rarer planets close-in.
  Averaged over all spectral types, the frequency of giant
planets up to orbital periods of 418 days is approximately
5.2% (Table 3).

             6.2. Large Neptunes (4–6 R⊕ )
  Our simulations result in an overall frequency of large
Neptunes with periods up to 418 days of approximately
3.2%. The distribution in terms of host star mass
                                                                Fig. 4.— (Top): Generalized histogram of the stellar masses of
is shown in Figure 5, and indicates a good match to           KOI host stars in the giant planet category (dotted black curve),
the distribution of the large Neptune-size KOIs from          compared against the results of our Monte-Carlo simulation of the
Batalha et al. (2012) (K-S probability of 23.3%). We          Kepler targets observed during Q1–Q6. The total number of KOIs
conclude that there is no significant dependence of the        with main-sequence host stars in this category (223) and the overall
                                                              false positive rate (FPR = 17.7%; Table 1) are indicated. Simu-
occurrence rate of planets in this class with the spectral    lated false positives are shown in orange, and the sum of these
type of the host star.                                        and the simulated planet population are represented in green. The
                                                              green and dotted black curves are statistically different. (Bottom):
                                                              cumulative distribution of stellar masses from the top panel. Our
             6.3. Small Neptunes (2–4 R⊕ )                    assumed uniform occurrence of giant planets as a function of the
  The overall rate of occurrence (planets per star) of        host star spectral type (mass) results in a distribution that is statis-
                                                              tically different from the actual KOI distribution (K-S probability
small Neptunes rises significantly compared to that of         of 0.7%). The KOI distribution indicates that G-K stars are more
larger planets, reaching 31% out to periods of 245 days.      likely to host giant planets than F- and M-type stars.
Due to small-number statistics we are unable to provide
reliable estimates for periods as long as those considered
for the larger planets (up to 418 days). The logarith-        with an interesting result they reported in a study based
mic distribution of sizes we have assumed within each         on the first three quarters of Kepler data. They found
planet category allows for a satisfactory fit to the ac-       that small Neptunes are more common around late-type
tual KOI distributions in each class (with separate K-S       stars than early-type stars, and that the chance, f (Teff ),
probabilities above 5%), with the exception of the small      that a star has a planet in the 2–4 R⊕ range depends
Neptunes. As noted earlier, the increase in planet oc-        roughly linearly on its temperature. Specifically, they
currence toward smaller radii for these objects is very       proposed f (Teff ) = f0 + kT (Teff − 5100 K)/1000 K, valid
steep (Figure 7). We find that dividing the small Nep-         over the temperature range from 3600 K to 7100 K, with
tunes into two subclasses (two radius bins of the same        coefficients f0 = 0.165 ± 0.011 and kT = −0.081 ± 0.011.
logarithmic size: 2–2.8 R⊕ , and 2.8–4 R⊕ ) we are able to      In addition to the use in the present work of the con-
obtain a much closer match to the KOI population (K-S         siderably expanded KOI list from Batalha et al. (2012)
probability of 6%) with similar logarithmic distributions     (which is roughly twice the size of the KOI list from
within each sub-bin as assumed before.                        Borucki et al. 2011 used by Howard et al. 2012), there
  In our analysis we have deliberately chosen the size        are a number of other significant differences between our
range for small Neptunes to be the same as that adopted       analysis and theirs including the fact that we account
by Howard et al. (2012), to facilitate the comparison         for false positives, and we use a different model for the
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets
The false positive_rate_of_kepler_and_the_occurrence_of_planets

Contenu connexe

Tendances

Kepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planet
Kepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planetKepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planet
Kepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planetSérgio Sacani
 
An earth sized_planet_in_the_habitable_zone_of_a_cool_star
An earth sized_planet_in_the_habitable_zone_of_a_cool_starAn earth sized_planet_in_the_habitable_zone_of_a_cool_star
An earth sized_planet_in_the_habitable_zone_of_a_cool_starSérgio Sacani
 
The harps n-rocky_planet_search_hd219134b_transiting_rocky_planet
The harps n-rocky_planet_search_hd219134b_transiting_rocky_planetThe harps n-rocky_planet_search_hd219134b_transiting_rocky_planet
The harps n-rocky_planet_search_hd219134b_transiting_rocky_planetSérgio Sacani
 
The Hunt for Exoplanets
The Hunt for ExoplanetsThe Hunt for Exoplanets
The Hunt for ExoplanetsAnamul Haque
 
Jenkins stratakeynote
Jenkins stratakeynoteJenkins stratakeynote
Jenkins stratakeynoteJon Jenkins
 
Exoplanets definitivo (1)
Exoplanets definitivo (1)Exoplanets definitivo (1)
Exoplanets definitivo (1)Pablo Del Cerro
 
Planets, stars and stellar systems2
Planets, stars and stellar systems2Planets, stars and stellar systems2
Planets, stars and stellar systems2Springer
 
Exoplanets grade5 2015
Exoplanets grade5 2015Exoplanets grade5 2015
Exoplanets grade5 2015Paul Green
 
Habiatal Zone (outside our solar system)
Habiatal Zone (outside our solar system)Habiatal Zone (outside our solar system)
Habiatal Zone (outside our solar system)Bob Smullen
 
A closely packed system of low mass, low-density planets transiting kepler-11
A closely packed system of low mass, low-density planets transiting kepler-11A closely packed system of low mass, low-density planets transiting kepler-11
A closely packed system of low mass, low-density planets transiting kepler-11Sérgio Sacani
 
Exoplanet hunters rethink search for alien life
Exoplanet hunters rethink search for alien lifeExoplanet hunters rethink search for alien life
Exoplanet hunters rethink search for alien lifeSérgio Sacani
 
Batalha press aas20110110
Batalha press aas20110110Batalha press aas20110110
Batalha press aas20110110Sérgio Sacani
 
New Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper BeltNew Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper BeltSOCIEDAD JULIO GARAVITO
 
Dark Energy
Dark EnergyDark Energy
Dark Energypixiejen
 
New Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper BeltNew Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper BeltSOCIEDAD JULIO GARAVITO
 

Tendances (18)

Kepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planet
Kepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planetKepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planet
Kepler 413b a_slightly_misaligned_neptune_size_transiting_circumbinary_planet
 
Life on exoplanets
Life on exoplanetsLife on exoplanets
Life on exoplanets
 
An earth sized_planet_in_the_habitable_zone_of_a_cool_star
An earth sized_planet_in_the_habitable_zone_of_a_cool_starAn earth sized_planet_in_the_habitable_zone_of_a_cool_star
An earth sized_planet_in_the_habitable_zone_of_a_cool_star
 
The harps n-rocky_planet_search_hd219134b_transiting_rocky_planet
The harps n-rocky_planet_search_hd219134b_transiting_rocky_planetThe harps n-rocky_planet_search_hd219134b_transiting_rocky_planet
The harps n-rocky_planet_search_hd219134b_transiting_rocky_planet
 
The Hunt for Exoplanets
The Hunt for ExoplanetsThe Hunt for Exoplanets
The Hunt for Exoplanets
 
Jenkins stratakeynote
Jenkins stratakeynoteJenkins stratakeynote
Jenkins stratakeynote
 
Exoplanets definitivo (1)
Exoplanets definitivo (1)Exoplanets definitivo (1)
Exoplanets definitivo (1)
 
Planets, stars and stellar systems2
Planets, stars and stellar systems2Planets, stars and stellar systems2
Planets, stars and stellar systems2
 
Exoplanets grade5 2015
Exoplanets grade5 2015Exoplanets grade5 2015
Exoplanets grade5 2015
 
Habiatal Zone (outside our solar system)
Habiatal Zone (outside our solar system)Habiatal Zone (outside our solar system)
Habiatal Zone (outside our solar system)
 
Ian Morison
Ian MorisonIan Morison
Ian Morison
 
A closely packed system of low mass, low-density planets transiting kepler-11
A closely packed system of low mass, low-density planets transiting kepler-11A closely packed system of low mass, low-density planets transiting kepler-11
A closely packed system of low mass, low-density planets transiting kepler-11
 
Exoplanets
ExoplanetsExoplanets
Exoplanets
 
Exoplanet hunters rethink search for alien life
Exoplanet hunters rethink search for alien lifeExoplanet hunters rethink search for alien life
Exoplanet hunters rethink search for alien life
 
Batalha press aas20110110
Batalha press aas20110110Batalha press aas20110110
Batalha press aas20110110
 
New Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper BeltNew Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper Belt
 
Dark Energy
Dark EnergyDark Energy
Dark Energy
 
New Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper BeltNew Horizon: The First Mission to the Pluto System and the Kuiper Belt
New Horizon: The First Mission to the Pluto System and the Kuiper Belt
 

En vedette

Massive clusters in_the_inner_regions_of_ngc 1365
Massive clusters in_the_inner_regions_of_ngc 1365Massive clusters in_the_inner_regions_of_ngc 1365
Massive clusters in_the_inner_regions_of_ngc 1365Sérgio Sacani
 
Unexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giant
Unexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giantUnexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giant
Unexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giantSérgio Sacani
 
Geologic mapping and_characterization_of_gale_crater
Geologic mapping and_characterization_of_gale_craterGeologic mapping and_characterization_of_gale_crater
Geologic mapping and_characterization_of_gale_craterSérgio Sacani
 
Pismis 241 the_stellar_upper_mass_limit_preserved
Pismis 241 the_stellar_upper_mass_limit_preservedPismis 241 the_stellar_upper_mass_limit_preserved
Pismis 241 the_stellar_upper_mass_limit_preservedSérgio Sacani
 
Discovery of carbon_radio_recombination_lines_in_m82
Discovery of carbon_radio_recombination_lines_in_m82Discovery of carbon_radio_recombination_lines_in_m82
Discovery of carbon_radio_recombination_lines_in_m82Sérgio Sacani
 
The herschel fronax_cluster
The herschel fronax_clusterThe herschel fronax_cluster
The herschel fronax_clusterSérgio Sacani
 
Glimpse of Gale Crater Architecture
Glimpse of Gale Crater ArchitectureGlimpse of Gale Crater Architecture
Glimpse of Gale Crater ArchitectureFreyk John Geeris
 

En vedette (9)

Massive clusters in_the_inner_regions_of_ngc 1365
Massive clusters in_the_inner_regions_of_ngc 1365Massive clusters in_the_inner_regions_of_ngc 1365
Massive clusters in_the_inner_regions_of_ngc 1365
 
Unexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giant
Unexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giantUnexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giant
Unexpectedly large mass_loss_during_the_thermal_pulse_cycle_of_the_red_giant
 
Geologic mapping and_characterization_of_gale_crater
Geologic mapping and_characterization_of_gale_craterGeologic mapping and_characterization_of_gale_crater
Geologic mapping and_characterization_of_gale_crater
 
Pismis 241 the_stellar_upper_mass_limit_preserved
Pismis 241 the_stellar_upper_mass_limit_preservedPismis 241 the_stellar_upper_mass_limit_preserved
Pismis 241 the_stellar_upper_mass_limit_preserved
 
Alien Dome - Mars 3-D
Alien Dome - Mars 3-DAlien Dome - Mars 3-D
Alien Dome - Mars 3-D
 
Discovery of carbon_radio_recombination_lines_in_m82
Discovery of carbon_radio_recombination_lines_in_m82Discovery of carbon_radio_recombination_lines_in_m82
Discovery of carbon_radio_recombination_lines_in_m82
 
The herschel fronax_cluster
The herschel fronax_clusterThe herschel fronax_cluster
The herschel fronax_cluster
 
Ph circumbinary final
Ph circumbinary finalPh circumbinary final
Ph circumbinary final
 
Glimpse of Gale Crater Architecture
Glimpse of Gale Crater ArchitectureGlimpse of Gale Crater Architecture
Glimpse of Gale Crater Architecture
 

Similaire à The false positive_rate_of_kepler_and_the_occurrence_of_planets

Validation of twelve_small_kepler_transiting_planets_in_the_habitable_zone
Validation of twelve_small_kepler_transiting_planets_in_the_habitable_zoneValidation of twelve_small_kepler_transiting_planets_in_the_habitable_zone
Validation of twelve_small_kepler_transiting_planets_in_the_habitable_zoneSérgio Sacani
 
A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...
A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...
A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...Sérgio Sacani
 
The Transit Method for Exoplanet Detection
The Transit Method for Exoplanet DetectionThe Transit Method for Exoplanet Detection
The Transit Method for Exoplanet DetectionKennedy Jnr Izuagbe
 
An ancient extrasolar_system_with_five_sub_earth_size_planets
An ancient extrasolar_system_with_five_sub_earth_size_planetsAn ancient extrasolar_system_with_five_sub_earth_size_planets
An ancient extrasolar_system_with_five_sub_earth_size_planetsSérgio Sacani
 
Spatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_b
Spatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_bSpatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_b
Spatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_bSérgio Sacani
 
The kepler 10_planetary_system_revisited_by_harps
The kepler 10_planetary_system_revisited_by_harpsThe kepler 10_planetary_system_revisited_by_harps
The kepler 10_planetary_system_revisited_by_harpsSérgio Sacani
 
Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...
Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...
Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...Sérgio Sacani
 
Transit timing observations from kepler i. statistical analysis of the first...
Transit timing observations from kepler  i. statistical analysis of the first...Transit timing observations from kepler  i. statistical analysis of the first...
Transit timing observations from kepler i. statistical analysis of the first...Sérgio Sacani
 
Kepler 34 b_e_kepler_35_b
Kepler 34 b_e_kepler_35_bKepler 34 b_e_kepler_35_b
Kepler 34 b_e_kepler_35_bSérgio Sacani
 
Transiting circumbinary planets_kepler-34b_and_35b
Transiting circumbinary planets_kepler-34b_and_35bTransiting circumbinary planets_kepler-34b_and_35b
Transiting circumbinary planets_kepler-34b_and_35bSérgio Sacani
 
Kepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giant
Kepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giantKepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giant
Kepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giantSérgio Sacani
 
A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...
A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...
A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...Sérgio Sacani
 
Complete essay on exoplanet
Complete essay on exoplanetComplete essay on exoplanet
Complete essay on exoplanetAnamul Haque
 
Kepler 47 a-transiting_circumbinary
Kepler 47 a-transiting_circumbinaryKepler 47 a-transiting_circumbinary
Kepler 47 a-transiting_circumbinarySérgio Sacani
 
Comets: Delving into the Heart of the Matter
Comets: Delving into the Heart of the MatterComets: Delving into the Heart of the Matter
Comets: Delving into the Heart of the MatterRyan Laird
 

Similaire à The false positive_rate_of_kepler_and_the_occurrence_of_planets (20)

Validation of twelve_small_kepler_transiting_planets_in_the_habitable_zone
Validation of twelve_small_kepler_transiting_planets_in_the_habitable_zoneValidation of twelve_small_kepler_transiting_planets_in_the_habitable_zone
Validation of twelve_small_kepler_transiting_planets_in_the_habitable_zone
 
A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...
A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...
A super earth_sized_planet_orbiting_in_or_near_the_habitable_zone_around_sun_...
 
Research Poster PPT
Research Poster PPTResearch Poster PPT
Research Poster PPT
 
The Transit Method for Exoplanet Detection
The Transit Method for Exoplanet DetectionThe Transit Method for Exoplanet Detection
The Transit Method for Exoplanet Detection
 
An ancient extrasolar_system_with_five_sub_earth_size_planets
An ancient extrasolar_system_with_five_sub_earth_size_planetsAn ancient extrasolar_system_with_five_sub_earth_size_planets
An ancient extrasolar_system_with_five_sub_earth_size_planets
 
Spatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_b
Spatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_bSpatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_b
Spatially resolved images_of_dust_belt_around_the_planet_hosting_subgiant_κ_cr_b
 
Nature10780
Nature10780Nature10780
Nature10780
 
The kepler 10_planetary_system_revisited_by_harps
The kepler 10_planetary_system_revisited_by_harpsThe kepler 10_planetary_system_revisited_by_harps
The kepler 10_planetary_system_revisited_by_harps
 
Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...
Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...
Know the star_know_the_planet_discovery_of_l_ate_type_companions_to_two_exopl...
 
Transit timing observations from kepler i. statistical analysis of the first...
Transit timing observations from kepler  i. statistical analysis of the first...Transit timing observations from kepler  i. statistical analysis of the first...
Transit timing observations from kepler i. statistical analysis of the first...
 
Eso1134b
Eso1134bEso1134b
Eso1134b
 
1803.07867.pdf
1803.07867.pdf1803.07867.pdf
1803.07867.pdf
 
Kepler 34 b_e_kepler_35_b
Kepler 34 b_e_kepler_35_bKepler 34 b_e_kepler_35_b
Kepler 34 b_e_kepler_35_b
 
Transiting circumbinary planets_kepler-34b_and_35b
Transiting circumbinary planets_kepler-34b_and_35bTransiting circumbinary planets_kepler-34b_and_35b
Transiting circumbinary planets_kepler-34b_and_35b
 
PodHandler.pptx
PodHandler.pptxPodHandler.pptx
PodHandler.pptx
 
Kepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giant
Kepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giantKepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giant
Kepler 432b a_massive_planet_in_a_highly_eccentric_orbit_transiting_a_red_giant
 
A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...
A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...
A herschel and_apex_census_of_the_reddest_sources_in_orion_searching_for_the_...
 
Complete essay on exoplanet
Complete essay on exoplanetComplete essay on exoplanet
Complete essay on exoplanet
 
Kepler 47 a-transiting_circumbinary
Kepler 47 a-transiting_circumbinaryKepler 47 a-transiting_circumbinary
Kepler 47 a-transiting_circumbinary
 
Comets: Delving into the Heart of the Matter
Comets: Delving into the Heart of the MatterComets: Delving into the Heart of the Matter
Comets: Delving into the Heart of the Matter
 

Plus de Sérgio Sacani

Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsSérgio Sacani
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bSérgio Sacani
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksSérgio Sacani
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bSérgio Sacani
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Sérgio Sacani
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTSérgio Sacani
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...Sérgio Sacani
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...Sérgio Sacani
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoSérgio Sacani
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsSérgio Sacani
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsSérgio Sacani
 
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Sérgio Sacani
 
The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...
The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...
The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...Sérgio Sacani
 
Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...
Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...
Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...Sérgio Sacani
 
First Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPR
First Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPRFirst Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPR
First Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPRSérgio Sacani
 
The Sun’s differential rotation is controlled by high- latitude baroclinicall...
The Sun’s differential rotation is controlled by high- latitude baroclinicall...The Sun’s differential rotation is controlled by high- latitude baroclinicall...
The Sun’s differential rotation is controlled by high- latitude baroclinicall...Sérgio Sacani
 
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGN
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGNHydrogen Column Density Variability in a Sample of Local Compton-Thin AGN
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGNSérgio Sacani
 
Huygens - Exploring Titan A Mysterious World
Huygens - Exploring Titan A Mysterious WorldHuygens - Exploring Titan A Mysterious World
Huygens - Exploring Titan A Mysterious WorldSérgio Sacani
 
The Radcliffe Wave Of Milk Way is oscillating
The Radcliffe Wave Of Milk Way  is oscillatingThe Radcliffe Wave Of Milk Way  is oscillating
The Radcliffe Wave Of Milk Way is oscillatingSérgio Sacani
 
Thermonuclear explosions on neutron stars reveal the speed of their jets
Thermonuclear explosions on neutron stars reveal the speed of their jetsThermonuclear explosions on neutron stars reveal the speed of their jets
Thermonuclear explosions on neutron stars reveal the speed of their jetsSérgio Sacani
 

Plus de Sérgio Sacani (20)

Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune WaterworldsBiogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
Biogenic Sulfur Gases as Biosignatures on Temperate Sub-Neptune Waterworlds
 
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 bAsymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
Asymmetry in the atmosphere of the ultra-hot Jupiter WASP-76 b
 
Formation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disksFormation of low mass protostars and their circumstellar disks
Formation of low mass protostars and their circumstellar disks
 
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43bNightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
Nightside clouds and disequilibrium chemistry on the hot Jupiter WASP-43b
 
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
Discovery of an Accretion Streamer and a Slow Wide-angle Outflow around FUOri...
 
Disentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOSTDisentangling the origin of chemical differences using GHOST
Disentangling the origin of chemical differences using GHOST
 
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
All-domain Anomaly Resolution Office U.S. Department of Defense (U) Case: “Eg...
 
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
PossibleEoarcheanRecordsoftheGeomagneticFieldPreservedintheIsuaSupracrustalBe...
 
Isotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on IoIsotopic evidence of long-lived volcanism on Io
Isotopic evidence of long-lived volcanism on Io
 
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroidsHubble Asteroid Hunter III. Physical properties of newly found asteroids
Hubble Asteroid Hunter III. Physical properties of newly found asteroids
 
Observational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive starsObservational constraints on mergers creating magnetism in massive stars
Observational constraints on mergers creating magnetism in massive stars
 
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
Observation of Gravitational Waves from the Coalescence of a 2.5–4.5 M⊙ Compa...
 
The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...
The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...
The SAMI Galaxy Sur v ey: galaxy spin is more strongly correlated with stella...
 
Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...
Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...
Is Betelgeuse Really Rotating? Synthetic ALMA Observations of Large-scale Con...
 
First Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPR
First Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPRFirst Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPR
First Direct Imaging of a Kelvin–Helmholtz Instability by PSP/WISPR
 
The Sun’s differential rotation is controlled by high- latitude baroclinicall...
The Sun’s differential rotation is controlled by high- latitude baroclinicall...The Sun’s differential rotation is controlled by high- latitude baroclinicall...
The Sun’s differential rotation is controlled by high- latitude baroclinicall...
 
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGN
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGNHydrogen Column Density Variability in a Sample of Local Compton-Thin AGN
Hydrogen Column Density Variability in a Sample of Local Compton-Thin AGN
 
Huygens - Exploring Titan A Mysterious World
Huygens - Exploring Titan A Mysterious WorldHuygens - Exploring Titan A Mysterious World
Huygens - Exploring Titan A Mysterious World
 
The Radcliffe Wave Of Milk Way is oscillating
The Radcliffe Wave Of Milk Way  is oscillatingThe Radcliffe Wave Of Milk Way  is oscillating
The Radcliffe Wave Of Milk Way is oscillating
 
Thermonuclear explosions on neutron stars reveal the speed of their jets
Thermonuclear explosions on neutron stars reveal the speed of their jetsThermonuclear explosions on neutron stars reveal the speed of their jets
Thermonuclear explosions on neutron stars reveal the speed of their jets
 

The false positive_rate_of_kepler_and_the_occurrence_of_planets

  • 1. Accepted for publication in the Astrophysical Journal on 2012 Dec, 15 - Submitted on 2012 Oct, 16 Preprint typeset using L TEX style emulateapj v. 5/2/11 A + THE FALSE POSITIVE RATE OF KEPLER AND THE OCCURRENCE OF PLANETS Francois Fressin1 , Guillermo Torres1 , David Charbonneau1 , Stephen T. Bryson2 , Jessie Christiansen2 , ¸ Courtney D. Dressing1 , Jon M. Jenkins2 , Lucianne M. Walkowicz3 and Natalie M. Batalha2 Accepted for publication in the Astrophysical Journal on 2012 Dec, 15 - Submitted on 2012 Oct, 16 ABSTRACT The Kepler Mission is uniquely suited to study the frequencies of extrasolar planets. This goal requires knowledge of the incidence of false positives such as eclipsing binaries in the background of arXiv:1301.0842v1 [astro-ph.EP] 4 Jan 2013 the targets, or physically bound to them, which can mimic the photometric signal of a transiting planet. We perform numerical simulations of the Kepler targets and of physical companions or stars in the background to predict the occurrence of astrophysical false positives detectable by the Mission. Using real noise level estimates, we compute the number and characteristics of detectable eclipsing pairs involving main sequence stars and non-main sequence stars or planets, and we quantify the fraction of those that would pass the Kepler candidate vetting procedure. By comparing their distribution with that of the Kepler Objects of Interest (KOIs) detected during the first six quarters of operation of the spacecraft, we infer the false positive rate of Kepler and study its dependence on spectral type, candidate planet size, and orbital period. We find that the global false positive rate of Kepler is 9.4%, peaking for giant planets (6–22 R⊕ ) at 17.7%, reaching a low of 6.7% for small Neptunes (2–4 R⊕ ), and increasing again for Earth-size planets (0.8–1.25 R⊕ ) to 12.3%. Most importantly, we also quantify and characterize the distribution and rate of occurrence of planets down to Earth size with no prior assumptions on their frequency, by subtracting from the population of actual Kepler candidates our simulated population of astrophysical false positives. We find that 16.5 ± 3.6% of main-sequence FGK stars have at least one planet between 0.8 and 1.25 R⊕ with orbital periods up to 85 days. This result is a significant step towards the determination of eta-earth, the occurrence of Earth-like planets in the habitable zone of their parent stars. There is no significant dependence of the rates of planet occurrence between 0.8 and 4 Earth radii with spectral type. In the process, we derive also a prescription for the signal recovery rate of Kepler that enables a good match to both the KOI size and orbital period distribution, as well as their signal-to-noise distribution. 1. INTRODUCTION Many of the most interesting candidate transiting plan- In February 2011, the Kepler Mission produced a cat- ets identified by the Kepler Mission cannot be confirmed alog of more than 1200 candidate transiting planets, re- with current spectroscopic capabilities, that is, by the ferred to as Kepler Objects of Interest (KOIs). These detection of the reflex motion of the star due to the candidates were identified during the first four months gravitational pull from the planet. Included in this cat- of operation of the spacecraft (Borucki et al. 2011) in its egory are essentially all Earth-size planets, as well as quest to determine the frequency of Earth-size planets most super-Earths that are in the habitable zone of Sun- around Sun-like stars. This unprecedented sample of po- like stars, which have Doppler signals too small to de- tential exoplanets has become an invaluable resource for tect. Faced with this difficulty, the approach adopted all manner of statistical investigations of the properties for such objects is statistical in nature and consists in and distributions of planets around main-sequence stars. demonstrating that the likelihood of a planet is much The most recent Kepler release expanded the sample to greater than that of a false positive, a process referred more than 2300 candidates (Batalha et al. 2012), based to as “validation”. The Kepler team has made extensive on 16 months of observation. The analysis of this infor- use of a technique referred to as BLENDER (Torres et al. mation must contend, however, with the fact that not all 2004, 2011; Fressin et al. 2011) to validate a number of photometric signals are caused by planets. Indeed, false KOIs, including the majority of the smallest known exo- positive contamination is typically the main concern in planets discovered to date. This technique requires an transit surveys (see, e.g., Brown 2003), including Kepler, accurate knowledge of the target star usually derived because there is a large array of astrophysical phenomena from spectroscopy or asteroseismology, and makes use that can produce small periodic dimmings in the light of of other follow-up observations for the candidate includ- a star that can be virtually indistinguishable from those ing high spatial resolution imaging, Spitzer observations due to a true planetary transit. A common example is when available, and high-resolution spectroscopy. The a background eclipsing binary falling within the photo- telescope facilities required to gather such observations metric aperture of a Kepler target. are typically scarce, so it is generally not possible to have these constraints for thousands of KOIs such as those in 1 Harvard-Smithsonian Center for Astrophysics, Cambridge, the recent list by Batalha et al. (2012). For this reason MA 02138, USA, ffressin@cfa.harvard.edu the Kepler team has concentrated the BLENDER efforts 2 NASA Ames Research Center, Moffett Field, CA 94035, USA only on the most interesting and challenging cases. 3 Princeton University, Princeton, NJ 08544, USA Based on the previous list of Kepler candi-
  • 2. 2 Fressin et al. dates by Borucki et al. (2011) available to them, troid of the target in and out of transit based the Kepler Morton & Johnson (2011) (hereafter MJ11) investigated images themselves, which can exclude all chance align- the false positive rate for KOIs, and its dependence on ments with background eclipsing binaries beyond a cer- some of the candidate-specific properties such as bright- tain angular separation. They adopted for this angular ness and transit depth. They concluded that the false separation a standard value of 2′′ based on the single positive rate is smaller than 10% for most of the KOIs, example of Kepler-10 b (Batalha et al. 2011), and then and less than 5% for more than half of them. This conclu- scaled this result for other KOIs assuming certain in- sion differs significantly from the experience of all previ- tuitive dependencies with target brightness and transit ous transit surveys, where the rates were typically up to depth. As it turns out, those dependencies are not borne an order of magnitude larger (e.g., ∼80% for the HAT- out by actual multi-quarter centroid analyses performed Net survey; Latham et al. 2009). The realization that since, and reported by Batalha et al. (2012); the Kepler rate is much lower than for the ground-based surveys allowed the community to proceed with statisti- 4. MJ11 did not consider the question of detectability of cal studies based on lists of mere candidates, without too both planets and false positives, and its dependence on much concern that false positive contamination might the noise level for each Kepler target star as well as on bias the results (e.g., Howard et al. 2012). the period and duration of the transit signals; In their analysis MJ11 made a number of simplifying assumptions that allowed them to provide these first es- 5. The overall frequency and distribution of eclipsing bina- timates of the false positive rate for Kepler, but that ries in the Kepler field, which enters the calculation of are possibly not quite realistic enough for the most in- the frequency of background blends, has been measured teresting smaller signals with lower signal-to-noise ratios directly by the Kepler Mission itself (Prˇa et al. 2011; s (SNRs). A first motivation for the present work is thus to Slawson et al. 2011), and is likely more accurate at pre- improve upon those assumptions and at the same time to dicting the occurrence of eclipsing binaries in the solar approach the problem of false positive rate determination neighborhood than inferences from the survey of non- in a global way, by numerically simulating the population eclipsing binary stars by Raghavan et al. (2010), used by of blends in greater detail for the entire sample of Kepler MJ11. targets. A second motivation of this paper is to use our improved estimates of the false positive rate to extract The above details have the potential to bias the esti- the true frequencies of planets of different sizes (down mates of the false positive rate by factors of several, espe- to Earth-size) as well as their distributions in terms of cially for the smaller candidates with low signal-to-noise host star spectral types and orbital characteristics, a goal ratios. Indeed, recent observational evidence suggests to which the Kepler Mission is especially suited to con- the false positive rate may be considerably higher than tribute. claimed by MJ11. In one example, Demory & Seager Regarding our first objective —the determination of (2011) reported that a significant fraction (14%) of the the false positive rate of Kepler — the assumptions by 115 hot Jupiter candidates they examined, with radii MJ11 we see as potentially having the greatest impact in the 8–22 R⊕ range, show secondary eclipses inconsis- on their results are the following: tent with the planetary interpretation. The false posi- tive rate implied by the MJ11 results for the same ra- 1. The MJ11 false positive rate is based on an assumed dius range is only 4%. Santerne et al. (2012) recently 20% planet occurrence, with a power law distribution conducted an extensive spectroscopic follow-up campaign of planet sizes between 0.5 and 20 R⊕ (peaking at small using the HARPS and SOPHIE spectrographs, and ob- planets) independently of the orbital period or stellar served a number of hot Jupiters with periods under 25 host characteristics. This is a rather critical hypothesis, days, transit depths greater than 0.4%, and host star as the frequency of the smallest or longest-period objects magnitudes Kp < 14.7 in the Kepler bandpass. They in the KOI sample is essentially unknown. The adoption reported finding a false positive rate of 34.8 ± 6.5%. In of a given planet frequency to infer the false positive rate contrast, the MJ11 results imply an average value of 2.7% in poorly understood regions of parameter space is risky, for the same period, depth, and magnitude ranges, which and may constitute circular reasoning; is a full order of magnitude smaller. 2. The scenarios MJ11 included as possible blends feature We point out that Morton (2012) recently published both background eclipsing binaries and eclipsing bina- an automated validation procedure for exoplanet tran- ries physically associated with the target. However, sit candidates based on a similar approach as the MJ11 other configurations that can also mimic transit signals work, with the improvement that it now considers back- were not considered, such as those involving larger plan- ground stars transited by planets as an additional source ets transiting an unseen physical companion or a back- of potential blends (item #2 above). However, the ground star. These configurations are more difficult Morton (2012) work does not quantify the global false to rule out with follow-up observations, and BLENDER positive rate of Kepler, but focuses instead on false studies have shown that such scenarios are often the alarm probabilities for individual transit candidates. The most common blend configuration for candidates that methodology makes use of candidate-specific information have been carefully vetted (see, e.g., Batalha et al. 2011; such as the transit shape that was not explicitly consid- Cochran et al. 2011; Ballard et al. 2011; Fressin et al. ered in the actual vetting procedure used by the Kepler 2012; Gautier et al. 2012; Borucki et al. 2012); team to generate the KOI list. Here we have chosen to emulate the Kepler procedures to the extent possible so 3. A key ingredient in the MJ11 study is the strong con- that we may make a consistent use of the KOI list as straint offered by the analysis of the motion of the cen- published.
  • 3. Kepler FPR and Occurrence of Planets 3 Regarding the second goal of our work —to determine of the target, rather than being systems comprised of true the rate of occurrence of planets of different sizes— a Earth-size planets. main concern that such statistical studies must neces- Our two objectives, the determination of the false pos- sarily deal with is the issue of detectability. Only a itive rate for Kepler and the determination of the occur- fraction of the planets with the smallest sizes and the rence rate of planets in different size ranges, are in fact longest periods have passed the detection threshold of interdependent, as described below. We have organized Kepler. The studies of Catanzarite & Shao (2011) and the paper as follows. In Sect. 2 we summarize the gen- Traub (2012), based on the KOI catalog of Borucki et al. eral approach we follow. The details of how we simulate (2011), assumed the sample of Neptune-size and Jupiter- false positives and quantify their frequency are given in size planets to be both complete (i.e., that all transiting Sect. 3, separately for each type of blend scenario includ- planets have been detected by Kepler ) and essentially ing background or physically associated stars eclipsed by free of contamination (i.e., with a negligible false posi- another star or by a planet. At the end of this section we tive rate). They then extrapolated to infer the occur- provide an example of the calculation for one of those sce- rence rate of Earth-size planets assuming it follows the narios. Sect. 4 is a summary of the false positive rates for occurrence vs. size trend of larger planets. The more planets of different sizes in the Kepler sample as a whole. detailed study of Howard et al. (2012) focused on the In Sect. 5 we describe the study of the Kepler detection sub-sample of KOIs for which Kepler is closer to com- rate, and how we estimate the frequencies of planets. We pleteness, considering only planets larger than 2 R⊕ and derive the occurrence of planets in different size ranges in with periods shorter than 50 days. They tackled the issue Sect. 6, where we examine also the dependence of the fre- of detectability by making use of the Combined Differ- quencies on the spectral type (mass) of the host star and ential Photometric Precision (CDPP) estimates for each other properties. In Sect. 7, we study the distribution of KOI to determine the completeness of the sample. The the transit durations of our simulated planet population. CDPP is designed to be an estimator of the noise level of We discuss the assumptions and possible improvements Kepler light curves on the time scale of planetary tran- of our study in Sect. 8, and conclude by listing our main sits, and is available for each star Kepler has observed. results in Sect. 9. Finally, an Appendix describes how we Howard et al. (2012) found a rapid increase in planet oc- model the detection process of Kepler and how we infer currence with decreasing planet size that agrees with the the exclusion limit from the centroid motion analysis for predictions of the core-accretion formation scenario, but each individual target. disagrees with population synthesis models that predict 2. GENERAL APPROACH a dearth of objects with super-Earth and Neptune sizes for close-in orbits. They also reported that the occur- 2.1. False positive simulation rence of planets between 2 and 4 R⊕ in the Kepler field The first part of our analysis is the calculation of increases linearly with decreasing effective temperature the false positive rate for Kepler targets. We consider of the host star. Their results rely to some degree on here all Kepler targets that have been observed in at the conclusions of MJ11 regarding the rate of false pos- least one out of the first six quarters of operation of itives of the sub-sample of KOIs they studied, and it is the spacecraft (Q1–Q6)4 , to be consistent with the se- unclear how the issues enumerated above might affect lection imposed on the Batalha et al. (2012) list, which them. In an independent study Youdin (2011) devel- comes to a total of 156,453 stars. For each of these oped a method to infer the underlying planetary distri- stars, identified by a unique Kepler Input Catalog (KIC) bution from that of the Kepler candidates published by number (Brown et al. 2011), we have available their es- Borucki et al. (2011). He investigated the occurrence of timated mass, surface gravity log g, and brightness in planets down to 0.5 R⊕ , but without considering false the Kepler bandpass (Kp magnitude) as listed in the positives, and relied on the simplified Kepler detection KIC5 . Additionally we have their CDPP values on dif- efficiency model from Howard et al. (2012). He reported ferent timescales and for different quarters, provided in a significant difference in the size distributions of shorter the Mikulski Archive for Space Telescopes (MAST) at (P < 7 days) and longer period planets. STScI.6 The CDPP allows us to estimate the detectabil- Aside from the question of detectability, we emphasize ity of each type of false positive scenario. that the occurrence rate of Earth-size planets is still effec- We perform Monte Carlo simulations specific to each tively unknown. Kepler is the only survey that has pro- Kepler target to compute the number of blends of differ- duced a list of Earth-size planet candidates, and because ent kinds that we expect. The calculations are based on of the issues raised above, strictly speaking we cannot realistic assumptions about the properties of objects act- yet rule out that the false positive rate is much higher for ing as blends, informed estimates about the frequencies such challenging objects than it is for larger ones, even of such objects either in the background or physically as high as 90%. Indeed, MJ11 predicted false positive associated with the target, the detectability of the sig- rates with the assumption of an overall 20% planet fre- nals produced by these blends for the particular star in quency, peaking precisely for the smallest planets. The question based on its CDPP, and also on the ability to re- question of the exact rate of occurrence of small planets ject blends based on constraints provided by the centroid has implications for other conclusions from Kepler. For 4 A quarter corresponds to a period of observation of about example, Lissauer et al. (2012) have recently shown that most KOIs with multiple candidates (‘multis’) are bona- three months between 90◦ spacecraft rolls designed to keep the solar panels properly illuminated. fide planets. However, if Earth-size planets are in fact 5 A number of biases are known to exist in the properties listed rare, then most multis involving Earth-size candidates in the KIC, the implications of which will be discussed later in would likely correspond to larger objects transiting (to- Section 8. 6 http://stdatu.stsci.edu/kepler/ gether) the same unseen star in the photometric aperture
  • 4. 4 Fressin et al. motion analysis or the presence of significant secondary than 2 R⊕ , in which they claimed that the smallest plan- eclipses. Blended stars beyond a certain angular distance ets (small Neptunes) are more common among later-type from the target can be detected in the standard vetting stars, but did not find the same trend for larger plan- procedures carried out by the Kepler team because they ets. Our detailed modeling of the planet detectability cause changes in the flux-weighted centroid position, de- for each KOI, along with our new estimates of the false pending on the properties of the signal. This constraint, positive rates among small planets, enables us to extend and how we estimate it for each star, is described more the study to two smaller classes of planets, and to inves- fully in the Appendix. The inclusion of detectability and tigate whether the claims by Howard et al. (2012) hold the constraint from centroid motion analysis are included for objects as small as the Earth. in order to emulate the vetting process of Kepler to a reasonable degree. 3. FALSE POSITIVES There exists a wide diversity of astrophysical phenom- 2.2. Planet occurrence simulation ena that can lead to periodic dimming in the light curve The second part of our analysis seeks to determine the of a Kepler target star and might be interpreted as a sig- rate of occurrence of planets of different sizes. For this nal from a transiting planet. These phenomena involve we compare the frequencies and distributions of simu- other stars that fall within the photometric aperture of lated false positives from the previous step with those of the target and contribute light. One possibility is a back- the real sample of KOIs given by Batalha et al. (2012). ground star (either main-sequence or giant) eclipsed by We interpret the differences between these two distribu- a smaller object; another is a main-sequence star physi- tions as due to true detectable planets. In order to obtain cally associated with the target and eclipsed by a smaller the actual rate of occurrence of planets it is necessary to object. In both cases the eclipsing body may be either a correct for the fact that the KOI list includes only planets smaller star or a planet. There are thus four main cat- with orbital orientations such that they transit, and for egories of false positives, which we consider separately the fact that some fraction of transiting planets are not below. We discuss also the possibility that the eclipsing detectable by Kepler. We determine these corrections objects may be brown dwarfs or white dwarfs. by means of Monte-Carlo simulations of planets orbit- We make here the initial assumption that all KOIs that ing the Kepler targets, assuming initial distributions of have passed the nominal detection threshold of Kepler planets as a function of planet size and orbital period are due to astrophysical causes, as opposed to instru- (Howard et al. 2012). We then compare our simulated mental causes such as statistical noise or aliases from population of detectable transiting planets with that of transient events in the Kepler light curve. The expected KOIs minus our simulated false positives population, and detection rate (Dr ) based on a matched filter method we adjust our initial assumptions for the planetary distri- used in the Kepler pipeline is discussed by Jenkins et al. butions. We proceed iteratively to obtain the occurrence (1996). It is given by the following formula, as a function of planets that provides the best match to the KOI list. of the signal-to-noise ratio (SNR) of the signal and a cer- This analysis permits us to also study the dependence tain threshold, assuming that the (possibly non-white) of the occurrence rates on properties such as the stellar observational noise is Gaussian: mass. √ For the purpose of interpreting both the false posi- Dr (SNR) = 0.5 + 0.5 erf (SNR − 7.1)/ 2 . (1) tive rates and the frequencies of planets, we have chosen to separate planets into five different size (radius Rp ) In this expression erf is the standard error function, and ranges: the adopted 7.1σ threshold was chosen so that no more than one false positive will occur over the course of the • Giant planets: 6 R⊕ < Rp ≤ 22 R⊕ Mission due to random fluctuations (see Jenkins et al. 2010). The formula corresponds to the cumulative prob- • Large Neptunes: 4 R⊕ < Rp ≤ 6 R⊕ ability density function of a zero mean, unit variance Gaussian variable, evaluated at a point representing the • Small Neptunes: 2 R⊕ < Rp ≤ 4 R⊕ distance between the threshold and the SNR. The con- • Super-Earths: 1.25 R⊕ < Rp ≤ 2 R⊕ struction of the matched filter used in the Kepler pipeline is designed to yield detection statistics drawn from this • Earths: 0.8 R⊕ < Rp ≤ 1.25 R⊕ distribution. With this in mind, only 50% of transits with an SNR of 7.1 will be detected. The detection rates An astrophysical false positive that is not ruled out are 2.3%, 15.9%, 84.1%, 97.7%, and 99.9% for SNRs of by centroid motion analysis and that would produce a 5.1, 6.1, 8.1 9.1, and 10.1. Below in Sect. 5.1 we will im- signal detectable by Kepler with a transit depth corre- prove upon this initial detection model, but we proceed sponding to a planet in any of the above categories is for now with this prescription for clarity. counted as a viable blend, able to mimic a true planet Our simulation of the vetting process carried out by in that size range. These five classes were selected as a the Kepler team includes two tests that the astrophysi- compromise between the number of KOIs in each group cal scenarios listed above must pass in order to be con- and the nomenclature proposed by the Kepler team sidered viable false positives (that could be considered a (Borucki et al. 2011; Batalha et al. 2012). In particular, planet candidate): the most important is that they must we have subdivided the 2–6 R⊕ category used by the Ke- not produce a detectable shift in the flux centroid. Ad- pler team into “small Neptunes” and “large Neptunes”. ditionally, for blends involving eclipsing binaries, they Howard et al. (2012) chose the same separation in their must not lead to secondary eclipses that are of similar recent statistical study of the frequencies of planets larger depth as the primary eclipses (within 3σ).
  • 5. Kepler FPR and Occurrence of Planets 5 3.1. Background eclipsing binaries from the Slawson et al. (2011) catalog. We then check We begin by simulating the most obvious type of false whether the transit-like signal produced by this compan- positive involving a background (or foreground) eclips- ion has the proper depth to mimic a planet with a radius ing binary in the photometric aperture of the target star, in the size category under consideration (Sect. 2.2), after whose eclipses are attenuated by the light of the target dilution by the light of the Kepler target (which depends and reduced to a shallow transit-like event. We model on the known brightness of the target and of the back- the stellar background of each of the 156,453 KIC targets ground star). If it does not, we reject it as a possible using the Besan¸on model of the Galaxy (Robin et al. c blend. Next we compute the SNR of the signal as de- scribed in the Appendix, and we determine whether this 2003).7 We simulate 1 square degree areas at the center false-positive would be detectable by the Kepler pipeline of each of the 21 modules comprising the focal plane of as follows. For each KIC target observed by Kepler be- Kepler (each module containing two CCDs), and record tween Q1 and Q6 we compute the SNR as explained in all stars (of any luminosity class) down to 27th magni- the Appendix, and with Eq.[1] we derive Dr . We then tude in the R band, which is sufficiently close to Kepler ’s draw a random number between 0 and 1 and compare it Kp band for our purposes. This limit of R = 27 corre- with Dr ; if larger, we consider the false positive signal to sponds to the faintest eclipsing binary that can mimic a be detectable. We also estimate the fraction of the Nbs transit by an Earth-size planet, after dilution by the tar- stars that would be missed in the vetting carried out by get. We assign to each Kepler target a background star the Kepler team because they do not induce a measur- drawn randomly from the simulated area for the module able centroid motion. This contribution, Fcent , is simply it is located on, and we assign also a coefficient Nbs to the fraction of background stars interior to the exclusion represent the average number of background stars down limit set by the centroid analysis. As we describe in more to 27th magnitude in the 1 square degree region around detail in the Appendix, the size of this exclusion region the target. is itself mainly a function of the SNR. Only a fraction of these background stars will be Based on all of the above, we compute the number Nbeb eclipsing binaries, and we take this fraction from re- of background (or foreground) eclipsing binaries in the sults of the Kepler Mission itself, reported in the work of Kepler field that could mimic the signal of a transiting Slawson et al. (2011). The catalog compiled by these au- planet in a given size range as thors provides not only the overall rate of occurrence of eclipsing binaries (1.4%, defined as the number of eclips- Ntarg ing binaries found by Kepler divided by the number of Nbeb = Nbs Feb Cbf Cgeom Tdepth Tdet Tsec Fcent , Kepler targets), but also their eclipse depth and period i=1 distributions. In practice we adopt a somewhat reduced frequency of eclipsing binaries Feb = 0.79% that excludes where F terms represent fractions, C terms represent contact systems, as these generally cannot mimic a true corrections to the eclipsing binary frequency, and T terms planetary transit because the flux varies almost contin- are either 0 or 1: uously throughout the orbital cycle. We consider only • Ntarg is the number of Kepler targets observed for detached, semi-detached, and unclassified eclipsing bina- at least one quarter during the Q1–Q6 observing ries from the Slawson et al. (2011) catalog. Furthermore, interval considered here; we apply two additional corrections to this overall fre- quency that depend on the properties of the background • Nbs is the average number of background stars star. The first factor, Cbf , adjusts the eclipsing binary down to magnitude R = 27 in a 1 square degree frequency according to spectral type, as earlier-type stars area around the target star; have been found to have a significantly higher binary frequency than later-type stars. We interpolate this cor- • Feb is the frequency of eclipsing binaries, defined as rection from Figure 12 of Raghavan et al. (2010). The the fraction of eclipsing binaries (excluding contact second factor, Cgeom , accounts for the increased chance systems) found by Kepler divided by the number that a larger star in the background would be eclipsed of Kepler targets, and is 0.79% for this work; by a companion of a given period, from geometrical con- • Cbf is a correction to the binary occurrence rate siderations. This correction factor is unity for a star of Feb that accounts for the dependence on stellar 1 R⊙ , and increases linearly with radius. We note that mass (or spectral type), following Raghavan et al. the scaling law we have chosen does not significantly im- (2010); pact the overall occurrence of eclipsing binaries estab- lished by Slawson et al. (2011), as the median radius of • Cgeom is a correction to the geometric probability a Kepler target (0.988 R⊙ ) is very close to 1 R⊙ . The size of eclipse that takes into account the dependence of each of our simulated background stars is provided by on the size of the background star; the Besan¸on model. c Next, we assign a companion star to each of the 156,453 • Tdepth is 1 if the signal produced by the background background stars drawn from the Besan¸on model, with c eclipsing binary has a depth corresponding to a properties (eclipse depth, orbital period) taken randomly planet in the size range under consideration, af- ter accounting for dilution by the Kepler target, or 7 The accuracy of the stellar densities (number of stars per 0 otherwise; square degree) per magnitude bin provided by this model has been tested against actual star counts at the center of the Kepler • Tdet is 1 if the signal produced by the back- field. The results have been found to be consistent within 15% (R. ground eclipsing binary is detectable by the Kepler Gilliland, priv. comm.). pipeline, or 0 otherwise;
  • 6. 6 Fressin et al. • Tsec is 1 if the background eclipsing binary does not as: show a secondary eclipse detectable by the Kepler Ntarg pipeline, or 0 otherwise. We note that we have also set Tsec to 1 if the simulated secondary eclipses have Nbtp = Nbs Ftp Ctpf Cgeom Tdepth Tdet Fcent , a depth within 3σ of that of the primary eclipse, as i=1 this could potentially be accepted by the pipeline where Ntarg , Nbs , Cgeom , Tdepth, Tdet , and Fcent have as a transiting object, with a period that is half of similar meanings as before, and the binary period; • Ftp is the frequency of transiting planets in the size class under consideration, computed as the frac- • Fcent is the fraction of the Nbs background stars tion of suitable KOIs found by Kepler divided by that would show no significant centroid shifts, i.e., the number of non-giant Kepler targets (defined that are interior to the angular exclusion region as those with KIC values of log g > 3.6, following that is estimated for each star, as explained in the Brown et al. 2011)8 ; Appendix. • Ctpf is a correction factor to the transiting planet frequency Ftp that accounts for the three biases 3.2. Background stars transited by planets mentioned above: incompleteness, the false posi- A second type of astrophysical scenario that can re- tive rate among the KOIs of Batalha et al. (2012), produce the shape of a small transiting planet consists and the potential dependence of Ftp on the spectral of a larger planet transiting a star in the background type of the background star. (or foreground) of the Kepler target. We proceed in a similar way as for background eclipsing binaries, ex- 3.3. Companion eclipsing binaries cept that instead of assigning a stellar companion to each An eclipsing binary gravitationally bound to the tar- background star, we assign a random planetary compan- get star (forming a hierarchical triple system) may also ion drawn from the actual list of KOIs by Batalha et al. mimic a transiting planet signal since its eclipses can (2012). However, there are three factors that prevent be greatly attenuated by the light of the target. The us from using this list as an unbiased sample of transit- treatment of this case is somewhat different from the ing planets: (1) the list is incomplete, as the shallowest one of background binaries, though, as the occurrence and longest period signals are only detectable among the rate of intruding stars eclipsed by others is independent Kepler targets in the most favorable cases; (2) the list of the Galactic stellar population. To simulate these sce- contains an undetermined number of false positives; and narios we require knowledge of the frequency of triple (3) the occurrence of planets of different sizes may be systems, which we adopt from the results of a volume- correlated with the spectral type of the host star (see limited survey of the multiplicity of solar-type stars by Howard et al. 2012). Raghavan et al. (2010). They found that 8% of their While it is fairly straightforward to quantify the in- sample stars are triples, and another 3% have higher completeness by modeling the detectability of signals for multiplicity. We therefore adopt a total frequency of the individual KOIs (see the Appendix for a description 8% + 3% = 11%, since additional components beyond of the detection model we use), the biases (2) and (3) three merely produce extra dilution, which is small in are more difficult to estimate. We adopt a bootstrap any case compared to that of the Kepler target itself (as- approach in which we first determine the false positive sumed to be the brighter star). Of all possible triple con- rate and the frequencies for the larger planets among figurations we consider only those in which the tertiary the KOIs, and we then proceed to study successively star eclipses the close secondary, with the primary star smaller planets. The only false positive sources for the being farther removed (hierarchical structure). The two larger planet candidates involve even larger (that is, stel- other configurations involving a secondary star eclipsing lar) objects, for which the frequency and distributions of the primary or the tertiary eclipsing the primary would the periods and eclipse depths are well known (i.e., from typically not result in the target being promoted to a the work of Slawson et al. 2011). Comparing the pop- KOI, as the eclipses would be either very deep or ‘V’- ulation of large-planet KOIs (of the ‘giant planet’ class shaped. Thus we adopt 1/3 of 11% as the relevant fre- previously defined) to the one of simulated false posi- quency of triple and higher-multiplicity systems. tives that can mimic their signals enables us to obtain We proceed to simulate this type of false positive by the false positive correction factor (2). This, in turn, assigning to each Kepler target a companion star (“sec- allows us to investigate whether the frequency of large ondary”) with a random mass ratio q (relative to the tar- planets is correlated with spectral type (see Sect. 6), and get), eccentricity e, and orbital period P drawn from the therefore to address bias (3) above. As larger planets distributions presented by Raghavan et al. (2010), which transiting background stars are potential false positive are uniform in q and e and log-normal in P . We further sources for smaller planets, once we have understood the infer the radius and brightness of the secondary in the giant planet population we may proceed to estimate the Kp band from a representative 3 Gyr solar-metallicity false positive rates and occurrence of each of the smaller isochrone from the Padova series of Girardi et al. (2000). planet classes in order of decreasing size, applying correc- We then assign to each of these secondaries an eclips- tions for the biases in the same way as described above. ing companion, with properties (period, eclipse depth) We express the number Nbtp of background stars in the 8 We do not consider background giant stars transited by a Kepler field with larger transiting planets able to mimic the signal of a true transiting planet in a given size class planet as a viable blend, as the signal would likely be undetectable.
  • 7. Kepler FPR and Occurrence of Planets 7 taken from the catalog of Kepler eclipsing binaries by binaries slightly diluted by the light of a compan- Slawson et al. (2011), as we did for background eclipsing ion, and we assume here that those configurations binaries in Sect. 3.1. We additionally assign a mass ratio would not have passed the Kepler vetting proce- and eccentricity to the tertiary from the above distribu- dure. tions. To determine whether each of these simulated triple • Tstab is 1 if the triple system is dynamically stable, configurations constitutes a viable blend, we perform the and 0 otherwise; following four tests: (1) We check whether the triple sys- • Tcent is 1 if the triple system does not induce a tem would be dynamically stable, to first order, using significant centroid motion, and 0 otherwise. the condition for stability given by Holman & Wiegert (1999); (2) we check whether the resulting depth of the 3.4. Companion transiting planets eclipse, after accounting for the light of the target, cor- responds to the depth of a transiting planet in the size Planets transiting a physical stellar companion to a range under consideration; (3) we determine whether the Kepler target can also produce a signal that will mimic transit-like signal would be detectable, with the same that of a smaller planet around the target itself. One signal-to-noise criterion used earlier; and (4) we check possible point of view regarding these scenarios is to not whether the eclipsing binary would be angularly sepa- consider them a false positive, as a planet is still present rated enough from the target to induce a centroid shift. in the system, only not of the size anticipated from the For this last test we assign a random orbital phase to the depth of the transit signal. However, since one of the secondary in its orbit around the target, along with a goals of the present work is to quantify the rate of occur- random inclination angle from an isotropic distribution, rence of planets in specific size categories, a configuration a random longitude of periastron from a uniform distri- of this kind would lead to the incorrect classification of bution, and a random eccentricity drawn from the dis- the object as belonging to a smaller planet class, biasing tribution reported by Raghavan et al. (2010). The semi- the rates of occurrence. For this reason we consider these major axis is a function of the known masses and the scenarios as a legitimate false positive. orbital period. The final ingredient is the distance to To quantify them we proceed in a similar way as for the system, which we compute using the apparent mag- companion eclipsing binaries in the preceding section, as- nitude of the Kepler target as listed in the KIC and the signing a bound stellar companion to each target in ac- absolute magnitudes of the three stars from the Padova cordance with the frequency and known distributions of isochrone, ignoring extinction. If the resulting angular binary properties from Raghavan et al. (2010), and as- separation is smaller than the radius of the exclusion re- signing a transiting planet to this bound companion us- gion from centroid motion analysis, we count this as a ing the KOI list by Batalha et al. (2012). Corrections to viable blend. the rates of occurrence Ftp are as described in Sect. 3.2. The number Nceb of companion eclipsing binary con- The total number of false positives of this kind among figurations in the Kepler field that could mimic the signal the Kepler targets is then of a transiting planet in each size category is given by: Ntarg Ntarg Nctp = Fbin Ftp Cgeom Ctpf Tdepth Tdet Tcent Tstab , Nceb = Feb/triple Cgeom Tstab Tdepth Tdet Tsec Tcent , i=1 i=1 where Fbin is the frequency of non-single stars in the in which Ntarg , Cgeom , Tdepth, Tsec and Tdet represent the solar neighborhood (44%; Raghavan et al. 2010) and the same quantities as in previous sections, while remaining symbols have the same meaning as before. Adding up the contributions from this section and • Feb/triple is the frequency of eclipsing binaries in the preceding three, the total number of false positives triple systems (or systems of higher multiplicity) in the Kepler field due to physically-bound or back- in which the secondary star is eclipsed by the ground/foreground stars eclipsed by a smaller star or by tertiary. As indicated earlier, we adopt a fre- a planet is Nbeb + Nbtp + Nceb + Nctp . quency of eclipsing binaries in the solar neighbor- hood of Feb = 0.79%, from the Kepler catalog by 3.5. Eclipsing pairs involving brown dwarfs and white Slawson et al. (2011). The frequency of stars in dwarfs binaries and higher-multiplicity systems has been Because of their small size, brown dwarfs transiting a given by Raghavan et al. (2010) as Fbin = 44% star can produce signals that are very similar to those (where we use the notation ‘bin’ to mean ‘non- of giant planets. Therefore, they constitute a potential single’). Therefore, the chance of a random pair of source of false positives, not only when directly orbiting stars to be eclipsing is Feb /Fbin = 0.0079/0.44 = the target but also when eclipsing a star blended with the 0.018. As mentioned earlier we will assume here target (physically associated or not). However, because that one third of the triple star configurations of their larger mass and non-negligible luminosity, other (with a frequency Ftriple of 11%, according to evidence would normally betray the presence of a brown Raghavan et al. 2010) have the secondary and ter- dwarf, such as ellipsoidal variations in the light curve tiary as the close pair of the hierarchical system. (for short periods), secondary eclipses, or even measur- Feb/triple is then equal to Feb /Fbin × 1/3 × Ftriple = able velocity variations induced on the target star. Pre- 0.018×1/3×0.11 = 0.00066. The two other config- vious Doppler searches have shown that the population urations in which the secondary or tertiary is eclips- of brown dwarf companions to solar-type stars is signif- ing the primary would merely correspond to target icantly smaller than that of true Jupiter-mass planets.
  • 8. 8 Fressin et al. Based on this, we expect the incidence of brown dwarfs to the median ∼1 R⊙ Kepler target. The corresponding as false positives to be negligible, and we do not consider correction factor is then Cgeom = 0.707. them here. The correction Ctpf to the rate of occurrence of tran- A white dwarf transiting a star can easily mimic the siting planets acting as blends is the most complicated to signal of a true Earth-size planet, since their sizes are estimate, as it relies in part on similar calculations car- comparable. Their masses, on the other hand, are of ried out sequentially from the larger categories of planets, course very different. Some theoretical predictions sug- starting with the giant planets, as described earlier. For gest that binary stars consisting of a white dwarf and a this example we make use of the results of those calcu- main-sequence star may be as frequent as main-sequence lations presented in Sect. 5. The Ctpf factor corrects the binary stars (see, e.g., Farmer & Agol 2003). However, occurrence rate of planets in the 1.25–2 R⊕ super-Earth evidence so far from the Kepler Mission seems to indi- category (since this is the relevant class for the particu- cate that these predictions may be off by orders of mag- lar blended planet we have simulated) for three distinct nitude. For example, none of the Earth-size candidates biases in the KOI list of Batalha et al. (2012), due to in- that have been monitored spectroscopically to determine completeness, false positives, and the possible correlation their radial velocities have shown any indication that the of frequency with spectral type. companion is as massive as a white dwarf. Also, while For our simulated planet with Rp = 1.81 R⊕ and transiting white dwarfs with suitable periods would be P = 13.7 days, the incompleteness contribution to Ctpf expected to produce many easily detectable gravitational was estimated by computing the SNR of the signals gen- lensing events if their frequency were as high as predicted erated by a planet of this size transiting each individual by Farmer & Agol (2003), no such magnification events Kepler target, using the CDPP of each target. We find compatible with lensing have been detected so far in the that the transit signal could only be detected around Kepler photometry (J. Jenkins, priv. comm.). For these 54.1% of the targets, from which the incompleteness reasons we conclude that the relative number of eclipsing boost is simply 1/0.541 = 1.848. The contribution to white dwarfs among the 196 Earth-size KOIs is negligi- Ctpf from the false positive bias was taken from Table 1 ble in comparison to other contributing sources of false of Sect. 4 below, which summarizes the results from the positives. bootstrap analysis mentioned earlier. That analysis in- dicates that the false positive rate for super-Earths is 3.6. Example of a false positive calculation 8.8%, i.e., 91.2% of the KOIs in the 1.25–2 R⊕ are tran- To illustrate the process of false positive estimation sited by true planets. Finally, the third contributor to for planets of Earth size (0.8 R⊕ < Rp ≤ 1.25 R⊕ ), we Ctpf addresses the possibility that small Neptunes from present here the calculation for a single case of a false pos- the KOI list acting as blends may be more common, itive involving a larger planet transiting a star physically for example, around stars of later spectral types than bound to the Kepler target (Sect. 3.4), which happens to earlier spectral types. Our analysis in Sect. 6.4 in fact produce a signal corresponding to an Earth-size planet. suggests that super-Earths have a uniform occurrence In this case selected at random, the target star (KIC as a function of spectral type, and we account for this 3453569) has a mass of 0.73 M⊙ , a brightness of Kp = absence of correlation by setting a value of 1.0 for the 15.34 in Kepler band, and a 3-hour CDPP of 133.1 spectral-type dependence correction factor. Combining ppm. As indicated above, the star has a 44% a priori the three factors just described, we arrive at a value of chance of being a multiple system based on the work of Ctpf = 1.848 × 0.912 × 1.0 = 1.685. Raghavan et al. (2010). We assign to the companion a To ascertain whether this particular false positive could mass and an orbital period drawn randomly from the have been detected by Kepler, we compute its signal-to- corresponding distributions reported by these authors, noise ratio as explained in the Appendix, in terms of with values of 0.54 M⊙ and 67.6 yr, corresponding to a the size of the small Neptune relative to the physically- semimajor axis of a = 18.0 AU. bound companion, the dilution factor from the target, This companion has an a priori chance Ftp of having a the CDPP of the target, the number of transits observed transiting planet equal to the number of KOIs (orbiting (from the duration interval for the particular Kepler tar- non-giant stars) divided by the number of non-giant stars get divided by the period of the KOI), and the transit (Ftp = 2,302/132,756 = 0.017) observed by Kepler in at duration as reported by Batalha et al. (2012). The re- least one quarter during the Q1–Q6 interval. We assign sult, SNR = 9.44 is above the threshold (determined to this companion star a transiting planet drawn ran- using the detection recovery rate studied in section 5.1), domly from the Batalha et al. (2012) list of KOIs, with so the signal would be detectable and we set Tdet = 1. a radius of 1.81 R⊕ (a super-Earth) and an orbital pe- With the above SNR we may also estimate the an- riod of 13.7 days. The diluted depth of the signal of this gular size of the exclusion region outside of which the transiting planet results in a 140 ppm dimming of the multi-quarter centroid motion analysis would rule out a combined flux of the two stars during the transit, that blend. The value we obtain for the present example with ′′ could be incorrectly interpreted as an Earth-size planet the prescription given in the Appendix is 1. 1. To es- (0.91 R⊕ ) transiting the target star. We set Tdepth = 1, tablish whether the companion star is inside or outside as this example pertains to false positives mimicking of this region, we compute its angular separation from transits of Earth-size planets. Because of the smaller ra- the target as follows. First we estimate the distance to dius of the stellar companion (Rcomp = 0.707 R⊙ , deter- the system from the apparent magnitude of the target mined with the help of our representative 3-Gyr, solar- (Kp = 15.34) and the absolute magnitudes of the two metallicity isochrone), that star has a correspondingly stars, read off from our representative isochrone accord- smaller chance that the planet we have assigned to or- ing to their masses of 0.73 and 0.54 M⊙ . Ignoring ex- bit it will actually transit at the given period, compared tinction, we obtain a distance of 730 pc. Next we place
  • 9. Kepler FPR and Occurrence of Planets 9 the companion at a random position in its orbit around 29.3 ± 3.1%, in good agreement with their observational the target, using the known semi-major axis of 18.0 AU result. This value is significantly larger than our over- and the random values of other relevant orbital elements all figure of 17.7% for giant planets because of the cut (eccentricity, inclination angle, longitude of periastron) at P < 25 days and the fact that the false positive rate as reported above. With this, the angular separation increases somewhat toward shorter periods, according to ′′ is 0. 02. This is about a factor of 50 smaller than the our simulations (see Sect. 6.1). limit from centroid motion analysis, so we conclude this For large Neptunes we find that the false positive rate false positive would not be detected in this way (i.e., it decreases somewhat to 15.9%. This is due mainly to remains a viable blend). We therefore set Tcent = 1. the lower incidence of blends from hierarchical triples, Finally, the formalism by Holman & Wiegert (1999) which can only mimic the transit depth of planets orbit- indicates that, to first order, a hierarchical triple system ing the largest stars in the sample, and to the relatively like the one in this example would be dynamically un- higher frequency of planets of this size in comparison stable if the companion star were within 1.06 AU of the to giant planets. The false positive rate decreases fur- target. As their actual mean separation is much larger ther for small Neptunes and super-Earths, and rises again (a = 18.0 AU), we consider the system to be stable and for Earth-size planets. The overall false positive rate of we set Tstab = 1. Kepler we find by combining all categories of planets is Given the results above, this case is therefore a viable 9.4%. false positive that could be interpreted as a transiting All of these rates depend quite strongly on how well Earth-size planet. The chance that this would happen we have emulated the vetting process of Kepler. We for the particular Kepler target we have selected is given may assess this as follows. We begin by noting that by the product Fbin × Ftp × Cgeom × Ctpf × Tdepth × Tdet × before the vetting process is applied, the majority of Tcent ×Tstab = 44%×0.017×0.707×(1.848×0.912×1.0)× false positives are background eclipsing binaries. To es- 1 × 1 × 1 × 1 = 0.0089. We performed similar simulations timate their numbers, we tallied all such systems falling for each of the other Kepler targets observed between Q1 within the photometric aperture of a Kepler target and and Q6, and found that larger planets transiting unseen contributing more than 50% of their flux (755 cases). companions to the targets are the dominant type of blend Of these, 465 pass our secondary eclipse test (i.e., they in this size class, and should account for a total of 16.3 present secondary eclipses that are detectable, and that false positives that might be confused with Earth-size have a depth differing by more than 3σ from the pri- planets. Doing the same for all other types of blends that mary eclipse). After applying the centroid test we find might mimic Earth-size planets leads to an estimate of that only 44.7 survive as viable blends. In other words, 24.1 false positives among the Kepler targets. our simulated application of the centroid test rules out 465 − 44.7 ≈ 420 eclipsing binary blends. We may com- 4. RESULTS FOR THE FALSE POSITIVE RATE OF pare this with results from the actual vetting of Kepler KEPLER as reported by Batalha et al. (2012), who indicated that The output of the simulations described above is sum- 1093 targets from an initial list of 1390 passed the cen- marized in Table 1, broken down by planet class and troid test and were included as KOIs, and were added also by the type of scenario producing the false positive. to the list of previously vetted KOIs from Borucki et al. We point out here again that these results are based on (2011). The difference, 1390 − 1093 = 297, is of the same a revision of the detection model of Kepler (Sect. 5.1) order of magnitude as our simulated results (∼420), pro- rather than the a priori detection rate described in Sec- viding a sanity check on our background eclipsing binary tion 3. An interesting general result is that the dominant occurrence rates as well as our implementation of the source of false positives for all planet classes involves not vetting process. This exercise also shows that the cen- eclipsing binaries, but instead large planets transiting an troid test is by far the most effective for weeding out unseen companion to the Kepler target. This type of blends. Even ignoring the test for secondary eclipses, scenario is the most difficult to rule out in the vetting the centroid analysis is able to bring down the number process performed by the Kepler team. of blends involving background eclipsing binaries to only For giant planets our simulations project a total of 39.5 83.3 out of the original 755 in our simulations, represent- false positives among the Kepler targets, or 17.7% of ing a reduction of almost 90%. the 223 KOIs that were actually identified in this size Due to the complex nature of the simulations it is non- category. This is significantly higher than the estimate trivial to assign uncertainties to the false positive rates from MJ11, who predicted a less than 5% false positive reported in Table 1, and the values listed reflect our best rate for this kind of objects. The relatively high fre- knowledge of the various sources that may contribute. quency of false positives we obtain is explained by the Many of the ingredients in our simulations rely on counts inherently low occurrence of giant planets in comparison based directly on Kepler observations, such as the KOI to the other astrophysical configurations that can mimic list and the Kepler eclipsing binary catalog. For those their signal. Another estimate of the false positive rate quantities it is reasonable to adopt a Poissonian distri- for giant planets was made recently by Santerne et al. bution for the statistical error (ǫstat ). We have also at- (2012), from a subsample of KOIs they followed up spec- tempted to include contributions from inputs that do not troscopically. They reported that 34.8 ± 6.5% of close-in rely directly on Kepler observations. One is the uncer- giant planets with periods shorter than 25 days, tran- tainty in the star counts that we have adopted from the sit depths greater than 0.4%, and brightness Kp < 14.7 Besan¸on model of Robin et al. (2003). A comparison of c show radial-velocity signals inconsistent with a planetary the simulated star densities near the center of the Kepler interpretation, and are thus false positives. Adopting the field with actual star counts (R. Gilliland, priv. comm.) same sample restrictions we obtain a false positive rate of shows agreement within 15%. We may therefore use this
  • 10. 10 Fressin et al. as an estimate of the error for false positives involving mine the occurrence of planets in each of our five planet background stars (ǫback ). As an additional test we com- classes and for each of 11 period bins, which comes to 55 pared the Besan¸on results with those from a different c free parameters. We use the rates of occurrence found Galactic structure model. Using the Trilegal model of by Howard et al. (2012) for our initial guess (prior), with Girardi et al. (2005) we found that the stellar densities extrapolated values for the planet sizes (below 2 R⊕ ) and from the latter are approximately 10% smaller, while the periods (longer than 50 days) that they did not consider distribution of stars in terms of brightness and spectral in their study. Each star is initially assigned a global type is similar (background stars using Trilegal are only chance of hosting a planet equal to the sum of these 55 0.47 mag brighter in the R band, and 100 K hotter, on occurrence rates. Our baseline assumption is that planet average). We adopt the larger difference of 15% as our occurrence is independent of the spectral type of the host ǫback uncertainty. star, but we later investigate whether this hypothesis is We have also considered the additional uncertainty consistent with the observations (i.e., with the actual dis- coming from our modeling of the detection level of the tribution of KOI spectral types; Sect. 6). Kepler pipeline (ǫdetec ). While we initially adopted a We have also assumed that the planet sizes are log- nominal detection threshold corresponding to the ex- arithmically distributed between the size boundaries of pected detection rate following Jenkins et al. (2010), in each planet class, and that their periods are distributed practice we find that this is somewhat optimistic, and logarithmically within each period bin. For each of below we describe a revision of that condition that pro- our simulated planets we compute the geometric tran- vides better agreement with the actual performance of sit probability (which depends on the stellar radius) as Kepler as represented by the published list of KOIs by well as the SNR of its combined transit signals (see the Batalha et al. (2012). Experiments in which we repeated Appendix for details on the computation of the SNR). the simulations with several prescriptions for the detec- We assigned to each planet a random inclination an- tion limit yielded a typical difference in the results com- gle, and discarded cases that are not transiting or that pared to the model we finally adopted (see below) that would not be detectable by Kepler. We assigned also a may be used as an estimate of ǫdetec . We note that the random eccentricity and longitude of periastron, with ec- difference between the results obtained from all these de- centricities drawn from a Rayleigh distribution, following tection prescriptions and those that use the a priori de- Moorhead et al. (2011). These authors found that such tection rate is approximately three times ǫdetec . This a distribution with a mean eccentricity in the range 0.1– suggests that the a priori detection model may be used 0.25 provides a satisfactory representation of the distri- to predict reasonable lower limits for the false positive bution of transit durations for KOIs cooler than 5100 K. rates, as well as the planet occurrence rates discussed We chose to adopt an intermediate value for the mean later. of the Rayleigh distribution of e = 0.175, and used it for Based on the above, we take the total error for our stars of all spectral types. Allowing for eccentric orbits false positive population involving background stars to alters the geometric probability of a transit as well as be ǫtot = ǫ2 + ǫ2 + ǫ2 . We adopt a similar stat back detec their duration (and thus their SNR). Finally, we com- expression for false positives involving stars physically pare our simulated population of detectable transiting associated with the target, without the ǫback term. planets with that of the KOIs minus our simulated false positives population, and we correct our initial assump- 5. COMPUTING THE PLANET OCCURRENCE tion for the distribution of planets as a function of size The KOI list of Batalha et al. (2012) is composed of and period. both true planets and false positives. The true planet To estimate uncertainties for our simulated true planet population may be obtained by subtracting our simu- population we adopt a similar prescription as for the false lated false positives from the KOIs. However, this dif- positive rates, and compute ǫtot = ǫ2 + ǫ2 , where stat detec ference corresponds only to planets in the Kepler field the two contributions have the same meaning as before. that both transit their host star and that are detectable While the two terms in ǫtot have roughly the same aver- by Kepler. In order to model the actual distribution age impact on the global uncertainty, the statistical error of planets in each size class and as a function of their tends to dominate when the number of KOIs in a specific orbital period, we must correct for the geometric tran- size and period bin is small, and the detection error is sit probabilities and for incompleteness. Our approach, more important for smaller planets and longer periods. therefore, is to not only simulate false positives, as de- Modeling the detection limits of Kepler is a central scribed earlier, but to also simulate in detail the true component of the process, as the incompleteness correc- planet population in such a way that the sum of the two tions can be fairly large in some regimes (i.e., for small matches the published catalog of KOIs, after accounting signals and/or long periods). Exactly how this is done for the detectability of both planets and false positives. affects both the false positive rates and the planet oc- The planet occurrence rates we will derive correspond currence rates. We have therefore gone to some effort to strictly to the average number of planets per star. investigate the accuracy of the nominal detection model For our planet simulations we proceed as follows. We (Sect. 3) according to which 50% of signals are consid- assign a random planet to each Kepler target that has ered detected by the Kepler pipeline if their SNR exceeds been observed between Q1 and Q6, taking the planet oc- a threshold of 7.1, and 99.9% of signals are detected for currences per period bin and size class to be adjustable a SNR over 10.1. We describe this in the following. variables. We have elected to use the same logarithmic period bins as adopted by Howard et al. (2012), to ease comparisons, with additional bins for longer periods than 5.1. Detection model they considered (up to ∼400 days). We seek to deter- Burke et al. (2012) have reported that a significant
  • 11. Kepler FPR and Occurrence of Planets 11 fraction of the Kepler targets have not actually been searched for transit signals down to the official SNR threshold of 7.1. This is due to the fact that spurious detections with SNR over 7.1 can mask real planet sig- A priori detection recovery rate (50 % for SNR = 7.1) nals with lower SNR in the same light curve. Pont et al. (2006) have shown that time-correlated noise features in KOI SNR (Batalha et al. 2012) KOI computed SNR (based on CDPP) photometric time-series can produce spurious detections well over this threshold, and that this has significant im- Simulated False Positives SNR plications for the yield of transit surveys. We note also Simulated FP + planets SNR that the vetting procedure that led to the published KOI list of Batalha et al. (2012) involves human intervention at various stages, and is likely to have missed some low- SNR candidates for a variety of reasons. Therefore, the assumption that most signals with SNR > 7.1 have been detected and are present in the KOI list is probably op- timistic. Nevertheless, this hypothesis is useful in that it sets lower limits for the planet occurrence rates, and an upper limit for the false positive rates. For example, Fig. 1.— Comparison of signal-to-noise ratio distributions to evaluate the detection model (signal recovery rate) of Kepler. The the overall false positive rate (all planet classes) we ob- dotted black curve corresponds to the KOIs from Batalha et al. tain when following the a priori detection rate is 14.9%, (2012), with the SNRs from their Table 9 adjusted to correspond compared to our lower rate of 9.4% when using a more to Q1–Q6, for consistency with our simulations (see text). There is accurate model given below. good agreement with the distribution of SNRs computed from the CDPP (Christiansen et al. 2012) (red curve). The green curve is The clearest evidence that the Kepler team has missed the SNR distribution of our simulated population of false positives a significant fraction of the low SNR candidates is seen and true planets (which by construction matches the size/period in Figure 1. This figure displays the distribution of the distribution of actual KOIs). It assumes the nominal Kepler de- SNRs of the actual KOIs, and compares it with the SNRs tection model in which 50% of the signals with SNR > 7.1 and 99.9% of the signals with SNR > 10.1 are detectable (solid black for our simulated population of false positives and true lines). The simulated false positives are shown separately by the planets. To provide for smoother distributions we have orange curve. The dotted black curve peaks at a larger SNR than convolved the individual SNRs with a Gaussian with the green curve, indicating that the nominal detection model is a width corresponding to 20% of each SNR (a kernel not accurate: the effective recovery threshold must be significantly higher than 7.1. density estimation technique). The SNRs for the KOIs presented by Batalha et al. (2012) were originally com- puted based on observations from Q1 to Q8. For consis- model requires modification. tency with our simulations, which only use Q1–Q6, we It is not possible to adjust the SNR-dependent recov- have therefore degraded the published SNRs accordingly. ery rate simultaneously with the occurrence rates of plan- Also shown in the figure is the SNR distribution for the ets in our simulations, as the two are highly degenerate. KOIs computed with the prescription described in the However, we find that we are able to reach convergence Appendix based on the CDPP (Christiansen et al. 2012). if we assume the following: Several conclusions can be drawn. One is that there is • the recovery rate is represented by a monotonically very good agreement between the SNRs computed by rising function, rather than a fixed threshold, in- us from the CDPP (red line) and those presented by creasing from zero at some low SNR to 100% for Batalha et al. (2012) (adjusted to Q1–Q6; black line). some higher SNR; Importantly, this validates the CDPP-based procedure used in this paper to determine the detectability of a sig- • the model for the recovery rate function should al- nal. It also suggests that the KOI distribution contains low for a good match of the distribution of SNRs useful information on the actual signal recovery rate of separately for each of our five planet classes. Kepler. Secondly, we note that a small number of KOIs (∼70) have SNRs (either computed from the CDPP or A number of simple models were tried, and we used the adjusted to Q1–Q6) that are actually below the nominal Bayesian Information Criterion (BIC) to compare them threshold of 7.1. Most of these lower SNRs are values we and make a choice: BIC = χ2 + k ln n. In this expres- degraded from Q1–Q8 to Q1–Q6, and others are for KOIs sion χ2 was computed in the usual way by comparing our that were not originally found by the Kepler pipeline, simulated SNRs for the population of false positives and but were instead identified later by further examination true planets with the one for the KOIs; k is the number of systems already containing one or more candidates. of free parameters of the model, and n is the number of Thirdly and most importantly, the peak of the SNR dis- bins in the histogram of the SNR distributions. We con- tribution from our simulations (green line in the figure), sidered the five planet categories at the same time and which by construction match the size and period distri- computed the BIC by summing up the corresponding butions of the KOIs and use the a priori detection model, χ2 values, with n therefore being the sum of the num- is shifted to smaller values than the one for KOIs. This ber of bins for all classes. The model that provides the suggests that the a priori detection model described in best BIC involves a simple linear ramp for the recovery Sect. 3 in which 50% of the signals with SNR > 7.1 (and rate between SNRs of 6 and 16, in other words, no tran- 99.9% of the signals SNR > 10.1) have been detected as sit signal with a SNR below 6 is recovered, and every KOIs is not quite accurate, and indicates the detection transit signal is recovered over 16. Figure 2 (to be com- pared with Figure 1) shows the much better agreement
  • 12. 12 Fressin et al. Detection recovery rate - ramp from SNR = 6 (0%) to SNR = 16 (100 %) KOIs (Batalha et al. 2012) KOI SNR (Batalha et al. 2012) Simulated False Positives KOI computed SNR (based on CDPP) Simulated FP + planets Simulated False Positives SNR Simulated FP + planets SNR Fig. 2.— Same as Figure 1 but with a detection model (solid Fig. 3.— Histogram of the periods of giant planet KOIs from black lines) ranging linearly from 0% for candidates with a SNR Batalha et al. (2012) (dotted black lines), simulated false positives of 6 or lower to 100% for candidates above a SNR of 16. With (orange), and the sum of simulated false positives and simulated this model our simulations can match not only the size and period planets (green). distribution of KOIs, but also the distribution of their SNRs. between the SNR distribution of our simulated popula- 6.1. Giant planets (6–22 R⊕ ) tion of false positives and planets and the SNRs for the Planet occurrence rates and false positive rates are in- KOIs. We adopt this detection model for the remainder terdependent. As described earlier, the bootstrap ap- of the paper. proach we have adopted to determine those properties for the different planet classes begins with the giant plan- 6. PLANET OCCURRENCE RESULTS ets, as only larger objects (stars) with well understood This section presents the results of our joint simulation properties can mimic their signals. The process then of false positives and true planets for each of the planet continues with smaller planets in a sequential fashion. categories, and compares their distributions with those The frequencies of giant planets per period bin were of actual KOIs from Batalha et al. (2012). In particular, adjusted until their distribution added to that of false since we have assumed no correlation between the planet positives reproduces the period spectrum of actual KOIs. frequencies and the spectral type of the host star (the This is illustrated in Figure 3, where the simulated and simplest model), here we examine whether there is any actual period distributions (green and dotted black lines) such dependence, using the stellar mass as a proxy for match on average, though not in detail because of the spectral type. As in some of the previous figures, we statistical nature of the simulations. The frequency of display generalized histograms computed with a kernel false positives (orange line) is seen to increase somewhat density estimator approach in which the stellar masses toward shorter periods, peaking at P ∼ 3–4 days. Sim- are convolved with a Gaussian function. We adopt a ilar adjustments to the frequencies by period bin have Gaussian width (σ) of 20%, which we consider to be a been performed successively for large Neptunes, small realistic estimate of the mass uncertainties in the KIC. Neptunes, super-Earths, and Earth-size planets. The total frequencies (average number of planets per We find that the overall frequency of giant planets star) are reported in Table 2 and Table 3. The first table (planets per star) in orbits with periods up to 418 days is presents the occurrence of planets of different classes per close to 5.2%. Figure 4 (top panel) displays the distribu- period bin of equal size on a logarithmic scale. The bin tion of simulated false positives for this class of planets with the longest period for which we can provide an oc- as a function of the mass of the host star (orange curve). currence estimate differs for the different planet classes. After adding to these the simulated planets, we obtain We do not state results for period bins in which the num- the green curve that represents stars that either have a ber of KOIs is less than 1σ larger than the number of false true transiting giant planet or that constitute a false pos- positives; for these size/period ranges the current list of itive mimicking a planet in this category. The compar- Kepler candidates is not large enough to provide reli- ison with the actual distribution of KOIs (black dotted able values. Cumulative rates of occurrence of planets as curve) shows significant differences, with a Kolmogorov- a function of period are presented in Table 3. Another Smirnov (K-S) probability of 0.7%. This suggests a pos- interesting way to present planet occurrence results is sible correlation between the occurrence of giant planets to provide the number of stars that have at least one and spectral type (mass), whereas our simulations have planet in various period ranges. This requires a differ- assumed none. In particular, the simulations produce an ent treatment of KOIs with multiple planet candidates excess of giant planets around late-type stars, implying for the same star. To compute these numbers, shown in that in reality there may be a deficit for M dwarfs. The Table 4, we repeated our simulations by removing from opposite seems to be true for G and K stars. Doppler the KOI list the planet candidates beyond the inner one surveys have also shown a dependence of the rates of in the considered size range, for KOIs with multiple can- occurrence of giant planets with spectral type. For ex- didates. ample, Johnson et al. (2010) reported a roughly linear
  • 13. Kepler FPR and Occurrence of Planets 13 increase as a function of stellar mass, with an estimate of about 3% for M dwarfs. We find a similar figure of 3.6 ± 1.7%. As is the case for the RV surveys, our fre- Giant planets (6 REarth -2 RJup) quency increases for G and K stars to 6.1 ± 0.9%, but (per 0.1 M -bin) then reverses for F (and warmer) stars to 4.3 ± 1.0%, 223 KOIs, FPR = 17.7 % while the Doppler results suggest a higher frequency ap- proaching 10%. We point out, however, that there are KOIs (Batalha et al. 2012) rather important differences between the two samples: Simulated False Positives (1) the estimates from radial velocity surveys extend out Simulated FP + planets to orbital separations of 2.5 AU, while our study is only reasonably complete to periods of ∼400 days (1.06 AU for a solar-type star); (2) the samples are based on two very different characteristics: the planet mass for RV surveys, and planet radius for Kepler ; and (3) there may well be a significant correlation between the period distribution of planets and their host star spectral type. Indeed, the results from the study of planets orbiting A-type stars by Bowler et al. (2010) provides an explanation for the apparent discrepancy between RV and Kepler results for the occurrence of giant planets orbiting hot stars: Fig- KS prob = 0.7 % ure 1 of the above paper shows that no Doppler planets have been discovered with semimajor axes under 0.6 AU for stellar masses over 1.5,M⊙ , creating a ‘planet desert’ in that region. Thus, the Doppler surveys find the more common longer-period planets around the hotter stars, and Kepler has found the rarer planets close-in. Averaged over all spectral types, the frequency of giant planets up to orbital periods of 418 days is approximately 5.2% (Table 3). 6.2. Large Neptunes (4–6 R⊕ ) Our simulations result in an overall frequency of large Neptunes with periods up to 418 days of approximately 3.2%. The distribution in terms of host star mass Fig. 4.— (Top): Generalized histogram of the stellar masses of is shown in Figure 5, and indicates a good match to KOI host stars in the giant planet category (dotted black curve), the distribution of the large Neptune-size KOIs from compared against the results of our Monte-Carlo simulation of the Batalha et al. (2012) (K-S probability of 23.3%). We Kepler targets observed during Q1–Q6. The total number of KOIs conclude that there is no significant dependence of the with main-sequence host stars in this category (223) and the overall false positive rate (FPR = 17.7%; Table 1) are indicated. Simu- occurrence rate of planets in this class with the spectral lated false positives are shown in orange, and the sum of these type of the host star. and the simulated planet population are represented in green. The green and dotted black curves are statistically different. (Bottom): cumulative distribution of stellar masses from the top panel. Our 6.3. Small Neptunes (2–4 R⊕ ) assumed uniform occurrence of giant planets as a function of the The overall rate of occurrence (planets per star) of host star spectral type (mass) results in a distribution that is statis- tically different from the actual KOI distribution (K-S probability small Neptunes rises significantly compared to that of of 0.7%). The KOI distribution indicates that G-K stars are more larger planets, reaching 31% out to periods of 245 days. likely to host giant planets than F- and M-type stars. Due to small-number statistics we are unable to provide reliable estimates for periods as long as those considered for the larger planets (up to 418 days). The logarith- with an interesting result they reported in a study based mic distribution of sizes we have assumed within each on the first three quarters of Kepler data. They found planet category allows for a satisfactory fit to the ac- that small Neptunes are more common around late-type tual KOI distributions in each class (with separate K-S stars than early-type stars, and that the chance, f (Teff ), probabilities above 5%), with the exception of the small that a star has a planet in the 2–4 R⊕ range depends Neptunes. As noted earlier, the increase in planet oc- roughly linearly on its temperature. Specifically, they currence toward smaller radii for these objects is very proposed f (Teff ) = f0 + kT (Teff − 5100 K)/1000 K, valid steep (Figure 7). We find that dividing the small Nep- over the temperature range from 3600 K to 7100 K, with tunes into two subclasses (two radius bins of the same coefficients f0 = 0.165 ± 0.011 and kT = −0.081 ± 0.011. logarithmic size: 2–2.8 R⊕ , and 2.8–4 R⊕ ) we are able to In addition to the use in the present work of the con- obtain a much closer match to the KOI population (K-S siderably expanded KOI list from Batalha et al. (2012) probability of 6%) with similar logarithmic distributions (which is roughly twice the size of the KOI list from within each sub-bin as assumed before. Borucki et al. 2011 used by Howard et al. 2012), there In our analysis we have deliberately chosen the size are a number of other significant differences between our range for small Neptunes to be the same as that adopted analysis and theirs including the fact that we account by Howard et al. (2012), to facilitate the comparison for false positives, and we use a different model for the