The clustering of massive galaxies at z~0.5 from the first semester of BOSS data

We calculate the real- and redshift-space clustering of massive galaxies at z~0.5 using the first semester of data by the Baryon Oscillation Spectroscopic Survey (BOSS). We study the correlation functions of a sample of 44,000 massive galaxies in the redshift range 0.4<z<0.7. We present a halo-occupation distribution modeling of the clustering results and discuss the implications for the manner in which massive galaxies at z~0.5 occupy dark matter halos. The majority of our galaxies are central galaxies living in halos of mass 10^{13}Msun/h, but 10% are satellites living in halos 10 times more massive. These results are broadly in agreement with earlier investigations of massive galaxies at z~0.5. The inferred large-scale bias (b~2) and relatively high number density (nbar=3e-4 h^3 Mpc^{-3}) imply that BOSS galaxies are excellent tracers of large-scale structure, suggesting BOSS will enable a wide range of investigations on the distance scale, the growth of large-scale structure, massive galaxy evolution and other topics.


INTRODUCTION
The distribution of objects in the universe displays a high degree of organization, which in current models is due to primordial fluctuations in density which were laid down at very early times and amplified by the process of gravitational instability. Characterizing the evolution of this large-scale structure is a central theme of cosmology and astrophysics. In addition to allowing us to understand the structure itself, large-scale structure studies offer an incisive tool for probing cosmology and particle physics and set the context for our modern understanding of galaxy formation and evolution. Since the pioneering studies of Humason et al. (1956), Gregory & Thompson (1978), and Joeveer & Einasto (1978) and the first CfA redshift survey (Huchra et al. 1983), galaxy redshift surveys have played a key role in this enterprise, and ever larger surveys have provided increasing insight and ever tighter constraints on cosmological models. This paper presents the first measurements of the clustering of massive galaxies from the Baryon Oscillation Spectroscopic Survey (BOSS; Schlegel et al. 2009) based on a sample of galaxy redshifts observed during the period 2010 January through July. We demonstrate that BOSS is efficiently obtaining redshifts of some of the most luminous galaxies at z 0.5 and has already become the largest such redshift survey ever undertaken. The high bias and number density of these objects (described below) make them ideal tracers of large-scale structure, and suggest that BOSS will make a significant impact on many science questions including a determination of the cosmic distance scale, the growth of structure, and the evolution of massive galaxies.
The outline of the paper is as follows. In Section 2, we briefly describe the BOSS survey and observations and define the sample we focus on in this paper. Our clustering results are described in Section 3 and interpreted in the framework of the halo model in Section 4, where we also compare to previous work on the clustering of massive galaxies at intermediate redshift. We conclude with a discussion of the implications of these results in Section 5, while some technical details on the construction of our mock catalogs are relegated to an Appendix. Throughout this paper when measuring distances we refer to comoving separations, measured in h −1 Mpc with H 0 = 100 h km s −1 Mpc −1 . We convert redshifts to distances, assuming a ΛCDM cosmology with Ω m = 0.274, Ω Λ = 0.726 and h = 0.70. This is the same cosmology as assumed for the N-body simulations from which we make our mock catalogs (see the Appendix).

OBSERVATIONS
The Sloan Digital Sky Survey (SDSS; York et al. 2000) mapped nearly a quarter of the sky using the dedicated Sloan Foundation 2.5 m telescope (Gunn et al. 2006) located at Apache Point Observatory in New Mexico. A drift-scanning mosaic CCD camera (Gunn et al. 1998) imaged the sky in five photometric bandpasses (Fukugita et al. 1996;Smith et al. 2002;Doi et al. 2010) to a limiting magnitude of r 22.5. The imaging data were processed through a series of pipelines that perform astrometric calibration (Pier et al. 2003), photometric reduction (Lupton et al. 2001), and photometric calibration (Padmanabhan et al. 2008). The magnitudes were corrected for Galactic extinction using the maps of Schlegel et al. (1998). BOSS, a part of the SDSS-III survey (D. J. Eisenstein et al. 2011, in preparation), has completed an additional 3000 deg 2 of imaging in the southern Galactic cap, taken in a manner identical to the original SDSS imaging. All of the data have been processed through the latest versions of the pipelines and BOSS is obtaining spectra of a selected subset (N. Padmanabhan et al. 2011, in preparation) of 1.5 million galaxies approximately volume limited to z 0.6 (in addition to spectra of 150,000 quasars and various ancillary observations). The targets are assigned to tiles of diameter 3 • using an adaptive tiling algorithm (Blanton et al. 2003). Aluminum plates are drilled with holes corresponding to the positions of objects on each tile, and manually plugged with optical fibers that feed a pair of double spectrographs. These spectrographs are significantly upgraded from those used by SDSS-I/II (York et al. 2000;Stoughton et al. 2002), with improved chips with better red response, higher throughput gratings, 1000 fibers (instead of 640) and a 2 entrance aperture (was 3 ). The spectra cover the range 3600-10000 Å, at a resolution of about 2000.
BOSS makes use of luminous galaxies selected from the multi-color SDSS imaging to probe large-scale structure at intermediate redshift (z < 0.7). These galaxies are among the most luminous galaxies in the universe and trace a large cosmological volume while having high enough number density to ensure that shot-noise is not a dominant contributor to the clustering variance. The majority of the galaxies have old stellar systems whose prominent 4000 Å break in their spectral energy distributions makes them relatively easy to select in multi-color data.
The strategy behind, and details of, our target selection are covered in detail in N. Padmanabhan et al. (2011, in preparation). Cuts in color-magnitude space allow a roughly volume-limited sample of luminous galaxies to be selected, and partitioned into broad redshift bins. Briefly, we follow the SDSS-I/II procedure described in Eisenstein et al. (2001) and define a "rotated" combination of colors d ⊥ = (r − i) − (g − r)/8. The sample we analyze in this paper (the so-called CMASS sample since it is approximately stellar mass limited) is defined via d ⊥ > 0.55 and i < 19.9 and i fiber2 < 21.5 i < 19.86 + 1.6 (d ⊥ − 0.8) and r − i < 2, where magnitude cuts use "cmodel magnitudes" and colors are defined with "model magnitudes," except for i fiber2 which is the magnitude in the 2 spectroscopic fiber (see Stoughton et al. 2002;Abazajian et al. 2004, for definitions of the magnitudes and further discussion). There are two additional cuts to reduce stellar contamination, z psf −z 9.125-0.45 z and r psf −r > 0.3. These cuts isolate the z ∼ 0.5, high-mass galaxies. The i − d ⊥ constraint is approximately a cut in absolute magnitude Figure 1. The (comoving) number density of galaxies,n(z), for the sample described in the text (Section 2). The vertical dashed lines indicate the redshift limits we use in our analysis: 0.4 < z < 0.7. or stellar mass, with d ⊥ closely tracking redshift for these galaxies. As discussed in detail in N. Padmanabhan et al. (2011, in preparation), the slope of the i − d ⊥ cut is set to parallel the track of a passively evolving, constant stellar mass galaxy as determined from the population synthesis models of Maraston et al. (2009). This approach leads to an approximately stellar mass limited sample. We restrict ourselves to galaxies in the redshift range 0.4 < z < 0.7 (Figure 1). Note that our selection gives the majority of the galaxies within Δz = 0.1 of the median-this has the advantage of making the analysis relatively straightforward but means we need to combine with other samples to obtain leverage in redshift. A comparison of the cuts defining this sample with other, similar, samples in the literature will be presented in N. Padmanabhan et al. (2011, in preparation). In general BOSS goes both fainter and bluer than the earlier samples, targeting "luminous galaxies" not "luminous red galaxies." The distribution of absolute (r-band) magnitude for the sample is shown in Figure 2, where we see that all of the CMASS galaxies are intrinsically very luminous. Using the modeling of C. Maraston et al. (2011, in preparation) on the BOSS spectra we find the median stellar mass of the sample is 10 11.7 M . While the detailed numbers depend on assumptions about. e.g., the initial mass function, these galaxies are at the very high mass end of the stellar mass function at this redshift for any reasonable assumptions.
The clustering measurements in this paper are based on the data taken by BOSS up to end of 2010 July, which includes 120,000 galaxies over 1600 deg 2 of sky. However, the data prior to 2010 January were taken in commissioning mode and few of those data are of survey quality. Once we trim the data to contiguous regions ( Figure 3) with high-redshift completeness and select galaxies at z ∼ 0.5 we are left with 44,000 galaxies, covering 580 deg 2 , which we have used in our analysis.
The sky coverage of our sample can be seen in Figure 3. We view the data as comprising three regions of the sky, hereafter referred to as A, B, and C (see the figure). Galaxies in these regions are separated from those in any other region by several hundred Mpc, and we shall consider them independent. Convenient "rectangular" boundaries to the regions are Figure 2. Distribution of absolute magnitudes for the sample analyzed in this paper. We have k + e corrected the r-band magnitudes to z 0.55 using the g − i color assuming a passively evolving galaxy-since the redshift range is small this amounts to a small correction. This sample consists of intrinsically very bright, and massive, galaxies with stellar masses several times the characteristic mass in a Schechter fit. The luminosity function of Faber et al. (2007) at z = 0.5, converted from B to r band assuming a redshifted z = 0 elliptical galaxy template, has a characteristic luminosity of −19.8. Converting the Bell et al. (2004) luminosity function using a high-z single burst model gives −20. So, all of the CMASS galaxies are brighter than this characteristic luminosity.
These boundaries yield widths (heights) of 600 (700), 2600 (270), and 1600 (800) h −1 Mpc, respectively, at z 0.5. As we shall discuss below, the data are consistent with having the same clustering and redshift distribution in all three regions.

CLUSTERING MEASURES
We compute several two-point, configuration-space clustering statistics in this paper. The basis for all of these calculations is the two-point galaxy correlation function on a twodimensional (2D) grid of pair separations parallel and perpendicular to the line of sight: ξ (R, Z).
To estimate the counts expected for unclustered objects while accounting for the complex survey geometry, we generate random catalogs with the detailed radial and angular selection functions of the sample but with 50× the number of points. Numerous tests have confirmed that the survey selection function factorizes into an angular and a redshift piece. The redshift selection function can be taken into account by distributing the randoms according to the observed redshift distribution of the sample. The completeness on the sky is determined from the fraction of target galaxies in a sector for which we obtained a high-quality redshift, with the sectors being areas of the sky covered by a unique set of spectroscopic tiles (see Blanton et al. 2003;Tegmark et al. 2004). We use the Mangle software (Swanson et al. 2008) to track the angular completeness. In computing the redshift completeness we omit galaxies for which a redshift was already known from an earlier survey from both the target and success lists, and then later randomly sample such galaxies with the resulting completeness in constructing the input catalog. Since very few of our targets at z ∼ 0.5 have existing redshifts this is a very small correction. Not all of the spectra taken resulted in a reliable redshift, and the failure probability has angular structure due to hardware limitations. These result in spatial signal-to-noise fluctuations in observations. We find no evidence that this failure is redshift dependent-low-and high-redshift failure regions have the same redshift distribution. We therefore apply a small angular correction for this spatial structure by up-weighting galaxies based on the signal to noise of each spectrum, and the probability of redshift measurement. This is a small correction and only affects our results at the percent level. To avoid issues arising from small-number statistics we only keep sectors with area larger than 10 −4 sr, or approximately 0.3 deg 2 . At the observed mean density (150 deg −2 ) we expect several tens of galaxies in any such region, enabling us to reliably determine the redshift completeness. 18 We trim the final area to all sectors with completeness greater than 75%, producing our final sample of 44,000 galaxies, distributed as 5000 in region A, 14,000 in region B, and 24,000 in region C. 18 Assuming binomial statistics, if M of N galaxies have redshifts the most likely completeness is c = M/N, the mean is (M + 1)/(N + 2) and the variance on c is [M(N − M) + N + 1]/[(N + 2) 2 (N + 3)]. For example, if N = 12 and M = 9 the error on c is approximately 10%. For N 100 the error is under 5%. Unless the scatter is somehow correlated with the signal these uncertainties are negligible. In fact, we find that ignoring the exact value of the completeness in constructing our random catalog only slightly alters our final ξ . . Contours of the redshift-space correlation function, ξ (R, Z), for our 0.4 < z < 0.7 galaxy sample (see the text). Note the characteristic elongation in the Z direction at small R (fingers-of-god) and squashing at large R (super-cluster infall). The left panel shows the results from the BOSS data, while the right panel is from our mock catalogs. The level of agreement is quite good, as can be seen more quantitatively in later figures.
After the cut the median, galaxy-weighted completeness is 88%, 84%, and 88% in regions A, B, and C, respectively.
We estimate ξ (R, Z) using the Landy & Szalay (1993) where DD, DR, and RR are suitably normalized numbers of (weighted) data-data, data-random, and random-random pairs in bins of (R, Z). We experimented with two sets of weights, one to correct for fiber collisions (described below) and one to reduce the variance of the estimator. The latter was wheren(z i ) is the mean density at redshift z i andξ V is a model for the volume-integrated redshift-space correlation function within s. We approximatedξ V = 4πs 2 0 s, corresponding to ξ (s) = (s/s 0 ) −2 and took s 0 = 8 h −1 Mpc. The details of the weighting scheme did not affect our final result on the scales of interest to us here-in fact dropping this weight altogether gave comparable results and so we neglect this weight in what follows.
We are unable to obtain redshifts for approximately 7% of the galaxies due to fiber collisions-no two fibers on any given observation can be placed closer than 62 . At z 0.5 the 62 exclusion corresponds to 0.4 h −1 Mpc. Where possible we obtain redshifts for the collided galaxies in regions where plates overlap, but the remaining exclusion must be account for. We correct for the impact of this by (1) restricting our analysis to relatively large scales and (2) up-weighting galaxy-galaxy pairs in the analysis with angular separations smaller than 62 . The weight is derived by comparing the angular correlation function of the entire photometric sample with that of the galaxies for which we obtained redshifts (Hawkins et al. 2003;Li et al. 2006;Ross et al. 2007). This ratio is very close to unity above 62 but significantly depressed below this scale. Note that in our situation there is a close correspondence between angular separation and transverse separation since our survey volume is a relatively narrow shell with reasonably large radius, so the number of pairs for which this correction is appreciable is quite small.
Contours of the 2D correlation function for our 0.4 < z < 0.7 galaxy sample are shown in Figure 4. Note the characteristic elongation in the Z direction at small R (fingers-of-god) and squashing at large R (super-cluster infall).
To mitigate the effects of redshift space distortions, we follow standard practice and compute from ξ (R, Z) the projected correlation function (e.g., Davis & Peebles 1983) In practice, we integrate to 100 h −1 Mpc, which is sufficiently large to include almost all correlated pairs. We also compute the angularly averaged, redshift space correlation function, ξ (s), and the cross-correlation between the CMASS sample selected from the imaging and the spectroscopic samples, w × . For all of these measures the full covariance matrix is computed from a set of mock catalogs based on a halo-occupation distribution (HOD) modeling of the data (Section 4 and the Appendix).
We now discuss each of the clustering measurements in turn, beginning with the real-space clustering.

Real-space Clustering
The projected correlation function for the 0.4 < z < 0.7 sample is shown in Figure 5 and given in Table 1. We chose eight bins, equally spaced in ln R between 0.3 h −1 Mpc and 30 h −1 Mpc as a compromise between retaining the relevant information and generating stable covariance matrices via Monte Carlo. The finite width of these bins should be borne in mind when comparing theoretical models to these data. The integration over Z in Equation (7) was done by Riemann sum using 100 linearly spaced bins in Z. The results were well converged at this spacing, because of the "smearing" of the correlation function along the line of sight due to redshift space effects. The data were analyzed separately in each of regions A, B, and C and then combined in a minimum variance manner: The errors on the individual samples have been suppressed for clarity. The data are combined using the full covariance matrix, but only the diagonal elements are plotted. The w p implied by a power-law correlation function of slope −1.8 and correlation length of 7.5 h −1 Mpc forms a reasonable fit to the data with 1 < R p < 10 h −1 Mpc but we do not plot it here for clarity. The (thick) long-dashed-dotted line shows the prediction of the best-fitting HOD model (Section 4), which provides a reasonable fit on all scales plotted (recall the errors are correlated).
where w (α) p represents the vector of w p measurements from region α = A, B, or C. Not surprisingly, the combined result is dominated by the results from region C. To reduce the condition number of the covariance matrix, and the dynamic range in w p , we fit throughout to R w p and quote the results in that form. The w p points are quite covariant, in part because the integration in Equation (7) introduces a large mixing of power at different R, thus use of the full covariance matrix is essential. The error bars on the individual w (α) p have been suppressed in the figure for clarity, and the square root of the diagonal elements of the covariance matrix is shown as error bars on the combined result.
We also subdivided the redshift range into a low-z and high-z half, splitting at z = 0.55, and found no statistically significant difference between the two samples ( Figure 6; in the split samples the fiber collision correction is more uncertain, so the disagreement at the smallest R point is not very significant). This result motivates our decision to analyze the data in a single redshift slice. Slow evolution of the clustering is expected for a highly biased population such as our luminous galaxies where the evolution of the bias approximately cancels the evolution of the dark matter clustering (Fry 1996).
Even with only the eight data points in w p , deviations from a pure power-law correlation function are apparent. These can be traced to the non-power-law nature of the mass correlation function and the way in which the galaxies occupy dark matter halos-we will return to these issues in Section 4.
The calculation of errors in clustering measurements can be done in a number of different ways (see Norberg et al. 2009 for a discussion). We first tried a bootstrap estimate, dividing the survey regions into 8-22, roughly equal area "pixels" and sampling from these regions with replacement (Efron & Gong 1983). Unfortunately, the irregular geometry and relatively small sky coverage meant that we were not able to obtain a covariance matrix which was stable against changes in the pixelization. We anticipate that as the survey progresses this technique will become more robust. In the meantime, we computed the covariance from a series of mock catalogs derived from an iterative procedure using N-body simulations as described in the Appendix. We will show in Figure 13 that the distribution of χ 2 from our mock catalogs encompasses the value obtained for the data in regions A, B, and C if both are computed using the mock-based covariance matrix and the best-fitting HOD model (Section 4). This indicates that the measurements we obtain are completely consistent with being drawn from the underlying HOD model, given the finite number of galaxies and observing geometry.

Redshift-space Clustering
The angle-averaged redshift space correlation function, ξ (s), for the 0.4 < z < 0.7 sample is shown in Figure 7. Again, the data were analyzed separately in each of regions A, B, and C. The dot-dashed line shows the same power-law correlation function as described in Figure 5, while the solid line shows the predicted ξ (s) from the model that best fits the w p data (above). The enhancement of clustering over the real-space result on large scales (Kaiser 1987; for a review see Hamilton 1998 and for recent developments see Pápai & Szapudi 2008;Shaw & Lewis 2008) is evident in the comparison of the data to the power law. The good agreement between the data and the HOD model below a few Mpc is indication that the satellite fraction in the model is close to that in the data and the relative motions of the satellite galaxies are close to the motions of the dark matter within the parent halos (i.e., any velocity bias is small). The characteristic down-turn on scales smaller than a few Mpc is expected from virial motions within halos and the motion of halos themselves. The excess power of the HOD model compared to the data on scales of a few Mpc can be mitigated by increasing the degrees of freedom in the model, for example by dropping the assumption that central galaxies move with the mean halo velocity or follow the dark matter radial profile or allowing a modest amount of satellite velocity bias.
On scales below tens of Mpc the violations of the distant observer approximation are small, but on larger scales they begin to become appreciable (Pápai & Szapudi 2008) and should be included in any comparison between these data and a theoretical model (most noticeably for the higher multipoles). Redshift-space, isotropic correlation function for the 0.4 < z < 0.7 sample in regions A, B, and C (points). The same power-law correlation function which fits the w p data on intermediate scales, with s 0 = 7.5 h −1 Mpc, is shown as the dot-dashed line while the solid line is the prediction for ξ (s) from the best-fitting HOD model to w p , assuming no velocity bias for satellites and that central galaxies are at rest in their halos. The good agreement below a few Mpc is an indication that the satellite fraction in the model is close to that in the data and any velocity bias is small.

Cross-correlation
Finally we consider w p computed from the cross-correlation of the imaging catalog with the spectroscopy-this allows us to isolate the galaxies to a narrow redshift shell and convert angles to (transverse) distances while at the same time being insensitive to the details of the spectroscopic selection including the issue of fiber collisions. 19 As described in Padmanabhan et al. (2009), the angular cross-correlation of the imaging and spectroscopic samples, with angles converted to distances using the redshift of the spectroscopic member, can be written as where f (χ ) is the normalized radial distribution of the photometric sample as a function of comoving distance, χ , and the average is over the redshift distribution of the spectroscopic sample. Note that w × (R) is dimensionless, with f (χ ) having dimensions of inverse length and w p having dimensions of length. Figure 8 shows the cross-correlation for regions A, B, and C along with a power-law correlation function. The normalization of this figure differs from that of Figure 5 by a factor of f (χ ) ∼ O(10 −3 ). Because the signal is suppressed by the width of f (χ ) the estimate of w p from the cross-correlation is significantly noisier than that from the auto-correlation (see Myers et al. 2009, Section 2.1, for related discussion). The cross-correlation estimate is consistent with our auto-correlation results but we have not attempted to fit any models to it directly. We have extended the cross-correlation to smaller scales in the figure to emphasize that there is significant power even on very small scales, which are difficult to probe directly with the autocorrelation function due to the fiber collision problem.

HALO-OCCUPATION MODELING
In order to relate the observed clustering of galaxies with the clustering of the underlying mass, and to make realistic mock catalogs, we interpret our measurements within the context of the HOD (Peacock & Smith 2000;Seljak 2000;Benson et al. 2000;White et al. 2001;Berlind & Weinberg 2002;Cooray & Sheth 2002). The HOD describes the number and distribution of galaxies within dark matter halos. Since the clustering and space density of the latter are predictable functions of redshift, any HOD model makes predictions for a wide range of observational statistics. Rather than perform a simultaneous fit to the real-and redshift-space correlation functions (including their covariances) we choose to fit to the real-space clustering only and show that the models which best fit these data also provide a reasonable description of the redshift-space clustering results. This avoids the need to make additional assumptions for modeling the redshift space correlation function. We also implicitly assume that we are measuring a uniform sample of galaxies across the entire redshift range, so that a single HOD makes sense. We tested this assumption by splitting the sample into high-and low-redshift subsamples.
We use a halo model which distinguishes between central and satellite galaxies with the mean occupancy of halos: 20 Note that our definition of σ can be interpreted as a fractional "scatter" in mass at threshold but is a factor ln(10)/ √ 2 different than that in Zheng et al. (2005). for M > κM cut and zero otherwise. This form implicitly assumes that halos do not host satellite galaxies without hosting centrals, which is at best an approximation, but this is reasonable for the purposes of computing projected clustering. Different functional forms have been proposed in the literature, but the current form is flexible enough for our purposes.
To explore the plausible range of HOD parameter space we applied the Markov Chain Monte Carlo method (see, e.g., Gilks et al. 1996) to the w p data using a χ 2 -based likelihood. This method generates a "chain" of HOD parameters whose frequency of appearance traces the likelihood of that model fitting the data. It works by generating random HODs from a trial distribution (in our case a multi-dimensional Gaussian), populating a simulation cube with galaxies according to that HOD, computing w p from the periodic box by pair counts and accepting or rejecting the HOD based on the relative likelihood of the fit. The step sizes and directions are determined from the covariance matrix of a previous run of the chain. Given the chain, the probability distribution of any statistic derivable from the parameters can easily be computed: we show the mean occupancy of halos as a function of mass, N (M), in Figure 9, where the band indicates the ±1 σ spread within the chain. The mean (galaxy-weighted) halo mass is M 180b gal = (2.8 ± 0.15) × 10 13 h −1 M (we quote here the mass interior to a sphere within which the mean density is 180× the background density for halo mass, rather than the friends-offriends (FoF) mass, to facilitate comparison with other work); while the satellite fraction is (10 ± 2)%. The values of the HOD parameters are given in Table 2.
In addition to the purely statistical errors, shown in the figure and quoted above, there are systematic uncertainties. Our correction for fiber collisions only significantly impacts the smallest R point in our calculation. If we increase the error on that point by a factor of 10, effectively removing it from the fit, the results change to M 180b gal = (2.6±0.15)×10 13 h −1 M and (7 ± 2)%, respectively, which are shifts of approximately 1 σ . Additional uncertainty arises from the uncertainty in the background cosmology (held fixed in this paper) and from methodological choices. A comparison of different methods for performing the halo modeling (using different mass definitions or halo profiles, analytic versus numerical methods, different Notes. Both R and w p are measured in units of h −1 Mpc, with R quoted at the bin mid-point. To reduce the condition number of the covariance matrix we quote means, errors and covariances on Rw p , which removes much of the run of w p with scale and makes the quoted data points more similar in magnitude. The error bars, σ i , from the diagonal of C, are broken out separately in the third row and the correlation matrix, C ij /(σ i σ j )is quoted in the lower part of the table.
The full covariance matrix should be used in any fit, and the finite width of the R bins should be included in theoretical predictions.

Table 2
The Mean and Standard Deviation of the HOD Parameters (See Equations (12) and (13) ways of enforcing halo exclusion, etc.) suggests an additional O(10%) "systematic" uncertainty. It would be interesting to check the assumptions going into this HOD analysis, and the inferences so derived, with additional data and a luminosity dependent modeling. The halo occupancy of massive galaxies at these redshifts has been investigated before based on both photometric (White et al. 2007;Blake et al. 2008;Brown et al. 2008;Padmanabhan et al. 2009) and spectroscopic (Ross et al. 2007(Ross et al. , 2008Wake et al. 2008;Zheng et al. 2009;Reid & Spergel 2009) samples. Accounting for differences in sample selection and redshift range, our results appear quite consistent with the previous literature (see Figure 10).
Our galaxies populate a broad range of halo masses, with an approximate power-law dependence of the mean number of galaxies per halo with halo mass for massive halos and a broad roll-off at lower halo masses. The low-mass behavior is driven by the amplitude of the large-scale clustering in combination with the relatively high number density of our sample and encodes information about the scaling of the central galaxy luminosity with halo mass and its distribution. We find that the halos with masses (2-3) × 10 13 h −1 M contain on average one of our massive galaxies. At these redshifts such halos are quite highly biased (see below), corresponding to galaxy groups, and we expect b(z) ∝ 1/D(z), where D(z) is the linear growth rate, leading to an approximately constant clustering amplitude with redshift. The majority of our galaxies are central galaxies residing in 10 13 h −1 M halos, but a non-negligible fraction are satellites which live primarily in halos ∼10 times more massive. The width of this "plateau" (M 1 /M cut ) is smaller than that found for less luminous systems at lower redshift, though it continues the trends seen in Zheng et al. (2009) for plateau width and satellite fraction as a function of luminosity. This increase in the satellite fraction is driving the visibility of the fingers-of-god in the correlation function ( Figure 4) and the small-scale upturn in w p .
An alternative view of the halo occupation is presented in Figure 11, which shows the probability that a galaxy in our sample is hosted by a halo of mass M. Note the broad range of halo masses probed by our galaxies, and the low probability of finding one of our galaxies in very high mass halos-which is a consequence of the sparsity of such halos at this redshift.
The N-body simulations can also be used to infer the scale dependence of the bias, b(r) ≡ [ξ gal (r)/ξ dm (r)] 1/2 , for our bestfitting halo model. This is shown in Figure 12, where we see that above 10-20 h −1 Mpc the bias approaches a constant, b 2. For our cosmology the linear growth factor at z = 0.55 is 0.762 so bσ 8 (z = 0.55) = 1.3 and this is assumed constant across our redshift range. This is very similar to the results obtained for photometric luminous red galaxy (LRG) samples at comparable redshifts (Blake et al. 2007;Ross et al. 2007;Padmanabhan et al. 2007Padmanabhan et al. , 2009Blake et al. 2008). The rapid rise of b(r) at very small scales is expected, since it is well known that these galaxies exhibit an almost power-law correlation function at small scales while the nonlinear ξ dm (r) is predicted to fall below a power law at small r. (Most galaxy pairs on these scales are central-satellite pairs, whereas for the dark matter there is Figure 11. Probability per log 10 M halo (upper) or cumulative probability (lower) that a galaxy in our sample is hosted by a halo of mass M halo . Note the broad range of halo masses probed by our galaxies, and the low probability of finding one of our galaxies in very high mass halos (due to the sparsity of such halos at this redshift).

Figure 12.
Scale dependence of the bias, b(r) ≡ [ξ gal (r)/ξ dm (r)] 1/2 , predicted from our best-fit halo model and N-body simulations. The feature at a few Mpc has been seen in other analyses and occurs at the transition between the oneand two-halo contributions (see the text). Note that the bias asymptotes to a constant, b 2, on large scales. no such distinction so ξ dm is the convolution of the halo radial profile with itself.) The feature in b(r) at a few Mpc occurs at the transition between the one-and two-halo contributions, i.e., pairs of galaxies that lie within a single dark matter halo versus those which lie in separate halos, while the rise at slightly larger scales comes from the scale dependence of the halo bias. Note that the combination of the high clustering amplitude and number density makes this sample particularly powerful for probing large-scale structure at z 0.5.
Finally, using the best-fitting HOD model from the chain and a series of N-body simulations we generate mock catalogs as described in more detail in the Appendix. These are passed through the observational masks and cuts in order to mimic the observations and can be analyzed in the same manner to generate a set of mock measurements from which we compute covariance matrices and other statistical quantities. We match the redshift distribution of the sample to our constant-numberdensity simulation boxes by randomly subsampling the galaxies as a function of redshift (with a 100% sampling at the peak near z 0.55). This is consistent with our assumption, earlier, that the HOD describes a single population of objects and the dN/dz reflects observational selection effects. We obtain similar HODs fitting to the thinner redshift slices which lends credence to this view.

DISCUSSION
The Baryon Spectroscopic Oscillation Survey is in the process of taking spectra for 1.5 million luminous galaxies and 150,000 quasars to make a precision determination of the scale of baryon oscillations and to study the growth of structure and the evolution of massive galaxies. We have presented measurements of the clustering of 44,000 massive galaxies at z ∼ 0.5 from the first semester of BOSS data, showing that BOSS is performing well and that the galaxies we are targeting have properties in line with expectations (Schlegel et al. 2009).
The CMASS sample at z 0.5 has a large-scale bias of b 2 (Figure 12), and a number density several times higher than the earlier, spectroscopic LRG sample of Eisenstein et al. (2001), making it an ideal sample for studying large-scale structure. The majority of our CMASS galaxies are central galaxies residing in 10 13 h −1 M halos, but a non-negligible fraction are satellites which live primarily in halos ∼10 times more massive.
The data through 2010 July do not cover enough volume to robustly detect the acoustic peak in the correlation function in this sample, one of the science goals of BOSS. While no definitive detection is possible at present, the error bars are anticipated to shrink rapidly as we collect more redshifts; BOSS should be able to constrain the acoustic scale at z ∼ 0.5 within the next year, with the constraints becoming increasingly tight as the survey progresses.
Funding for SDSS-III has been provided by the Alfred P. Sloan Foundation, the Participating Institutions, the National Science Foundation, and the U.S. Department of Energy. The SDSS-III Web site is http://www.sdss3.org/.
SDSS-III is managed by the Astrophysical Research Consortium for the Participating Institutions of the SDSS-III Collaboration including the University of Arizona, the Brazilian Participation Group, Brookhaven National Laboratory, University of Cambridge, University of Florida, the French Participation Group, the German Participation Group, the Instituto

N-BODY SIMULATIONS AND MOCK CATALOGS
We make use of several simulations in this paper. The main set is 20 different realizations of the ΛCDM family with Ω m = 0.274, Ω Λ = 0.726, h = 0.7, n = 0.95, and σ 8 = 0.8 (in agreement with a wide array of observations). Briefly, each simulation employs an updated version of the TreePM code described in White (2002) to evolve 1500 3 equal mass (7.6 × 10 10 h −1 M ) particles in a periodic cube of side length 1500 h −1 Mpc with a Plummer equivalent smoothing of 36 h −1 kpc. The initial conditions were generated by displacing particles from a regular grid using second order Lagrangian perturbation theory at z = 75 where the rms displacement is 10% of the mean interparticle spacing. This TreePM code has been compared to a number of other codes and shown to perform well for such simulations (Heitmann et al. 2008). Recently the code has been modified to use a hybrid MPI+OpenMP approach which is particularly efficient for modern clusters.
For each output we found dark matter halos using the FoF algorithm (Davis et al. 1985) with a linking length of 0.168 times the mean interparticle spacing. This partitions the particles into equivalence classes roughly bounded by isodensity contours of 100× the mean density. The position of the most-bound particle, the center of mass velocity and a random subset of the member particles are stored for each halo and used as input into the HOD modeling and mock catalogs. Throughout we use the sum of the masses of the particles linked by the FoF algorithm as our basic definition of halo mass, except when quoting M 180b gal in Section 4 where we use spherical overdensity (SO) masses to facilitate comparison with other work. Note we do not run an SO finder to define new groups. We use the FoF halo catalog, only computing a different mass for each FoF halo. In order to compute these SO masses we grow spheres outwards from the most bound particle in each FoF halo, stopping when the mean density of the enclosed material (including both halo and non-halo particles) is 180× the background density. The total enclosed mass we denote by M 180b .
All of the mock observational samples are assumed to be isoredshift, and "static" outputs are used as input to the modeling.
The assumption of non-evolving clustering over the relevant redshift range is theoretically expected for a highly biased population, and also borne out by our modeling (Section 4) and measurements (Section 3).
Once a set of HOD parameter values has been chosen, we populate each halo in a given simulation with mock "galaxies." The HOD provides the probabilities that a halo will contain a central galaxy and the number of satellites. The central galaxy is placed at the position of the most bound particle in the halo, and we randomly draw dark matter particles to represent the satellites, assuming that the satellite galaxies trace the mass profile within halos. This approach has the advantage of retaining any alignments between the halo material, the filamentary large-scale structure and the velocity field.
Since the observational geometry is in some cases highly elongated (Figure 3), we use volume remapping (Carlson & White 2010) on the periodic cubes to encompass many realizations of the sample within each box. The mock galaxies are then observed in a way analogous to the actual sample, with the completeness mask and redshift cuts applied to generate several hundred "mock surveys." (Overall we have 1500 mock surveys, divided into 900, 360, and 240 mock surveys of regions A, B, and C, respectively. However they are not all completely Figure 13. Distribution of χ 2 for w p from our mock catalogs (histograms) and from the data (vertical dashed lines) in regions A, B, and C. The χ 2 is computed for the measured or mock w p compared to the best-fitting HOD model using the covariance matrix computed from the mocks. The measurements we obtain are completely consistent with being drawn from the underlying HOD model, given the finite number of galaxies and observing geometry. independent as we have only ∼100 times as much volume in the simulations as in the largest region, C.) For technical reasons, and since it only affects the smallest scale w p point, we do not model fiber collisions. Instead we increase the errors for that point by the square root of the ratio of the pair counts in the photometric sample to that in the spectroscopic sample (i.e., the same correction applied to the data-data pairs in computing ξ (R, Z)). This correction is appropriate in the limit that the error is dominated by Poisson counting statistics.
The covariance matrices for the clustering statistics are obtained from the mocks, and the entire procedure (reconstructing the best fit with the new covariance matrix, recomputing the mock catalogs and recomputing the clustering) is iterated until convergence. Given a reasonable starting HOD, the procedure converges within two or three steps.
Over the range of scales probed in this paper the correlation function is quite well constrained and we find the distribution of w p values in the mocks is well fit by a Gaussian at each R. This suggests we are able to use a Gaussian form for the likelihood, which is backed up by the distribution of χ 2 values seen in Figure 13.