Loading…

Deep learning of quasar spectra to discover and characterize damped Lyα systems

Abstract We have designed, developed, and applied a convolutional neural network (CNN) architecture using multi-task learning to search for and characterize strong H i Lyα absorption in quasar spectra. Without any explicit modelling of the quasar continuum or application of the predicted line profil...

Full description

Saved in:
Bibliographic Details
Published in:Monthly notices of the Royal Astronomical Society 2018-05, Vol.476 (1), p.1151-1168
Main Authors: Parks, David, Prochaska, J Xavier, Dong, Shawfeng, Cai, Zheng
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Abstract We have designed, developed, and applied a convolutional neural network (CNN) architecture using multi-task learning to search for and characterize strong H i Lyα absorption in quasar spectra. Without any explicit modelling of the quasar continuum or application of the predicted line profile for Lyα from quantum mechanics, our algorithm predicts the presence of strong H i absorption and estimates the corresponding redshift zabs and H i column density $N_{\rm H\,\small {I}}$, with emphasis on damped Lyα systems (DLAs, absorbers with $N_{\rm H\,\small {I}}\ge 2 \times 10^{20} \, {\rm cm^{-2}}$). We tuned the CNN model using a custom training set of DLAs injected into DLA-free quasar spectra from the Sloan Digital Sky Survey (SDSS), data release 5 (DR5). Testing on a held-back validation set demonstrates a high incidence of DLAs recovered by the algorithm (97.4 per cent as DLAs and 99 per cent as an H i absorber with $N_{\rm H\,\small {I}}> 10^{19.5} \, {\rm cm^{-2}}$) and excellent estimates for zabs and $N_{\rm H\,\small {I}}$. Similar results are obtained against a human-generated survey of the SDSS DR5 data set. The algorithm yields a low incidence of false positives and negatives but is challenged by overlapping DLAs and/or very high $N_{\rm H\,\small {I}}$ systems. We have applied this CNN model to the quasar spectra of SDSS DR7 and the Baryon Oscillation Spectroscopic Survey (data release 12) and provide catalogues of 4913 and 50 969 DLAs, respectively (including 1659 and 9230 high-confidence DLAs that were previously unpublished). This work validates the application of deep learning techniques to astronomical spectra for both classification and quantitative measurements.
ISSN:0035-8711
1365-2966
DOI:10.1093/mnras/sty196