3-D convolutional neural networks for organ segmentation in medical images for radiotherapy planning

3-D convolutional neural networks for organ segmentation in medical images for radiotherapy planning
US11676281

Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for segmenting a medical image. In one aspect, a method comprises: receiving a medical image that is captured using a medical imaging modality and that depicts a region of tissue in a body; and processing the medical image using a segmentation neural network to generate a segmentation output. The segmentation neural network can include a sequence of multiple encoder blocks and a decoder subnetwork. training the segmentation neural network can include determining a set of error values for a segmentation channel; identifying the highest error values from the set of error values for the segmentation channel; and determining a segmentation loss based on the highest error values identified for the segmentation channel.

PTO Wrapper PDF
Dossier Espace Google

Patent 11676281
Priority Sep 10 2018
Filed Jul 20 2021
Issued Jun 13 2023
Expiry Nov 14 2039 TERM.DISCL. Extension 66 days
Inventors Hughes, Ci…
Assg.orig DeepMind T…
Assg.curr GOOGLE LLC
Entity Large
Referenced by 0
References 20
Maint.: currently ok

CROSS-REFERENCE TO R…
BACKGROUND
SUMMARY
BRIEF DESCRIPTION OF…
DETAILED DESCRIPTION

1. A method performed by one or more data processing apparatus, the method comprising:

receiving a medical image that is captured using a medical imaging modality and that depicts a region of tissue in a body;

processing the medical image using a segmentation neural network, in accordance with trained values of a plurality segmentation neural network parameters, to generate a segmentation output, wherein:

the segmentation output comprises a plurality of segmentation channels, each segmentation channel corresponds to a respective organ from a predetermined set of organs, and each segmentation channel defines a segmentation of the respective organ corresponding to the segmentation channel in the medical image;

a segmentation of a respective organ in the medical image comprises, for each of a plurality of voxels in the medical image, a respective score characterizing whether the voxel corresponds to an interior of the respective organ;

the segmentation neural network comprises a sequence of multiple encoder blocks, wherein:

each encoder block is a residual neural network block comprising one or more two-dimensional convolutional neural network layers, one or more three-dimensional convolutional neural network layers, or both;

each encoder block is configured to process a respective encoder block input to generate a respective encoder block output wherein a spatial resolution of the encoder block output is lower than a spatial resolution of the encoder block input; and

for each encoder block that is after an initial encoder block in the sequence of encoder blocks, the encoder block input comprises a previous encoder block output of a previous encoder block in the sequence of encoder blocks;

the segmentation neural network comprises a decoder subnetwork, wherein the decoder subnetwork is configured to process a decoder subnetwork input comprising an intermediate output of each encoder block to generate the segmentation output;

the decoder subnetwork comprises a final layer that is configured to process a final layer input to generate the segmentation output;

wherein the segmentation neural network has been trained by a plurality of operations comprising:

processing a training medical image using the segmentation neural network to generate a training segmentation output;

determining a segmentation loss for the training medical image, comprising:

for each segmentation channel of the training segmentation output:

determining a set of error values for the segmentation channel, wherein each error value in the set of error values for the segmentation channel corresponds to a respective voxel in the training medical image and is based on an error between: (i) the score from the segmentation channel which characterizes whether the voxel corresponds to the interior of the organ corresponding to the segmentation channel, and (ii) a target score defining whether the voxel corresponds to the interior of the organ corresponding to the segmentation channel; and

identifying a plurality of highest error values from the set of error values for the segmentation channel, wherein the plurality of highest error values are a proper subset of the set of error values for the segmentation channel; and

determining the segmentation loss based on the plurality of highest error values identified for each segmentation channel of the training segmentation output; and

adjusting current values of the plurality of segmentation neural network parameters of the segmentation neural network based on the segmentation loss for the training medical image.

17. One or more non-transitory computer storage media storing instructions that when executed by one or more computers cause the one or more computers to perform operations comprising: