Cervical cancer is the second most common cancer type in women around the world. One contributing factor is that it is often detected at late stages due to non-existent or inadequate screening, and the standard treatment options are often absent or unaffordable [1]. Cervical cancer develops in the cervical transformation zone and is most often caused by infection with certain types of Human Papillomavirus (HPV). It is curable if detected early and treated appropriately. However, detecting this disease at the pre-cancerous stage remains a challenging task [2].
After abnormal screening results, the prevention of cervical cancer depends on the destruction or excision of the entire transformation zone epithelium, not specific precancerous lesions. This method is effective in about 80% to 95% of the cases [3, 4]. Cervical cancer is a deadly disease that could greatly benefit from early detection approaches [5]. If detected early, the prognosis for patients is excellent and the survival rate improves. Early detection can be achieved by non-invasive methods that preserve vital organ function, resulting in a better quality of life for patients [6].
Chromosome instability has been gaining interest because it is a central process in the development of cancer cells. This instability indicates that the cell contains an abnormal amount of DNA, a condition known as aneuploidy. This cellular abnormality has been associated with tumorigenesis. Ploidy analysis has long been a promising and economically viable solution to facilitate early cancer detection. However, this approach has not been widely adopted in clinical routine [7]. Nowadays, biopsy is the gold standard in the detection of cancer/pre-cancer lesions and, because it is an invasive method, it is recommended only in extreme cases [8].
Ploidy analysis is usually done through cytological exams, which consist of visually inspecting the nuclei in search of morphological alterations. This manual evaluation of microscopic images is subject to variations in the perception and level of expertise of the cytologists, making the process prone to human error. The slow processing time of manual analysis is another reason for using computational methods [9]. To overcome these limitations, computer-assisted analysis can be used to measure cytological alterations and indicate the cellular status, helping in early cancer diagnosis [10, 11].
Advances in image analysis and machine learning have made available more robust algorithms for extracting information from data [12]. Recently, deep learning methods have achieved good performance in various computational tasks, proving effective for extracting features from data in different settings [13]. Recent efforts to use deep learning approaches in genomics and biomedical applications show their flexibility for handling complex problems. Therefore, using deep learning methods to analyze cytometry data is a very promising approach [9].
MACARINI, Luiz A. B. et al. Towards a Complete Pipeline for Segmenting Nuclei in Feulgen-Stained Images: Anais do Computer on the Beach. In COMPUTER ON THE BEACH (COTB’20), 11., 2020, Balneário Camboriú. Anais... Balneário Camboriú: Universidade do Vale do Itajaí, 2020, p. 1-7.
In this context, we present a complete pipeline for cell nuclei segmentation in Feulgen-stained slides using image processing and deep learning methods. More specifically, we take the whole slide images, generate patches from them, use a U-Net with ResNet34 as the backbone for semantic segmentation and post-process the results with simple morphological operations. Our main goal is to provide an overview of the entire process, offering insights into the early detection of possible cases of cervical cancer.
The rest of the paper is organized as follows: in Section 2 we present the related works. In Section 3 we introduce the methodology, giving details about each step of the pipeline. In Section 4 the results are presented, together with the discussion. Closing this work, in Section 5, we present the final remarks and future works.
Medical images combined with the powerful tools provided by deep learning open up an opportunity to bring advances in computer-assisted diagnosis. In the literature, many researchers have combined these tools to propose novel solutions, and some of them are presented below.
In [14] the authors presented a method to classify cervical cells without prior segmentation based on deep features, using convolutional neural networks. The neural network is trained with a natural dataset and fine-tuned on a cervical cell dataset. Also on cervical cells, in [15] the authors combined a fully convolutional network with a graph-based approach for segmentation of nuclei in cervical images. The network was used to learn the nuclei label mask and nuclei probabilistic map. It is formulated into the graph cost function in addition to the properties of the nuclei border and nuclei region. The information of the nuclei-cytoplasm position is used to modify the cost functions, where the optimal path needs to be found.
A method to segment cervical cells from Pap smear images with the goal of early detection of cancer was proposed in [2]. Deep convolutional neural networks were used to learn diverse cell appearance features. High-level shape information was also used to guide segmentation. In [16] the authors propose a cervical cancer cell segmentation method based on superpixels and a convolutional neural network. The method first segments the cytoplasm, since its contrast with the background is sometimes poor. From this, the cell nuclei are segmented and further refined. A method for segmenting overlapping cells in Pap smear images was proposed in [17]. The authors define the problem as a discrete labeling task for multiple cells. A coarse labeling result is then used to initialize a dynamic multiple-template deformation model for further boundary refinement on each cell. Further, multiple-scale deep convolutional networks were trained to learn cell features. High-level shape information was used to help with segmentation where the cells’ boundaries are noisy or lost due to touching and overlapping cells.
A fully convolutional neural network for individual nuclei segmentation was presented in [18]. The network outputs a nuclei map and a boundary map, and a parameter-free post-processing step is applied to the estimated nuclei map. In [19] the authors proposed a method that can be used for cell segmentation, detection, and tracking. The approach has two stages. In the first, a convolutional neural network outputs the cells' bounding boxes along with their scores. In the second, another Convolutional Neural Network (CNN) uses the proposed bounding boxes to predict segmentation masks for the cells.
In [20] the authors proposed a deep contour-aware network for nuclei detection and segmentation. They use a multi-level contextual feature based on a fully convolutional neural network to deal with large appearance variations. The proposed method outputs accurate probability maps and presents the contours simultaneously in a very clear way, separating clustered object instances. A novel method consisting of CNN based structured regression model to segment cells was presented in [21]. The method was able to handle touching cells, inhomogeneous background noises and large variations in sizes and shapes. Instead of providing a single class label, the method generates structured outputs referenced as proximity patches, where the maximum positions indicate the cell centroids.
The authors of [22] proposed an automated cell segmentation method for breast cancer histopathology images. The procedure has three stages. In the first, sparse reconstruction was applied to remove the background and enhance the nuclei. Then, a CNN was applied to segment the cell nuclei from the background. Finally, morphological operations were used to improve segmentation performance. The approach presented a pixel-wise segmentation accuracy better than or equivalent to state-of-the-art algorithms. Also on automatic nuclei segmentation, in [23] a CNN was used to generate a probability map, from which an iterative region merging approach was performed for shape initialization. Then, a novel segmentation algorithm was presented to separate individual nuclei, combining a robust selection-based sparse shape model and a local repulsive deformable model.
Regarding classical methods, in [24] the authors proposed an approach that first segments the cell clumps from the cervical smear image and then, detects the nuclei in each cell clump. For an accurate segmentation of nuclei in each clump, the authors also proposed a modified Otsu method with prior class probability. The contour around each nucleus was evolved until it finds the cytoplasm boundaries by the use of distance regularized level set evolution. Also dealing with overlapping cells, in [25] the authors proposed an automatic segmentation method for cervical cells in microscopic images. The method was divided into three steps, where in the first, the cell masses were detected by superpixel generation and triangle thresholding. Secondly, nuclei of cells are extracted by local thresholding and outlier removal. Finally, the cell cytoplasm was segmented by superpixel partitioning and refined by cell-wise contour refinement with graph cuts. The authors in [26] proposed a method using Gaussian Mixture Models to segment cell regions to identify cellular features such as nuclei and cytoplasm. This method was combined with the shape-based identification of nuclei to increase the accuracy of segmentation. This allows the algorithm to trace the cells and nuclei contours from the pap-smear images that contain cell clusters. This approach also deals with inconsistent staining.
In this section, the methodology is explained in detail. Fig. 1 shows an overview of the entire process. The first step consists of obtaining the image file of the scanned slide and submitting it to the corresponding software to generate the image patches (cropping process) that will be used to train the network. Training the network for semantic segmentation is the second step, for which we used the fast.ai framework. The post-processing step consists of using image processing methods to improve the outcome of the network. The last step is the evaluation of the results, which was done on the post-processed images using the Intersection over Union (IoU) metric. These steps are explained in the next subsections.
3.1 Dataset Generation
3.1.1 Samples Collection. The images come from exams performed on women who were attended at the gynecology and colposcopy outpatient clinic of the University Hospital Professor Polydoro Ernani de São Thiago of the Federal University of Santa Catarina (HU-UFSC). These women, presenting cytological alterations in the oncotic cytology of previous exams, were referred to the HU-UFSC for a gynecological exam, colposcopy and biopsy. The cytological samples were collected, prepared, processed and stained with Papanicolaou stain. After cytological analysis, the slides were immersed in xylol until complete removal of the coverslips for subsequent Feulgen staining. The Feulgen reaction is a widely used cytohistochemical reaction in which the staining intensity is proportional to the DNA concentration. It is mostly used for DNA quantification in cell nuclei through image cytometry, for the purpose of ploidy evaluation [28]. For the acquisition step, a ZEISS Axio Scanner.Z1 with a Hitachi HV-F202SCL as the imaging device was used. The exposure time was 200, with a depth of focus of 1
and a light source intensity of 246%.
3.1.2 Images pre-processing. The patches (i.e. tiles) are generated by cropping the whole slide images using the Zen Software 2.6 (Blue Edition). This patch generation is necessary since the whole slide image resolution is too high to be used in network training (about 209000 × 148000 pixels). The process consists of opening the CZI (Carl Zeiss Image Data File) slide file in the software and exporting the data into tiles, which generates sub-images with a resolution of 1200 × 1600 pixels. From now on, we will refer to these "patches" simply as "images".
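The cropping itself was done in the Zen software, so the following is only a minimal sketch of the underlying idea, not the tool's implementation. The function name `tile_boxes` and the choice to clip edge tiles to the image bounds are assumptions made for illustration.

```python
def tile_boxes(width, height, tile_w, tile_h):
    """Yield (left, top, right, bottom) boxes covering a width x height image.

    Tiles on the right and bottom edges are clipped to the image bounds
    (an assumption; an export tool may pad or drop partial tiles instead).
    """
    for top in range(0, height, tile_h):
        for left in range(0, width, tile_w):
            yield (left, top, min(left + tile_w, width), min(top + tile_h, height))


# Toy example: a 5 x 4 image cut into 2 x 2 tiles.
boxes = list(tile_boxes(width=5, height=4, tile_w=2, tile_h=2))
for box in boxes:
    print(box)
```

Each box can then be used to crop one sub-image from the full-resolution slide without ever loading more than one tile into the training pipeline at a time.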
These nuclei need to be labeled for network training. So, specialists labeled the nuclei on each image using the labelme [29] software. This software generates a JSON file with the points of the surrounding polygon that was created. The software itself provides the script to convert from JSON to Pascal VOC format. Fig. 2 shows an image and its corresponding mask.
Figure 2: An example of an input image and its corresponding mask
3.2 Network Training
The dataset contains 4753 images, divided into 70% for training (3327 images), 15% for validation (713 images) and 15% for testing (713 images). A U-Net [27] with ResNet34 [30] as backbone was trained with a weight decay of 1 and the IoU metric as the performance measure during training. We chose a U-Net since it was created for biomedical image segmentation and has shown good results in applications throughout the literature.
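The split sizes can be reproduced with a short sketch. The helper `split_indices`, the fixed seed and the rounding convention (truncate the training share, round the validation share, give the remainder to testing) are assumptions chosen so that the counts match those reported; the text does not state how the split was actually drawn.

```python
import random


def split_indices(n_items, train_frac=0.70, valid_frac=0.15, seed=0):
    """Shuffle item indices and split them into train/validation/test lists."""
    indices = list(range(n_items))
    random.Random(seed).shuffle(indices)  # seeded for reproducibility
    n_train = int(n_items * train_frac)   # truncated training share
    n_valid = round(n_items * valid_frac) # rounded validation share
    train = indices[:n_train]
    valid = indices[n_train:n_train + n_valid]
    test = indices[n_train + n_valid:]    # whatever remains
    return train, valid, test


train, valid, test = split_indices(4753)
print(len(train), len(valid), len(test))
```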
The data augmentation applied to the images consisted of a random flip with a probability of 0.5, a random rotation between -10 and 10 degrees with probability p_affine, a random zoom between 1 and 1.1 with probability p_affine, and a random symmetric warp of magnitude between -0.2 and 0.2 with probability p_affine. Each lighting transform was applied with a probability of 0.75 and each affine transform with a probability of 0.2. Finally, normalization using imagenet_stats was also applied.
Figure 1: Overview of the proposed pipeline. The U-Net Model image was adapted from [27].
The training strategy was divided into three stages, and each stage was divided into two parts, as shown in Fig. 3. From one stage to the next, the image size was increased by a factor of 2. In other words, in the first stage, the network was trained with the images at 1/4 of their original size. In the second stage, training continues with the images at 1/2 of their original size and, lastly, the training is completed using the images at their original size. This strategy is based on the idea that, by increasing the size of the images, we have a whole new dataset. This size increase is similar to data augmentation, which helps to avoid overfitting [31].
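As a small illustration of the three-stage schedule, the image sizes used in each stage follow directly from the full tile size. The helper `stage_sizes` is a hypothetical name, and integer division is an assumption that happens to be exact for 1200 × 1600 tiles.

```python
def stage_sizes(full_size=(1200, 1600), factors=(4, 2, 1)):
    """Return the image size for each training stage.

    Each stage divides both dimensions of the full-size tile by its factor.
    """
    return [(full_size[0] // f, full_size[1] // f) for f in factors]


sizes = stage_sizes()
for stage, size in enumerate(sizes, start=1):
    print(f"stage {stage}: {size[0]} x {size[1]}")
```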
Figure 3: Overview of the training process strategy
In the first part of each stage, the network was trained for five iterations using the fit_one_cycle method, which implements the 1cycle policy [32]. The learning rate given to this method comes from a range test: the model is trained while the learning rate is increased from a very low value to a very large one, stopping when the loss starts to diverge. The losses are then plotted against the learning rates, and the chosen value is the one placed a bit before the minimum loss, where the loss can still improve. The implementation provided by fast.ai uses the same data augmentation as the original paper, with one small tweak: the padded pixels are not filled with black; instead, reflection padding is used [33].
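To make the shape of a one-cycle schedule concrete, here is a simplified sketch: the learning rate warms up linearly to a peak and is then annealed with a cosine curve. This is not the fast.ai implementation; the function `one_cycle_lr`, the values `div_factor=25.0` and `pct_start=0.3`, and the omission of the accompanying momentum schedule are all assumptions made for illustration.

```python
import math


def one_cycle_lr(step, total_steps, lr_max, div_factor=25.0, pct_start=0.3):
    """Learning rate at `step` (0-based) for a simplified one-cycle schedule."""
    warmup_steps = max(1, int(total_steps * pct_start))
    lr_start = lr_max / div_factor
    if step < warmup_steps:
        # Linear warm-up from lr_start towards lr_max.
        return lr_start + (lr_max - lr_start) * step / warmup_steps
    # Cosine annealing from lr_max down towards zero.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return lr_max * 0.5 * (1.0 + math.cos(math.pi * progress))


# Hypothetical peak learning rate, chosen only for the demonstration.
lrs = [one_cycle_lr(s, total_steps=100, lr_max=1e-3) for s in range(100)]
print(min(lrs), max(lrs))
```

The peak value passed as `lr_max` is the quantity that a learning rate range test is meant to provide.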
The batch size in each stage was constrained by technical limitations: we tried several values, beginning with a large one and decreasing it by a factor of 2 (32, 16, 8, ...) until the data fit in the Graphics Processing Unit (GPU) memory. The learning rate for each stage was found by the lr_find method, which tries values from 1
to 10 and stops when the loss diverges. This method is provided by fast.ai and discussed in [34]. The learning rate has to be chosen right before the best loss is achieved, since this leaves room to improve it during training [31].
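The batch size search described above can be sketched as a simple halving loop. The predicate `fits_in_memory` is hypothetical: in practice it would attempt one training step and report whether the GPU ran out of memory, whereas here it is replaced by a stand-in lambda.

```python
def largest_fitting_batch_size(fits_in_memory, start=32):
    """Halve the batch size from `start` until `fits_in_memory` accepts it."""
    batch_size = start
    while batch_size >= 1:
        if fits_in_memory(batch_size):
            return batch_size
        batch_size //= 2
    raise RuntimeError("even a batch size of 1 does not fit in memory")


# Stand-in predicate: pretend that at most 5 images fit at once.
chosen = largest_fitting_batch_size(lambda b: b <= 5)
print(chosen)
```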
In the first part of each stage, only the final layers of the network were trained. In the second part, we therefore used the unfreeze method to set every layer group as trainable. The fit_one_cycle method was again used, this time to train the network for 10 iterations with varying learning rates, using the value found by the lr_find method in the previous step as reference. This process was repeated three times, once per stage, with the last stage using the images at their original size.
3.3 Post-Processing
The trained model was used to perform the prediction on the test dataset. fast.ai returns the result as a mask in which the pixel intensity denotes the probability that each pixel belongs to a particular class. An example is shown in Fig. 4. To improve the network’s outcome, we performed a simple post-processing step to remove noise and generate another image for visualization purposes.
We read the output mask (Fig. 4) from the network and convert it to grayscale (pixel values from 0 to 255). A Gaussian blur with a square kernel is then applied, followed by a threshold operation. Next, we perform an erosion and then an opening (erosion followed by dilation), both with a square kernel. The result is an image similar to the one shown in Fig. 5. This mask is used in the network evaluation process (IoU computation).
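The thresholding and morphological steps can be illustrated with a dependency-free sketch on a toy image. It is not the code used in this work: the Gaussian blur is omitted, the kernels are plain 3 × 3 squares, pixels outside the image are ignored at the borders, and all function names are assumptions. In practice an image processing library would be used for these operations.

```python
def threshold(gray, value):
    """Binarize a grayscale image (list of rows): 1 where pixel > value."""
    return [[1 if px > value else 0 for px in row] for row in gray]


def _window(mask, r, c, radius):
    """Values in the square window centered at (r, c), clipped to the image."""
    rows, cols = len(mask), len(mask[0])
    return [mask[i][j]
            for i in range(max(0, r - radius), min(rows, r + radius + 1))
            for j in range(max(0, c - radius), min(cols, c + radius + 1))]


def erode(mask, ksize=3):
    """A pixel stays 1 only if every pixel in its window is 1."""
    radius = ksize // 2
    return [[min(_window(mask, r, c, radius)) for c in range(len(mask[0]))]
            for r in range(len(mask))]


def dilate(mask, ksize=3):
    """A pixel becomes 1 if any pixel in its window is 1."""
    radius = ksize // 2
    return [[max(_window(mask, r, c, radius)) for c in range(len(mask[0]))]
            for r in range(len(mask))]


def postprocess(gray, thresh=230, erode_ksize=3, open_ksize=3):
    """Threshold, erode, then open (erosion followed by dilation)."""
    mask = erode(threshold(gray, thresh), erode_ksize)
    return dilate(erode(mask, open_ksize), open_ksize)


# Toy 7 x 9 image: one 5 x 5 bright blob plus an isolated noisy pixel.
gray = [[0] * 9 for _ in range(7)]
for r in range(1, 6):
    for c in range(1, 6):
        gray[r][c] = 255
gray[3][7] = 255

result = postprocess(gray)
for row in result:
    print("".join("#" if px else "." for px in row))
```

The isolated pixel disappears while the core of the blob survives, which is the noise-removal behavior the post-processing step relies on.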
Figure 4: The output of the network performed on the test set
Figure 5: Resulting mask of post-processing step
To better visualize the results, we use the Canny operator [35] to extract only the edges of the mask. These edges are overlaid on the original image, producing something similar to the image shown in Fig. 6.
Figure 6: Overlapping image created for visualization purposes
3.4 Evaluation
We evaluated the trained model on the test dataset using the IoU metric, which quantifies the overlap between the ground truth and the prediction output by the network. This metric is related to the Dice coefficient, commonly used as a loss function during network training. The IoU is computed as the number of pixels common to the target and prediction masks divided by the total number of pixels present across both masks [36], as shown in Equation 1, where $T$ and $P$ denote the sets of nucleus pixels in the target and prediction masks:
$$\mathrm{IoU} = \frac{|T \cap P|}{|T \cup P|} \qquad (1)$$
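A minimal sketch of this computation on binary masks follows. The function name `iou` and the convention of returning 1.0 when both masks are empty are assumptions; the text does not say how that case was handled.

```python
def iou(target, prediction):
    """Intersection over Union of two binary masks given as lists of rows."""
    intersection = 0
    union = 0
    for t_row, p_row in zip(target, prediction):
        for t, p in zip(t_row, p_row):
            intersection += 1 if (t and p) else 0
            union += 1 if (t or p) else 0
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


target = [[1, 1, 0],
          [0, 1, 0]]
prediction = [[1, 0, 0],
              [0, 1, 1]]
score = iou(target, prediction)
print(score)
```

Here two pixels are shared and four pixels are set in at least one mask, so the score is 2 / 4.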
In order to illustrate this, Fig. 7 shows an example of IoU between the ground truth and the prediction of the network. The green pixels show the intersection between ground truth and prediction, and the blue pixels show the union between them.
Figure 7: Overlapping between the intersection (green) and the union (blue) used to calculate the IoU
The network was trained on a machine with an Intel Core i7-7700 CPU @ 3.60GHz × 8, 32 GiB of RAM and a GeForce GTX 1080 Ti GPU, running Ubuntu 18.04.3 LTS 64-bit. The fast.ai 1.0.57 framework was used for semantic segmentation, and all scripts were coded in Python 3.
Initially, the network was trained with images of 300 × 400 pixels (a quarter of their original size), a batch size of 16 and a learning rate of 1
. In the second stage of training, the images were resized to half of their original size (600 × 800 pixels), with a batch size of 4; the learning rate found was 1
. The last stage consists of repeating the same process with the images at their original size (1200 × 1600 pixels). The learning rate found was 1
and the batch size was 1. The parameters for the post-processing step were defined experimentally, with an expert visually inspecting the results. We used a 5 × 5 kernel for the Gaussian blur, a 3 × 3 elliptical kernel for the erosion, a 15 × 15 elliptical kernel for the opening and a threshold value of 230.
The final training loss was 0.011802 and the validation loss was 0.012477. The IoU was computed for each image, and the values were averaged to obtain the overall IoU. In this work, we obtained an average IoU of 0.7841 for the 713 test images. The entire training process took about 885 minutes (14h45m). The testing step was performed on the GPU and took 5 minutes and 22 seconds, or about 0.452 seconds per image. The post-processing step was performed on the CPU and took 1 minute and 14 seconds, or about 0.104 seconds per image.
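The per-image times follow from the reported totals by simple division, as the short check below shows. The helper name `seconds_per_image` is an assumption; the numbers are those given in the text.

```python
def seconds_per_image(minutes, seconds, n_images):
    """Average processing time per image, in seconds."""
    return (minutes * 60 + seconds) / n_images


test_time = seconds_per_image(5, 22, 713)   # inference on the GPU
post_time = seconds_per_image(1, 14, 713)   # post-processing on the CPU
train_hours, train_minutes = divmod(885, 60)

print(round(test_time, 3), round(post_time, 3))
print(f"{train_hours}h{train_minutes}m")
```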
Semantic segmentation using a CNN proved to be robust, since it was capable of detecting and segmenting even some of the blurred cells. The neural network performed well using only RGB images, without any additional pre-processing. The fact that it is possible to use an entire image as input to a CNN is one of the main advantages of this method.
One of the biggest drawbacks of the semantic segmentation process (and of deep learning in general) is the need for a large number of samples to train a model. Even though a whole slide image can be used to generate many samples, a significant number of hours is needed to label all the nuclei. However, it is important to take into account that a CNN works like a general-purpose feature extractor. In other words, by adjusting the network weights using backpropagation, complex filters can be generated and used to detect complex patterns [37]. In contrast, developing a hand-engineered feature extractor could be a difficult process, given the nature and complexity of the problem. Classical approaches found in the literature have proven less effective than deep learning models, since the latter perform automatic feature extraction [38]. This characteristic fits well in this application.
Training a deep learning model usually takes a large amount of time, especially compared with traditional machine learning methods. However, this training step needs to be done only once. Also, the testing step is fast and parallelizable, and can be performed in real time [38]. Here, even though the training time was high, the testing time was small, about 0.452 seconds per tile. Usually, a microscopy slide like the one used in this work takes about 40 minutes (it can vary from 40 minutes to 1h30m, depending on the number of cells) to be analyzed by a professional. Using a CNN, this process can be done in minutes. The post-processing step, used to improve the outcome of the network, took about 0.10 seconds per image. This time should decrease, since this step can be parallelized on a GPU. The main drawback of this step is the need to define some parameters experimentally. However, once defined, they do not need to be set again.
Nowadays, deep learning is used to perform many image processing tasks that were not possible some years ago. Arguably it has reached maturity, with promising results that improve the state of the art in object recognition, object detection, and semantic segmentation. In this work, we presented a complete pipeline using a CNN for semantic segmentation with the goal of segmenting nuclei in Feulgen-stained images.
The obtained results validated the use of CNNs to segment individual nuclei, even in some partially blurred images. This approach can be used as the first step in a nuclei classification experiment aiming to find aneuploidies. Even though this approach achieved good results, it is hard to compare it with other works in the literature because each paper uses different pre-processing techniques, metrics, models, parameters, and datasets. Furthermore, as far as we know, there is no similar publicly available dataset to which we could apply our methodology and compare the results.
The training time was quite high, but can be considered reasonably good when compared with other applications of deep learning found on literature. The necessary time to export and load the trained model, and to process the images was small. It is also necessary to point out that, in our case, the application doesn’t need to be performed in real-time.
Data plays a central role in machine learning, and even more on deep learning. So, we believe that extending the size (number of samples) of the dataset to cover most cases should help in improving the outcome of the network since deep learning methods need larger datasets. One drawback of our work is the necessity of defining parameters on the post-processing step, which influences the results. However, with a small number of experiments, good results could be achieved by simply looking at the segmentation result.
In summary, the main contribution of this work is a pipeline for nuclei segmentation using semantic segmentation techniques, which works as a starting point for finding aneuploidy and is not restricted to Feulgen-stained images. We also present a simple, yet effective, post-processing algorithm that improves the results by removing some small imperfections of the segmentation process. Our results also show that the training strategy proposed by [31] works for this type of application, where the image resizing acts as a data augmentation method.
Future work includes automatically classifying the segmented nuclei as normal or not normal, aiming at the early detection of possible cases of cancer. We also plan to test other CNN architectures, apply image processing methods in the pre-processing step and try other post-processing methods. Some classical segmentation algorithms can be compared with the results obtained by semantic segmentation with CNNs. Other types of cell images (with other stains, for example) can be used to check the performance of the network.
We would like to thank Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES) for funding this work. Also, we would like to thank Dr. Adriane Pogere, for providing the samples and Ms. Ane Francyne Costa for all necessary assistance for data collection.
[1] Mark Schiffman, Philip E Castle, Jose Jeronimo, Ana C Rodriguez, and Sholom Wacholder. Human papillomavirus and cervical cancer. The Lancet, 370(9590): 890–907, 2007.
[2] Youyi Song, Ee-Leng Tan, Xudong Jiang, Jie-Zhi Cheng, Dong Ni, Siping Chen, Baiying Lei, and Tianfu Wang. Accurate cervical cell segmentation from overlapping clumps in pap smear images. IEEE transactions on medical imaging, 36(1): 288–300, 2016.
[3] SP Dobbs, T Asmussen, D Nunns, J Hollingworth, LJR Brown, and D Ireland. Does histological incomplete excision of cervical intraepithelial neoplasia following large loop excision of transformation zone increase recurrence rates? a six year cytological follow up. BJOG: An International Journal of Obstetrics & Gynaecology, 107(10):1298–1301, 2000.
[4] M Arbyn, E Paraskevaidis, P Martin-Hirsch, W Prendiville, and Joakim Dillner. Clinical utility of hpv–dna detection: triage of minor cervical lesions, follow-up of women treated for high-grade cin: an update of pooled evidence. Gynecologic oncology, 99(3):S7–S11, 2005.
[5] Shannon E Weigum, Pierre N Floriano, Spencer W Redding, Chih-Ko Yeh, Stephen D Westbrook, H Stan McGuff, Alan Lin, Frank R Miller, Fred Villarreal, Stephanie D Rowan, et al. Nano-bio-chip sensor platform for examination of oral exfoliative cytology. Cancer Prevention Research, 3(4):518–528, 2010.
[6] TY Seiwert and EEW Cohen. State-of-the-art management of locally advanced head and neck cancer. British journal of cancer, 92(8):1341, 2005.
[7] Håvard E Danielsen, Manohar Pradhan, and Marco Novelli. Revisiting tumour aneuploidy - the place of ploidy assessment in the molecular era. Nature Reviews Clinical Oncology, 13(5):291, 2016.
[8] Saunak Chatterjee, Debaleena Nawn, Mousumi Mandal, Jyotirmoy Chatterjee, Somdutta Mitra, Mousumi Pal, and Ranjan Rashmi Paul. Augmentation of statistical features in cytopathology towards computer aided diagnosis of oral precancer/cancer. In 2018 Fourth International Conference on Biosignals, Images and Instrumentation (ICBSII), pages 206–212. IEEE, 2018.
[9] Huamin Li, Uri Shaham, Kelly P Stanton, Yi Yao, Ruth R Montgomery, and Yuval Kluger. Gating mass cytometry data by deep learning. Bioinformatics, 33(21): 3423–3430, 2017.
[10] Ravi Mehrotra, Sanjay Mishra, Mamta Singh, and Mangal Singh. The efficacy of oral brush biopsy with computer-assisted analysis in identifying precancerous and cancerous lesions. Head & neck oncology, 3(1):39, 2011.
[11] Dongsuk Shin, Nadarajah Vigneswaran, Ann Gillenwater, and Rebecca Richards-Kortum. Advances in fluorescence imaging techniques to detect oral cancer and its precursors. Future oncology, 6(7):1143–1154, 2010.
[12] Melih Kandemir and Fred A Hamprecht. Computer-aided diagnosis from weak supervision: A benchmarking study. Computerized medical imaging and graphics, 42:44–50, 2015.
[13] Yann LeCun, Yoshua Bengio, and Geoffrey Hinton. Deep learning. nature, 521 (7553):436, 2015.
[14] Ling Zhang, Le Lu, Isabella Nogues, Ronald M Summers, Shaoxiong Liu, and Jianhua Yao. Deeppap: deep convolutional networks for cervical cell classification. IEEE journal of biomedical and health informatics, 21(6):1633–1643, 2017.
[15] Ling Zhang, Milan Sonka, Le Lu, Ronald M Summers, and Jianhua Yao. Combining fully convolutional networks and graph-based approach for automated segmentation of cervical cell nuclei. In 2017 IEEE 14th international symposium on biomedical imaging (ISBI 2017), pages 406–409. IEEE, 2017.
[16] Youyi Song, Ling Zhang, Siping Chen, Dong Ni, Baopu Li, Yongjing Zhou, Baiying Lei, and Tianfu Wang. A deep learning based framework for accurate segmentation of cervical cytoplasm and nuclei. In 2014 36th Annual International Conference of the IEEE Engineering in Medicine and Biology Society, pages 2903–2906. IEEE, 2014.
[17] Youyi Song, Jie-Zhi Cheng, Dong Ni, Siping Chen, Baiying Lei, and Tianfu Wang. Segmenting overlapping cervical cell in pap smear images. In 2016 IEEE 13th International Symposium on Biomedical Imaging (ISBI), pages 1159–1162. IEEE, 2016.
[18] Yuxin Cui, Guiying Zhang, Zhonghao Liu, Zheng Xiong, and Jianjun Hu. A deep learning algorithm for one-step contour aware nuclei segmentation of histopathological images. arXiv preprint arXiv:1803.02786, 2018.
[19] Saad Ullah Akram, Juho Kannala, Lauri Eklund, and Janne Heikkilä. Cell segmentation proposal network for microscopy image analysis. In Deep Learning and Data Labeling for Medical Applications, pages 21–29. Springer, 2016.
[20] Hao Chen, Xiaojuan Qi, Lequan Yu, Qi Dou, Jing Qin, and Pheng-Ann Heng. Dcan: Deep contour-aware networks for object instance segmentation from histology images. Medical image analysis, 36:135–146, 2017.
[21] Yuanpu Xie, Fuyong Xing, Xiangfei Kong, Hai Su, and Lin Yang. Beyond classification: structured regression for robust cell detection using convolutional neural network. In International Conference on Medical Image Computing and Computer-Assisted Intervention, pages 358–365. Springer, 2015.
[22] Xipeng Pan, Lingqiao Li, Huihua Yang, Zhenbing Liu, Jinxin Yang, Lingling Zhao, and Yongxian Fan. Accurate segmentation of nuclei in pathological images via sparse reconstruction and deep convolutional networks. Neurocomputing, 229: 88–99, 2017.
[23] Fuyong Xing, Yuanpu Xie, and Lin Yang. An automatic learning-based framework for robust nucleus segmentation. IEEE transactions on medical imaging, 35(2): 550–566, 2015.
[24] Pranav Kumar, SL Happy, Swarnadip Chatterjee, Debdoot Sheet, and Aurobinda Routray. An unsupervised approach for overlapping cervical cell cytoplasm segmentation. In 2016 IEEE EMBS Conference on Biomedical Engineering and Sciences (IECBES), pages 106–109. IEEE, 2016.
[25] Hansang Lee and Junmo Kim. Segmentation of overlapping cervical cells in microscopic images with superpixel partitioning and cell-wise contour refinement. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 63–69, 2016.
[26] Srikanth Ragothaman, Sridharakumar Narasimhan, Madivala G Basavaraj, and Rajan Dewar. Unsupervised segmentation of cervical cell images using gaussian mixture model. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition Workshops, pages 70–75, 2016.
[27] Olaf Ronneberger, Philipp Fischer, and Thomas Brox. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical image computing and computer-assisted intervention, pages 234–241. Springer, 2015.
[28] Pasquale Chieco and Massimo Derenzini. The feulgen reaction 75 years on. Histochemistry and cell biology, 111(5):345–358, 1999.
[29] Kentaro Wada. labelme: Image Polygonal Annotation with Python. https://github.com/wkentaro/labelme, 2016.
[30] Kaiming He, Xiangyu Zhang, Shaoqing Ren, and Jian Sun. Deep residual learning for image recognition. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 770–778, 2016.
[31] Jeremy Howard. Lesson 3: Deep learning 2019 - data blocks; multi-label classification; segmentation, jan 2019. URL https://www.youtube.com/watch?v=MpZxV6DVsmM.
[32] Leslie N. Smith. A disciplined approach to neural network hyper-parameters: Part 1 - learning rate, batch size, momentum, and weight decay. CoRR, abs/1803.09820, 2018. URL http://arxiv.org/abs/1803.09820.
[33] Sylvain Gugger. The 1cycle policy, apr 2018. URL https://sgugger.github.io/the-1cycle-policy.html.
[34] Leslie N. Smith. No more pesky learning rate guessing games. CoRR, abs/1506.01186, 2015. URL http://arxiv.org/abs/1506.01186.
[35] John Canny. A computational approach to edge detection. IEEE Transactions on pattern analysis and machine intelligence, (6):679–698, 1986.
[36] Jeremy Jordan. Evaluating image segmentation models, may 2018. URL https://www.jeremyjordan.me/evaluating-image-segmentation-models/.
[37] Alex Krizhevsky, Ilya Sutskever, and Geoffrey E Hinton. Imagenet classification with deep convolutional neural networks. In Advances in neural information processing systems, pages 1097–1105, 2012.
[38] Andreas Kamilaris and Francesc X Prenafeta-Boldú. Deep learning in agriculture: A survey. Computers and Electronics in Agriculture, 147:70–90, 2018.