
Awesome

Deep Learning for Biologists with Keras

By whom?

Yosuke Toda (tyosuke-at-aquaseerser.com)

JST PRESTO researcher at Nagoya University / Agri-Heir Co., Ltd.

What is it?

Note

To do

Notebooks Open

Keras Code Collection: How to use Keras

<img src = "assets/image-20190117153946252.png" width="150" ALIGN="left" /> image <br>

badge

A basic collection of Keras code: loading images, then building, training, evaluating, and using a CNN. Keras is quite flexible and offers multiple ways of writing code even for the simplest CNN, so I have written down all the patterns in one Colab notebook. Images of rice seeds were provided by Dr. S. Nishiuchi at Nagoya Univ. in 2016 (personal communication). However, the data itself is not discussed here. Going through the Rice Seed Integrity notebook below is recommended.
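As a rough illustration of what "multiple ways of writing" can look like, the sketch below builds the same tiny CNN twice, once with the Sequential API and once with the functional API, then trains, evaluates, and predicts. It is not taken from the notebook: the random arrays stand in for the rice seed images, and the layer sizes, the two-class setup, and the `tensorflow.keras` import path are all assumptions made for the example.

```python
import numpy as np
from tensorflow import keras
from tensorflow.keras import layers

# Hypothetical stand-in data: 32 random 64x64 RGB "images" with 2 classes.
rng = np.random.default_rng(0)
x = rng.random((32, 64, 64, 3)).astype("float32")
y = rng.integers(0, 2, size=32)

# Pattern 1: Sequential API, layers listed in order.
seq_model = keras.Sequential([
    keras.Input(shape=(64, 64, 3)),
    layers.Conv2D(8, 3, activation="relu"),
    layers.MaxPooling2D(),
    layers.Flatten(),
    layers.Dense(2, activation="softmax"),
])

# Pattern 2: functional API, same architecture wired tensor by tensor.
inputs = keras.Input(shape=(64, 64, 3))
h = layers.Conv2D(8, 3, activation="relu")(inputs)
h = layers.MaxPooling2D()(h)
h = layers.Flatten()(h)
outputs = layers.Dense(2, activation="softmax")(h)
func_model = keras.Model(inputs, outputs)

# Train, evaluate, and use both models in exactly the same way.
for model in (seq_model, func_model):
    model.compile(
        optimizer="adam",
        loss="sparse_categorical_crossentropy",  # integer class labels
        metrics=["accuracy"],
    )
    model.fit(x, y, epochs=1, batch_size=8, verbose=0)
    loss, acc = model.evaluate(x, y, verbose=0)
    probs = model.predict(x[:4], verbose=0)  # class probabilities per image
```

Both models end up with the same layers and parameter count; the functional form is the one that extends to branching architectures.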

badge Finalize with feedback

<br>

Rice Seed Integrity: What is deep learning.

<img src = "assets/image-20190115201428173.png" width="150" ALIGN="left" /> image <br>

badgebadge badge badge

An introductory notebook on deep learning based image analysis, comparing it with classical machine learning algorithms and manual image classification. The objective of this notebook is to give readers a hands-on implementation that answers two questions: What does "Representation Learning" actually mean? What is feature selection? Images of rice seeds were provided by Dr. S. Nishiuchi at Nagoya Univ. in 2016 (personal communication).

badge Refurbish code and comments

<br>

17 Flowers Dataset

<img src = "assets/image-20190115201017711.png" width="150" ALIGN="left" /> image <br>

badge badge badge badge

We will build a convolutional neural network (CNN) based classification model using a 17-category flower dataset provided by the team at the University of Oxford (http://www.robots.ox.ac.uk/~vgg/data/flowers/17/). The dataset provides 80 images per category. We will compare three training processes: training from scratch (de novo), transfer learning, and fine-tuning, the latter two of which start from a model pretrained on the ImageNet dataset. We will see that when training with relatively little data (for a CNN), pretraining has a great effect on the training speed and (occasionally) the accuracy of the model.
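The three training strategies differ only in where the weights start and which layers are allowed to update. The sketch below expresses that with one builder function. It is an illustration rather than the notebook's code: MobileNetV2 is an arbitrary choice of backbone, the 224x224 input size is assumed, and "transfer learning" is taken here to mean a frozen pretrained base with a new classifier head, while "fine-tuning" means the pretrained base is also trained.

```python
from tensorflow import keras
from tensorflow.keras import layers

NUM_CLASSES = 17  # flower categories in the dataset


def build_classifier(weights, train_base):
    """Build a 17-class classifier on top of a convolutional base.

    weights:    None for random initialization, "imagenet" for pretrained.
    train_base: whether the base layers are updated during training.
    """
    # Assumption: MobileNetV2 as the backbone; any keras.applications
    # model could be swapped in.
    base = keras.applications.MobileNetV2(
        weights=weights, include_top=False, input_shape=(224, 224, 3)
    )
    base.trainable = train_base

    inputs = keras.Input(shape=(224, 224, 3))
    features = base(inputs)
    pooled = layers.GlobalAveragePooling2D()(features)
    outputs = layers.Dense(NUM_CLASSES, activation="softmax")(pooled)
    return keras.Model(inputs, outputs)


# The three strategies as builder arguments (nothing is built here).
STRATEGIES = {
    "de_novo": {"weights": None, "train_base": True},
    "transfer_learning": {"weights": "imagenet", "train_base": False},
    "fine_tuning": {"weights": "imagenet", "train_base": True},
}
```

For example, `build_classifier(**STRATEGIES["transfer_learning"])` returns a model in which only the final Dense layer is trainable; the `"imagenet"` options download the pretrained weights on first use. When fine-tuning, a small learning rate is commonly used so that the pretrained features are not destroyed early in training.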

badge Refurbish code and comments

<br>

Crop/Weed Segmentation

<img src = "assets/image-20190115201227438.png" width="150" ALIGN="left" /> image <br>

badge badge

In this notebook, we will segment crop and weed regions in images taken by an autonomous field robot. The dataset is from Haug et al., "A Crop/Weed Field Image Dataset for the Evaluation of Computer Vision Based Precision Agriculture Tasks" (2015). With conventional approaches, we can probably isolate the weed and crop regions from the soil area using a color threshold in the green domain. However, how can we further distinguish the weed (red) region from the crop (green) region? Selecting features for that requires master craftsmanship. Instead, we will use DL, specifically semantic segmentation methods, to 1) isolate the vegetation regions from the soil, and 2) isolate and classify weed and crop regions. A neural network architecture named U-Net will be used here.
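To make the conventional baseline concrete, here is a minimal sketch of a green-domain threshold. It uses the excess green index (ExG = 2g - r - b on chromaticity-normalized channels), which is one common choice and not necessarily what the notebook does; the function name `vegetation_mask` and the threshold value are assumptions for illustration. Note that it can separate plants from soil but has no way to tell weeds from crops, which is the gap the segmentation network is meant to fill.

```python
import numpy as np


def vegetation_mask(rgb, threshold=0.1):
    """Return a boolean H x W mask that is True where a pixel looks green.

    rgb: array of shape (H, W, 3) with R, G, B in the last axis.
    threshold: hypothetical cutoff on the excess green index.
    """
    rgb = np.asarray(rgb, dtype=np.float64)
    total = rgb.sum(axis=-1, keepdims=True)
    total[total == 0] = 1.0  # avoid division by zero on black pixels
    r, g, b = np.moveaxis(rgb / total, -1, 0)  # chromaticity coordinates
    excess_green = 2 * g - r - b
    return excess_green > threshold


# Tiny made-up image: one leaf-like pixel and one soil-like pixel.
demo = np.array([[[40, 180, 30], [120, 90, 60]]], dtype=np.uint8)
mask = vegetation_mask(demo)
```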

badge Need to add commentary throughout the notebook.

<br>

Bamboo forest detection from Google Earth Images

<img src = "assets/image-20190120202734357.png" width="150" ALIGN="left" /> image <br>

Original paper from Watanabe et al., "Using deep learning for bamboo forest detection from Google Earth images" doi: https://doi.org/10.1101/351643.

The data was provided by Dr. Ise (personal communication). However, the images originate from Google Earth, so please confirm the copyright if you intend to use them for non-research purposes.

badge badge

badge Need to add commentary throughout the notebook and above.

<br>

Yeast GFP Protein Localization

<img src = "assets/image-20190115201711326.png" height="150px" ALIGN="left" /> image <br>

badge badge badge

Yet another classification task. The dataset is from the paper "Accurate Classification of Protein Subcellular Localization from High-Throughput Microscopy Images Using Deep Learning", Tanel Pärnamaa and Leopold Parts, G3: Genes, Genomes, Genetics, May 1, 2017, vol. 7, no. 5, 1385-1392. This dataset has a csv file that associates each file name with its class label, instead of placing the files in folders corresponding to their classes. Moreover, the dataset comprises 90,000 images, possibly too many to load into memory at once. Here we use the ImageDataGenerator class and its flow_from_dataframe function to feed batches of images during training. Note: flow_from_dataframe in the keras repository has a bug, so we instead install keras_preprocessing and use the up-to-date ImageDataGenerator (as of 19 Jan. 2019).
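The idea behind feeding batches from a label table can be sketched without Keras. The generator below reads (file name, label) rows from a csv and yields them one batch at a time, so only the current batch needs to be held in memory. This is a plain-Python illustration of the concept, not how `flow_from_dataframe` is implemented; the column names, file names, and class names are made up for the example, and a real pipeline would also load and preprocess the image for each file name.

```python
import csv
import io


def iter_batches(csv_file, batch_size,
                 filename_col="filename", label_col="label"):
    """Yield lists of (filename, label) pairs, batch_size rows at a time.

    csv_file: an open text file (or file-like object) with a header row.
    The column names are hypothetical defaults for this sketch.
    """
    reader = csv.DictReader(csv_file)
    batch = []
    for row in reader:
        batch.append((row[filename_col], row[label_col]))
        if len(batch) == batch_size:
            yield batch
            batch = []
    if batch:  # final, possibly smaller batch
        yield batch


# Made-up label table standing in for the dataset's csv file.
demo_csv = io.StringIO(
    "filename,label\n"
    "img_0.png,nucleus\n"
    "img_1.png,cytoplasm\n"
    "img_2.png,nucleus\n"
)
batches = list(iter_batches(demo_csv, batch_size=2))
```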

badge Need to add commentary throughout the notebook.

<br>

Others

Simulated ChIP-seq motif extraction

<img src = "assets/image-20190115202731524.png" height="150px" ALIGN="left" /> image <br>

badge badge badge badge

badge Need to add commentary throughout the notebook.

<br> <br> <br>

Updates

Notebooks To be Opened

<img src = "assets/image-20190115144920126.png" height="120px" ALIGN="left" /> -->