Diary

Start from GAN
If all data are fed in, after enough iterations every output of the generator becomes a 1 (on the MNIST dataset), which is the simplest digit --> "the generator fools the discriminator with garbage"
Training a GAN for each class individually --> 1. the GAN structure suits some classes, but training on other classes leads to mode collapse; 2. it is not easy to select a model for each class

then to conditional GAN
similar structure, but with the one-hot label concatenated to the inputs of G and D (see the sketch below)
Advantage: no need to train a model for each class individually
Note: the learning rate is set to 0.001; 0.0001 leads to bad results
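A minimal sketch of this conditioning in Keras: the one-hot label is concatenated to the noise for G and to the flattened image for D, and D is compiled with the 0.001 learning rate noted above. The 256-unit hidden layers and flattened-image input are placeholders, not the exact model used here.

```python
from tensorflow.keras.layers import Input, Dense, Concatenate, LeakyReLU
from tensorflow.keras.models import Model
from tensorflow.keras.optimizers import Adam

latent_dim, num_classes, img_dim = 100, 10, 28 * 28  # MNIST, placeholder sizes

# Generator: noise + one-hot label -> flattened image
z = Input(shape=(latent_dim,))
g_label = Input(shape=(num_classes,))
g_h = Dense(256)(Concatenate()([z, g_label]))
g_h = LeakyReLU(0.2)(g_h)
g_out = Dense(img_dim, activation='tanh')(g_h)
G = Model([z, g_label], g_out)

# Discriminator: flattened image + one-hot label -> real/fake probability
x = Input(shape=(img_dim,))
d_label = Input(shape=(num_classes,))
d_h = Dense(256)(Concatenate()([x, d_label]))
d_h = LeakyReLU(0.2)(d_h)
d_out = Dense(1, activation='sigmoid')(d_h)
D = Model([x, d_label], d_out)

# learning rate 0.001 as noted above
D.compile(optimizer=Adam(learning_rate=0.001), loss='binary_crossentropy')
```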

then ACGAN
Current tests show that ACGAN does not work well with two dense layers; the reason might be that ACGAN only works with convolutional D and G (see the sketch below)
TODO: pretrain D
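For reference, a hedged sketch of what a convolutional ACGAN discriminator would look like (shared conv trunk, one real/fake head, one auxiliary class head); the filter counts and dropout rate are placeholders, not a tested configuration.

```python
from tensorflow.keras.layers import Input, Conv2D, LeakyReLU, Flatten, Dense, Dropout
from tensorflow.keras.models import Model

num_classes = 10  # MNIST

img = Input(shape=(28, 28, 1))
h = Conv2D(32, 3, strides=2, padding='same')(img)
h = LeakyReLU(0.2)(h)
h = Conv2D(64, 3, strides=2, padding='same')(h)
h = LeakyReLU(0.2)(h)
h = Flatten()(h)
h = Dropout(0.4)(h)

validity = Dense(1, activation='sigmoid', name='validity')(h)          # real / fake head
aux_class = Dense(num_classes, activation='softmax', name='class')(h)  # auxiliary classifier head

D = Model(img, [validity, aux_class])
D.compile(optimizer='adam',
          loss=['binary_crossentropy', 'categorical_crossentropy'])
```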

then Wasserstein GAN


  1. January
    refine the proposal

10-12. January

  • implement a DC classifier in preparation for implementing the discriminator
  • read Improved GAN; focus on this paper in the following days
  1. January
  • the DC classifier has no bugs, but performs awfully
  • install Theano and Lasagne to run the improvedGAN code
  1. - 19. January
  • finally installed Theano and its GPU backend correctly and fixed a lot of deprecation issues
  1. January
  • try to translate it to Keras; find a way to implement the loss function (see the sketch below)
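A possible starting point for the Keras translation of the loss, assuming the discriminator ends in K real-class logits (the Improved GAN formulation, where the K+1 "fake" class is implicit and D(x) = Z(x)/(Z(x)+1), with Z(x) the sum of the exponentiated logits). This is only a sketch with the Keras backend, not the original Theano code.

```python
from tensorflow.keras import backend as K

def d_loss_improved_gan(logits_lab, y_lab, logits_unl, logits_fake):
    """Discriminator loss in the style of Salimans et al. (2016).
    logits_lab / y_lab: labeled real batch, logits_unl: unlabeled real batch,
    logits_fake: batch of generated samples; all logits have shape (batch, K)."""
    # supervised part: cross-entropy over the K real classes
    loss_lab = K.mean(
        K.sparse_categorical_crossentropy(y_lab, logits_lab, from_logits=True))
    # unsupervised part: D(x) = Z(x) / (Z(x) + 1), log Z(x) = logsumexp(logits)
    log_z_unl = K.logsumexp(logits_unl, axis=1)
    log_z_fake = K.logsumexp(logits_fake, axis=1)
    loss_unl = -K.mean(log_z_unl) + K.mean(K.softplus(log_z_unl))  # -log D(x)
    loss_fake = K.mean(K.softplus(log_z_fake))                     # -log(1 - D(G(z)))
    return loss_lab + 0.5 * (loss_unl + loss_fake)
```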
  1. January
  • the translation to Keras is too complicated; first try PaviaU with the original Theano code
  • the 1D improved GAN performs too poorly when trained on PaviaU (maybe because of the training data; check the training and testing data and re-save them)
  1. January
  • prepare questions for tomorrow's meeting:
  • the loss function in the code does not match the loss in the paper, and the former has a very strange form
  • l_lab and train_err are the same thing
  • there is no implementation of the K+1 class
  1. February
  • for the 3D convolution, an idea: set stride=(1,1,2), which manipulates only the spectral dimension (see the sketch after this list)
  • try a semi-supervised GAN on MNIST: the discriminator classifies labeled samples into their classes and generated samples as class K+1; for unlabeled training data, set the label to [0.1, 0.1, 0.1, ..., 0] (uniform over the real classes, 0 for the fake class)
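A minimal sketch of the spectral-only stride idea, assuming a PaviaU-like patch input of shape (rows, cols, bands, 1); the 5x5 patch size, kernel size, and filter count are placeholders.

```python
from tensorflow.keras.layers import Input, Conv3D
from tensorflow.keras.models import Model

# PaviaU-like patch: 5x5 spatial window, 103 spectral bands, 1 input channel
patch = Input(shape=(5, 5, 103, 1))

# stride (1, 1, 2): keep the spatial size, halve only the spectral dimension
h = Conv3D(filters=16, kernel_size=(3, 3, 7), strides=(1, 1, 2),
           padding='same', activation='relu')(patch)

m = Model(patch, h)
m.summary()  # spatial dims stay 5x5, spectral dim drops from 103 to 52
```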
  1. Feb. - 9. Feb.
  • 1D tryout seems good; needs more tests
  1. March
    ready to test:
  • (replace conv3d with conv2d)
  • different training data sizes (sample count)
  • different patch sizes
  • different numbers of channels
  • (different batch sizes)
  • (different depthwise conv channel counts)
  1. March
    found a case: randomly choosing 200 samples from the whole image as the training set gives much better results than randomly choosing 200 samples from the designated training set

  2. April

  • email the cluster team
  • try cross-validation
  • ask Amir how to determine the final result
  • read the "discr_loss" blog and try their code
  • read the GAN paper
  1. April
  • Adam vs. SGD
    the validation curve with Adam oscillates up and down --> not suitable for a standard early-stopping algorithm
    possible fix: use a smaller learning rate

  • alternative progress measure for early stopping (see the sketch after this list):
    do not use the ratio of the average training loss to the minimum training loss within a training strip, but rather the ratio of the current average training loss to the past average training loss

  • learning rate decay strategy

  • optimizer for G and optimizer for D

  • use the cross-entropy loss of only the first 9 labels to decide when to early stop

  • double-check the Dataloader in demoGAN (Zhu et al.) (PyTorch)
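A rough sketch of the alternative progress measure mentioned in this list (plain Python; strip length and stopping threshold are placeholder values): compare the current strip's average training loss with the previous strip's average instead of with the strip minimum.

```python
def strip_progress(losses, strip_len=5):
    """Relative improvement of the average training loss of the current strip
    over the average of the previous strip.  `losses` is the per-epoch
    training-loss history; values near 0 mean training has stalled."""
    if len(losses) < 2 * strip_len:
        return None  # not enough history yet
    current = sum(losses[-strip_len:]) / strip_len
    previous = sum(losses[-2 * strip_len:-strip_len]) / strip_len
    return (previous - current) / previous

# usage: stop training once the relative improvement falls below a threshold
# p = strip_progress(train_losses)
# if p is not None and p < 1e-3:
#     stop_training = True
```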

  1. April
  • test feature matching, starting from a one-layer model (ssgan_improved_pytorch); see the sketch below
  • try to implement a custom loss function, as in Keras
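A hedged sketch of the feature-matching objective for the generator (PyTorch); `D.features` in the usage comments is an assumed way to read an intermediate discriminator layer, not an existing function of ssgan_improved_pytorch.

```python
import torch

def feature_matching_loss(feat_real, feat_fake):
    """Generator objective from Improved GAN (Salimans et al., 2016): match the
    batch mean of an intermediate discriminator feature layer on real data
    with the batch mean on generated data."""
    return torch.mean((feat_real.mean(dim=0) - feat_fake.mean(dim=0)) ** 2)

# usage sketch (D.features is an assumed hook on an intermediate layer):
# f_real = D.features(x_real).detach()   # no gradient into D during the G step
# f_fake = D.features(G(z))
# g_loss = feature_matching_loss(f_real, f_fake)
```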