使用tensorflow实现VGG网络,训练mnist数据集方式

更新时间：2020年05月26日 11:37:40 作者：masterjames

这篇文章主要介绍了使用tensorflow实现VGG网络,训练mnist数据集方式，具有很好的参考价值，希望对大家有所帮助。一起跟随小编过来看看吧

VGG作为流行的几个模型之一,训练图形数据效果不错，在mnist数据集是常用的入门集数据，VGG层数非常多，如果严格按照规范来实现，并用来训练mnist数据集，会出现各种问题，如，经过16层卷积后，28*28*1的图片几乎无法进行。

先介绍下VGG

ILSVRC 2014的第二名是Karen Simonyan和 Andrew Zisserman实现的卷积神经网络，现在称其为VGGNet。它主要的贡献是展示出网络的深度是算法优良性能的关键部分。

他们最好的网络包含了16个卷积/全连接层。网络的结构非常一致，从头到尾全部使用的是3x3的卷积和2x2的汇聚。他们的预训练模型是可以在网络上获得并在Caffe中使用的。

VGGNet不好的一点是它耗费更多计算资源，并且使用了更多的参数，导致更多的内存占用（140M）。其中绝大多数的参数都是来自于第一个全连接层。

模型结构：

本文在实现时候，尽量保存VGG原来模型结构，核心代码如下：

weights ={
  'wc1':tf.Variable(tf.random_normal([3,3,1,64])),
  'wc2':tf.Variable(tf.random_normal([3,3,64,64])),
  'wc3':tf.Variable(tf.random_normal([3,3,64,128])),
  'wc4':tf.Variable(tf.random_normal([3,3,128,128])),
  
  'wc5':tf.Variable(tf.random_normal([3,3,128,256])),
  'wc6':tf.Variable(tf.random_normal([3,3,256,256])),
  'wc7':tf.Variable(tf.random_normal([3,3,256,256])),
  'wc8':tf.Variable(tf.random_normal([3,3,256,256])),
  
  'wc9':tf.Variable(tf.random_normal([3,3,256,512])),
  'wc10':tf.Variable(tf.random_normal([3,3,512,512])),
  'wc11':tf.Variable(tf.random_normal([3,3,512,512])),
  'wc12':tf.Variable(tf.random_normal([3,3,512,512])),
  'wc13':tf.Variable(tf.random_normal([3,3,512,512])),
  'wc14':tf.Variable(tf.random_normal([3,3,512,512])),
  'wc15':tf.Variable(tf.random_normal([3,3,512,512])),
  'wc16':tf.Variable(tf.random_normal([3,3,512,256])),
  
  'wd1':tf.Variable(tf.random_normal([4096,4096])),
  'wd2':tf.Variable(tf.random_normal([4096,4096])),
  'out':tf.Variable(tf.random_normal([4096,nn_classes])),
}
 
biases ={
  'bc1':tf.Variable(tf.zeros([64])),
  'bc2':tf.Variable(tf.zeros([64])),
  'bc3':tf.Variable(tf.zeros([128])),
  'bc4':tf.Variable(tf.zeros([128])),
  'bc5':tf.Variable(tf.zeros([256])),
  'bc6':tf.Variable(tf.zeros([256])),
  'bc7':tf.Variable(tf.zeros([256])),
  'bc8':tf.Variable(tf.zeros([256])),
  'bc9':tf.Variable(tf.zeros([512])),
  'bc10':tf.Variable(tf.zeros([512])),
  'bc11':tf.Variable(tf.zeros([512])),
  'bc12':tf.Variable(tf.zeros([512])),
  'bc13':tf.Variable(tf.zeros([512])),
  'bc14':tf.Variable(tf.zeros([512])),
  'bc15':tf.Variable(tf.zeros([512])),
  'bc16':tf.Variable(tf.zeros([256])),
  
  
  'bd1':tf.Variable(tf.zeros([4096])),
  'bd2':tf.Variable(tf.zeros([4096])),
  'out':tf.Variable(tf.zeros([nn_classes])),
}

卷积实现：

def convLevel(i,input,type):
  num = i
  out = conv2D('conv'+str(num),input,weights['wc'+str(num)],biases['bc'+str(num)])
  if type=='p':
    out = maxPool2D('pool'+str(num),out, k=2) 
    out = norm('norm'+str(num),out, lsize=4)
  return out 
 
def VGG(x,weights,biases,dropout):
  x = tf.reshape(x,shape=[-1,28,28,1])
 
  input = x
 
  for i in range(16):
    i += 1
    if(i==2) or (i==4) or (i==12) : # 根据模型定义还需要更多的POOL化，但mnist图片大小不允许。
      input = convLevel(i,input,'p')
    else:
      input = convLevel(i,input,'c')

训练：

pred = VGG(x, weights, biases, keep_prob)
cost = tf.reduce_mean(tf.nn.softmax_cross_entropy_with_logits(logits=pred,labels=y))
optimizer = tf.train.AdamOptimizer(learning_rate=learning_rate).minimize(cost)
 
correct_pred = tf.equal(tf.argmax(pred,1), tf.argmax(y,1))
accuracy_ = tf.reduce_mean(tf.cast(correct_pred,tf.float32))
 
init = tf.global_variables_initializer()
with tf.Session() as sess:
  sess.run(init)
  step = 1
  while step*batch_size < train_iters:
    batch_x,batch_y = mnist.train.next_batch(batch_size)
    sess.run(optimizer,feed_dict={x:batch_x,y:batch_y,keep_prob:dropout})
    print(step*batch_size)
    if step % display_step == 0 :
      #loss,acc = sess.run([cost,accuracy],feed_dict={x:batch_x,y:batch_y,keep_prob=1.0})
      acc = sess.run(accuracy_, feed_dict={x: batch_x, y: batch_y, keep_prob: 1.})
      # 计算损失值
      
      loss = sess.run(cost, feed_dict={x: batch_x, y: batch_y, keep_prob: 1.})
      print("iter: "+str(step*batch_size)+"mini batch Loss="+"{:.6f}".format(loss)+",acc="+"{:6f}".format(acc))
 
    step += 1 
   
  print("training end!")

最终效果：

训练10000次后：结果如下：

iter: 12288 mini batch Loss=5088409.500000,acc=0.578125

iter: 12800 mini batch Loss=4514274.000000,acc=0.601562

iter: 13312 mini batch Loss=4483454.500000,acc=0.648438

这种深度的模型可以考虑循环10万次以上。目前效果还不错，本人没有GPU，心痛笔记本的CPU，100%的CPU利用率，听到风扇响就不忍心再训练，本文也借鉴了alex网络实现，当然我也实现了这个网络模型。在MNIST数据上，ALEX由于层数较少，收敛更快，当然MNIST，用CNN足够了。

以上这篇使用tensorflow实现VGG网络,训练mnist数据集方式就是小编分享给大家的全部内容了，希望能给大家一个参考，也希望大家多多支持脚本之家。

您可能感兴趣的文章:

Python实现的读取电脑硬件信息功能示例
这篇文章主要介绍了Python实现的读取电脑硬件信息功能,结合实例形式分析了Python基于wmi库读取电脑CPU、磁盘、网络、进程等硬件信息相关操作技巧,需要的朋友可以参考下
2018-05-05
在Python运行时动态查看进程内部信息的方法
今天小编就为大家分享一篇在Python运行时动态查看进程内部信息的方法，具有很好的参考价值，希望对大家有所帮助。一起跟随小编过来看看吧
2019-02-02
Python 恐龙跑跑小游戏实现流程
大家好，本篇文章主要讲的是用python实现谷歌小恐龙小游戏，看看这是你断网时的样子么，感兴趣的同学赶快来看一看吧，对你有帮助的话记得收藏一下
2022-02-02
Python中range()与np.arange()的具体使用
本文主要介绍了Python中range()与np.arange()的具体使用，文中通过示例代码介绍的非常详细，对大家的学习或者工作具有一定的参考学习价值，需要的朋友们下面随着小编来一起学习学习吧
2022-06-06
MATLAB中print函数使用示例详解
print函数的功能是打印图窗或保存为特定文件格式，这篇文章主要介绍了MATLAB中print函数使用,需要的朋友可以参考下
2023-03-03
Pycharm新手使用教程(图文详解)
这篇文章主要介绍了Pycharm新手使用教程(图文详解)，文中通过示例代码介绍的非常详细，对大家的学习或者工作具有一定的参考学习价值，需要的朋友们下面随着小编来一起学习学习吧
2020-09-09
python中的列表和元组区别分析
这篇文章主要介绍了python中的列表和元组区别分析,需要的朋友可以参考下
2020-12-12
python画图的函数用法以及技巧
在本文里小编给大家整理的是关于怎么用python画图以及python画图中turtle函数的用法技巧，需要的朋友们参考下。
2019-06-06
教你怎么用Python实现GIF动图的提取及合成
今天教大家一个Python有趣好玩的小功能：将多张图片转为GIF,同时也可以将一个GIF动图提取出里面的图片,文中有非常详细的介绍及代码示例,需要的朋友可以参考下
2021-06-06
flask重启后端口被占用的问题解决(非kill)
本文主要介绍了flask重启后端口被占用的问题解决(非kill)，文中通过示例代码介绍的非常详细，对大家的学习或者工作具有一定的参考学习价值，需要的朋友们下面随着小编来一起学习学习吧
2023-04-04

使用tensorflow实现VGG网络,训练mnist数据集方式

相关文章

最新评论

大家感兴趣的内容

最近更新的内容

常用在线小工具