Get Your First Project Started Quickly: A TensorFlow Project Architecture Template

Synced (机器之心)

Published 2018-05-10 11:02:59

Included in the column: Synced (机器之心)

Selected from GitHub

Author: Mahmoud Gemy

Compiled by Synced

Contributors: Huang Xiaotian, Li Zenan

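As one of the most popular deep learning libraries, TensorFlow is a powerful tool for turning new deep learning methods into working implementations. It provides APIs for most of the programming languages commonly used in deep learning. For developers and researchers, the first question before starting a new project is how to set up a simple, clear structure. This article may help.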

Project link: https://github.com/Mrgemy95/Tensorflow-Project-Template

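TensorFlow Project Template

A clean, well-designed structure is essential for any deep learning project. After a lot of practice and work on TensorFlow projects, the author put together a TensorFlow project template that combines simplicity, a well-organized folder structure, and good OOP design. The template helps you start a TensorFlow project quickly and go straight to implementing your core idea.

With this simple template you can begin directly with tasks such as building the model and training it.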

Overview
In detail
Project architecture
Folder structure
Main components
Models
Trainer
Data loader
Logger
Configuration
Main
Future work

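Overview

In short, this is how to use the template. For example, if you want to implement a VGG model, you should:

In the models folder, create a class named VGGModel that inherits from the "base_model" class.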

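import tensorflow as tf
from base.base_model import BaseModel  # base class defined in base/base_model.py


class VGGModel(BaseModel):
    def __init__(self, config):
        super(VGGModel, self).__init__(config)
        # Call the build_model and init_saver functions.
        self.build_model()
        self.init_saver()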

Override two functions: "build_model", where you implement your VGG model, and "init_saver", where you define the TensorFlow saver. Then call both of them in the initializer.

    def build_model(self):
        # Build the TensorFlow graph of your model here and define the loss.
        pass

    def init_saver(self):
        # Initialize the TensorFlow saver that is used to save checkpoints.
        self.saver = tf.train.Saver(max_to_keep=self.config.max_to_keep)
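
The split between the initializer and the two overridden functions can be illustrated without TensorFlow. The following is a minimal sketch, not the template's actual base_model.py: BaseModelSketch, ToyModel, the list of layer names, and the dictionary standing in for the saver are all hypothetical and exist only for illustration.

```python
from types import SimpleNamespace


class BaseModelSketch:
    """Hypothetical stand-in for the template's base model class."""

    def __init__(self, config):
        self.config = config
        self.saver = None

    def build_model(self):
        # Subclasses must describe their graph and loss here.
        raise NotImplementedError

    def init_saver(self):
        # Subclasses must create their checkpoint saver here.
        raise NotImplementedError


class ToyModel(BaseModelSketch):
    def __init__(self, config):
        super().__init__(config)
        # Same order as in the VGG example: build first, then set up saving.
        self.build_model()
        self.init_saver()

    def build_model(self):
        # Assumption: a plain list of layer names stands in for a real graph.
        self.layers = ["conv", "pool", "dense"]

    def init_saver(self):
        # Assumption: a dict stands in for tf.train.Saver.
        self.saver = {"max_to_keep": self.config.max_to_keep}


config = SimpleNamespace(max_to_keep=5)
model = ToyModel(config)
```

A subclass that forgets to override either function fails immediately with NotImplementedError instead of silently training an empty model.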

Create a VGG trainer in the trainers folder that inherits from the "base_train" class.

class VGGTrainer(BaseTrain):
    def __init__(self, sess, model, data, config, logger):
        super(VGGTrainer, self).__init__(sess, model, data, config, logger)

Override two functions, "train_step" and "train_epoch", and write the logic of the training process in them.

    def train_epoch(self):
        """
        Implement the logic of an epoch:
        - loop over the number of iterations in the config and call the train step
        - add any summaries you want using the summary
        """
        pass

    def train_step(self):
        """
        Implement the logic of the train step:
        - run the TensorFlow session
        - return any metrics you need to summarize
        """
        pass
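
How the two functions fit together can be shown with a toy trainer. This is a sketch under assumptions: the article does not show the body of train(), so the epoch loop below, the names BaseTrainSketch and ToyTrainer, and the config fields num_epochs and num_iter_per_epoch are hypothetical. A decaying fake loss replaces a real session run.

```python
from types import SimpleNamespace


class BaseTrainSketch:
    """Hypothetical stand-in for the template's base trainer."""

    def __init__(self, config):
        self.config = config

    def train(self):
        # Assumption: train() simply calls train_epoch once per epoch.
        return [self.train_epoch() for _ in range(self.config.num_epochs)]

    def train_epoch(self):
        raise NotImplementedError

    def train_step(self):
        raise NotImplementedError


class ToyTrainer(BaseTrainSketch):
    def __init__(self, config):
        super().__init__(config)
        self.step = 0

    def train_epoch(self):
        # Loop over the configured number of iterations and average the loss.
        losses = [self.train_step() for _ in range(self.config.num_iter_per_epoch)]
        return sum(losses) / len(losses)

    def train_step(self):
        # A real step would run the session; here the loss just decays.
        self.step += 1
        return 1.0 / self.step


config = SimpleNamespace(num_epochs=2, num_iter_per_epoch=4)
epoch_losses = ToyTrainer(config).train()
```

Keeping the per-step work in train_step and the aggregation in train_epoch means only these two functions change from one experiment to the next.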

In the main file, create the session and instances of the following objects: "Model", "Logger", "Data_Generator", "Trainer", along with the config.

sess = tf.Session()
# Create an instance of the model you want.
model = VGGModel(config)
# Create your data generator.
data = DataGenerator(config)
# Create the TensorBoard logger.
logger = Logger(sess, config)

Pass all of these objects to the trainer object and start training by calling "trainer.train()".

trainer = VGGTrainer(sess, model, data, config, logger)

# Here you train your model.
trainer.train()
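
The snippets above read settings as attributes, for example config.max_to_keep. One way to obtain such an object is to load a JSON document into a namespace. This is a minimal sketch, not the template's own configuration code: the load_config helper and every key except max_to_keep are assumptions made for illustration.

```python
import json
from types import SimpleNamespace


def load_config(json_text):
    """Parse JSON text into an object whose keys are readable as attributes."""
    # object_hook converts every JSON object, including nested ones.
    return json.loads(json_text, object_hook=lambda d: SimpleNamespace(**d))


# Hypothetical settings; only max_to_keep appears in the code above.
raw = '{"exp_name": "vgg_example", "num_epochs": 10, "max_to_keep": 5}'
config = load_config(raw)
print(config.exp_name, config.num_epochs, config.max_to_keep)
```

Keeping the settings in one file lets the same main script run different experiments without code changes.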

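You will find a template file and a simple example in the model and trainer folders that show you how to get your first model running quickly.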

In detail

Project architecture

Folder structure

├── base
│   ├── base_model.py   - this file contains the abstract class of the model.
│   └── base_train.py   - this file contains the abstract class of the trainer.
│
│
├── model               - this folder contains the models of your project.
│   └── example_model.py
│
│
├── trainer             - this folder contains the trainers of your project.
│   └── example_trainer.py
│
├── mains               - here are the main files of your project (you may need more than one main).
│
│
├── data_loader
│   └── data_generator.py  - here is the data generator that is responsible for all data handling.
│
└── utils
     ├── logger.py
     └── any_other_utils_you_need
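
To try this layout quickly, the folders and files can be created with a short script. This is a sketch only: the scaffold helper is hypothetical, it writes empty placeholder files into a temporary directory, and it assumes the base trainer file is named base_train.py to match the "base_train" class mentioned in the overview.

```python
import tempfile
from pathlib import Path

# Relative paths taken from the tree above; the files are created empty.
LAYOUT = [
    "base/base_model.py",
    "base/base_train.py",
    "model/example_model.py",
    "trainer/example_trainer.py",
    "mains",
    "data_loader/data_generator.py",
    "utils/logger.py",
]


def scaffold(root):
    """Create the template's folders and placeholder files under root."""
    root = Path(root)
    for entry in LAYOUT:
        path = root / entry
        if path.suffix == ".py":
            path.parent.mkdir(parents=True, exist_ok=True)
            path.touch()
        else:
            path.mkdir(parents=True, exist_ok=True)
    # Report the created Python files relative to root, in sorted order.
    return sorted(p.relative_to(root).as_posix() for p in root.rglob("*.py"))


with tempfile.TemporaryDirectory() as tmp:
    created = scaffold(tmp)
```

Passing a real project path instead of the temporary directory would leave the skeleton on disk for further editing.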

Main components

Models