lstm 训练 QA 问答系统的问题?

推荐学习书目

› Learn Python the Hard Way

Python Sites

› PyPI - Python Package Index

› http://diveintopython.org/toc/index.html

› Pocoo

值得关注的项目

› PyPy

› Celery

› Jinja2

› Read the Docs

› gevent

› pyenv

› virtualenv

› Stackless Python

› Beautiful Soup

› 结巴中文分词

› Green Unicorn

› Sentry

› Shovel

› Pyflakes

› pytest

Python 编程

› pep8 Checker

Styles

› PEP 8

› Google Python Style Guide

› Code Style from The Hitchhiker's Guide

This topic created in 3076 days ago, the information mentioned may be changed or developed.

我想用 word2vex 训练词向量用于后续 lstm 模型的训练，那么训练词向量用的语料可以和训练 lstm 用的一样吗？

lstm

训练

语料

word2vex

6 replies • 2018-02-26 22:26:22 +08:00

menc

Feb 26, 2018

可以。

supervipcard

Feb 26, 2018

@menc 那请问用于训练词向量的语料在语料文件大小，每篇文章的长度等方面有什么需要注意的吗

afpro

Feb 26, 2018

直接加一个 embedding_lookup 就好了不 word2vec 也可以

menc

Feb 26, 2018

@supervipcard 越大越好。可以像楼下说的，用 embedding 层来做，数据量大的时候差别不大。

neosfung

Feb 26, 2018

embedding_lookup 的实现原理和 word2vec 貌似不一样吧？

supervipcard

Feb 26, 2018

@menc @afpro 用 embedding 层的话是先将训练集的句子中单词转换成一个个 id，相当于 ont-hot 编码，并且初始化一个词向量矩阵，再输入 embedding 层的吧。