在线时间:8:00-16:00
迪恩网络APP
随时随地掌握行业动态
扫描二维码
关注迪恩网络微信公众号
开源软件名称(OpenSource Name):jimmy-ren/vLSTM开源软件地址(OpenSource Url):https://github.com/jimmy-ren/vLSTM开源编程语言(OpenSource Language):MATLAB 91.9%开源软件介绍(OpenSource Introduction):vLSTMVectorized Long Short-term Memory (LSTM) using Matlab and GPU It supports both the regular LSTM described here and the multimodal LSTM described here. If you are interested, visit here for details of the experiments described in the multimodal LSTM paper. Hardware/software requirementsTo run the code, you have to have a NVidia GPU with at least 4GB GPU memory. The code was tested in Ubuntu 14.04 and Windows 7 using MATLAB 2014b. Character level language generationThe task is the same as that in the char-rnn project, which is a good indicator to show if the LSTM implementation is effective. Generation using a pre-trained modelOpen the
Data generation and trainingPaul Graham's essay is used in this sample. All text is stored in Once the data is ready, you may run Multimodal LSTM for speaker namingThe training procedure of the Multimodal speaker naming LSTM as well as the pre-processed data (the one you can use off-the-shelf) has been releaseed. Please follow the instruction below to perform the training. Download dataPlease go here or here to download all the pre-processed training data and put all the files under In addition, please go here or here to download the pre-processed multimodal validation data and put all the files under Start trainingOnce all the data is in place, you may start to train 3 types of models, namly the model only classifies the face features, the model only classifies the audio features and the model simultaneously classifies the face+audio multimodal features (multimodal LSTM). To train the face only model, you may run this script. Meanwhile, you can also run tests for the aforementioned three models by using the pre-train models. CitationsJimmy SJ. Ren, Yongtao Hu, Yu-Wing Tai, Chuan Wang, Li Xu, Wenxiu Sun, Qiong Yan,
"Look, Listen and Learn - A Multimodal LSTM for Speaker Identification", The 30th AAAI Conference on Artificial Intelligence (AAAI-16). |
2023-10-27
2022-08-15
2022-08-17
2022-09-23
2022-08-13
请发表评论