NaN loss and only OOV in the greedy output

The loss initially was decreasing until it reach nan's for a while. I am running it on the squad dataset and the exact argument used for running it is: 

python train.py --train_tasks squad --device 0 --data ./.data --save ./results/ --embeddings ./.embeddings/ --train_batch_tokens 2000

So the only change is the train batch tokens to 2000 since my GPU was running out of memory. I am attaching a screenshot. Is there anything I am missing? Should I try something else?

<img width="1668" alt="screenshot 2018-11-02 14 35 47" src="https://user-images.githubusercontent.com/680145/47942799-7b5b6e00-deb0-11e8-98ca-875fe5a60c15.png">


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

NaN loss and only OOV in the greedy output #42

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

NaN loss and only OOV in the greedy output #42

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions