Doubt on reset session_level init Hs[0] #11
Comments
Hi @gescobedo, first, thanks for carefully checking the code.
Hi @gescobedo, I just pushed a fix for the initialization issue.
As you may notice, there's a small drop in performance, albeit not a statistically significant one.
I'll let you know when I have more data on this issue.
Hi @mquad, thanks for checking this issue so quickly. It is good to know that the fix does not cause a big drop in performance. I will keep an eye on this.
In the initialization of session_level, you use a mask to reset the hidden state of the cell for some items of the batch, generating
h_s = Hs[0] * (1 - Sstart[:, None]) + h_s_init * Sstart[:, None]
but then you use the old state Hs[0] to calculate the final output state of the current cell:
h_s = (1.0 - z) * Hs[0] + z * h_s_candidate.T
Why is Hs[0] used here instead of h_s? Thanks in advance.
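To make the question concrete, here is a minimal NumPy sketch of the two variants. It is not the repository's code: the helper names `masked_reset` and `gru_interpolate`, the toy shapes, and the fixed values of `z` and `h_s_candidate` are all assumptions made for illustration, and the candidate is assumed to already have shape (batch, hidden), so no transpose is needed. In the real model the gate and candidate presumably also depend on the hidden state, which this sketch ignores.

```python
import numpy as np

def masked_reset(h_prev, h_init, start_mask):
    """Replace the rows of h_prev flagged by start_mask with h_init."""
    m = start_mask[:, None]          # (batch,) -> (batch, 1) for broadcasting
    return h_prev * (1 - m) + h_init * m

def gru_interpolate(h_base, h_candidate, z):
    """Final GRU step: blend the base state with the candidate via gate z."""
    return (1.0 - z) * h_base + z * h_candidate

rng = np.random.default_rng(0)
batch, hidden = 3, 4                 # toy sizes, assumed for illustration
Hs0 = rng.normal(size=(batch, hidden))            # stands in for Hs[0]
h_s_init = np.zeros((batch, hidden))              # hypothetical reset value
Sstart = np.array([0.0, 1.0, 0.0])                # only item 1 starts a session
z = np.full((batch, hidden), 0.5)                 # update gate, held fixed
h_s_candidate = rng.normal(size=(batch, hidden))  # candidate, held fixed

h_s = masked_reset(Hs0, h_s_init, Sstart)

# Variant raised in the question: the blend uses the old, un-reset state.
h_old = gru_interpolate(Hs0, h_s_candidate, z)
# Variant the question suggests: the blend uses the masked-reset state.
h_new = gru_interpolate(h_s, h_s_candidate, z)

# Rows without a session start are identical in both variants; the reset
# row differs by (1 - z) * (Hs0 - h_s_init).
changed_rows = ~np.isclose(h_old, h_new).all(axis=1)
```

Under these assumptions, only the batch items flagged in `Sstart` are affected, which is why the choice of base state matters exactly at session boundaries.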