Hi! I had a question regarding order of the operations in your implementation of the maryland scheme. Why do you first add delta and then do temperature? Doesnt it contradict 4.2 in the original paper about effect of delta on the perplexity (see first two paragraphs)
Hi! I had a question regarding order of the operations in your implementation of the maryland scheme. Why do you first add delta and then do temperature? Doesnt it contradict 4.2 in the original paper about effect of delta on the perplexity (see first two paragraphs)