Skip to content
This repository has been archived by the owner on Jan 1, 2021. It is now read-only.

Loss not improving after 2.2 #149

Open
atinesh-s opened this issue Oct 9, 2019 · 0 comments
Open

Loss not improving after 2.2 #149

atinesh-s opened this issue Oct 9, 2019 · 0 comments

Comments

@atinesh-s
Copy link

I have trained the model for quite a long (till Iteration 1000000), But loss seems to get stuck around 2.2 and bot response is also not satisfactory

HUMAN ++++ hello
BOT ++++ ?
HUMAN ++++ who are you
BOT ++++ . . .
HUMAN ++++ whats your name
BOT ++++ .
HUMAN ++++ where are you from
BOT ++++ ' s gone .
HUMAN ++++ what is the time there
BOT ++++ ' s not going to be a bad thing .
HUMAN ++++ I do not understand anything
BOT ++++ !
2019-10-09 14:01:09,550 -- INFO -- Iter 981000: loss 2.2273828502893447, time 0.08978772163391113
2019-10-09 14:03:29,982 -- INFO -- Iter 982000: loss 2.251806115269661, time 0.1407618522644043
2019-10-09 14:05:51,542 -- INFO -- Iter 983000: loss 2.2446057442426683, time 0.2159106731414795
2019-10-09 14:08:11,021 -- INFO -- Iter 984000: loss 2.2406707513332367, time 0.09187722206115723
2019-10-09 14:10:33,569 -- INFO -- Iter 985000: loss 2.243191688776016, time 0.284625768661499
2019-10-09 14:12:52,848 -- INFO -- Iter 986000: loss 2.229316849589348, time 0.09166932106018066
2019-10-09 14:15:12,966 -- INFO -- Iter 987000: loss 2.238621209859848, time 0.0910036563873291
2019-10-09 14:17:32,610 -- INFO -- Iter 988000: loss 2.2455421295166014, time 0.09028363227844238
2019-10-09 14:19:50,839 -- INFO -- Iter 989000: loss 2.2432832870483397, time 0.34220433235168457
2019-10-09 14:22:06,687 -- INFO -- Iter 990000: loss 2.2466694812774657, time 0.09603047370910645
2019-10-09 14:22:15,398 -- INFO -- Test bucket 0: loss 3.1786046028137207, time 0.05455660820007324
2019-10-09 14:22:15,458 -- INFO -- Test bucket 1: loss 3.5405633449554443, time 0.05947542190551758
2019-10-09 14:22:15,532 -- INFO -- Test bucket 2: loss 3.5922553539276123, time 0.07441473007202148
2019-10-09 14:22:15,633 -- INFO -- Test bucket 3: loss 3.529684543609619, time 0.10061907768249512
2019-10-09 14:22:15,755 -- INFO -- Test bucket 4: loss 3.7335658073425293, time 0.12195014953613281
2019-10-09 14:22:15,903 -- INFO -- Test bucket 5: loss 3.8613126277923584, time 0.14754486083984375
2019-10-09 14:24:21,939 -- INFO -- Iter 991000: loss 2.2421313049793246, time 0.08835697174072266
2019-10-09 14:26:39,390 -- INFO -- Iter 992000: loss 2.2535737413167953, time 0.16527605056762695
2019-10-09 14:29:01,609 -- INFO -- Iter 993000: loss 2.263053869485855, time 0.1388380527496338
2019-10-09 14:31:20,481 -- INFO -- Iter 994000: loss 2.2561977257728576, time 0.08809399604797363
2019-10-09 14:33:43,944 -- INFO -- Iter 995000: loss 2.2676013087034224, time 0.09026694297790527
2019-10-09 14:36:02,296 -- INFO -- Iter 996000: loss 2.2528336789608003, time 0.09229564666748047
2019-10-09 14:38:20,023 -- INFO -- Iter 997000: loss 2.2519494262933732, time 0.08894729614257812
2019-10-09 14:40:37,962 -- INFO -- Iter 998000: loss 2.266415533065796, time 0.08812499046325684
2019-10-09 14:42:58,623 -- INFO -- Iter 999000: loss 2.2743323941230775, time 0.09042882919311523
2019-10-09 14:45:17,886 -- INFO -- Iter 1000000: loss 2.2651904397010805, time 0.08829283714294434
2019-10-09 14:45:26,655 -- INFO -- Test bucket 0: loss 3.332874059677124, time 0.05175375938415527
2019-10-09 14:45:26,718 -- INFO -- Test bucket 1: loss 3.459939956665039, time 0.06204557418823242
2019-10-09 14:45:26,791 -- INFO -- Test bucket 2: loss 3.472844123840332, time 0.07347464561462402
2019-10-09 14:45:26,889 -- INFO -- Test bucket 3: loss 3.750105857849121, time 0.09812140464782715
2019-10-09 14:45:27,012 -- INFO -- Test bucket 4: loss 3.4856009483337402, time 0.1224663257598877
2019-10-09 14:45:27,151 -- INFO -- Test bucket 5: loss 3.558220863342285, time 0.13911890983581543
2019-10-09 14:47:39,301 -- INFO -- Iter 1001000: loss 2.268606811881065, time 0.08983755111694336
2019-10-09 14:50:02,334 -- INFO -- Iter 1002000: loss 2.275747295618057, time 0.09300374984741211
2019-10-09 14:52:24,905 -- INFO -- Iter 1003000: loss 2.265431757092476, time 0.2277059555053711
2019-10-09 14:54:45,345 -- INFO -- Iter 1004000: loss 2.2564414784908293, time 0.1413567066192627
2019-10-09 14:57:04,630 -- INFO -- Iter 1005000: loss 2.2581456750631332, time 0.0926673412322998
2019-10-09 14:59:28,269 -- INFO -- Iter 1006000: loss 2.268891993880272, time 0.21859478950500488
2019-10-09 15:01:54,166 -- INFO -- Iter 1007000: loss 2.269172732114792, time 0.09133005142211914
2019-10-09 15:04:14,596 -- INFO -- Iter 1008000: loss 2.2641460099220274, time 0.3426644802093506
2019-10-09 15:06:40,198 -- INFO -- Iter 1009000: loss 2.2892661439180375, time 0.2196190357208252
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant