We use mini-batch statistics during train, and use population statistics during test (which using some kind of approximation like exponential averages).
In case of small mini-batch, a mini-batch statistics seems to be a poor choice.
I can only wonder why we don’t use a kind of exponential average more during training?
thanks you RSS link
More link Blog tech
more link ADS
Blockchain, bitcoin, ethereum, blockchain technology, cryptocurrencies
Information Security, latest Hacking News, Cyber Security, Network Sec
Information Security, latest Hacking News, Cyber Security, Network Security
Blog! Development Software and Application Mobile
Development apps, Android, Ios anh Tranning IT, data center, hacking
Car News, Reviews, Pricing for New & Used Cars, car reviews and news, concept cars