Qa on Watchstep Blog

https://blog.watchstep.site/categories/qa/
Recent content in Qa on Watchstep Blog
© 2025 watchstep
Thu, 18 Dec 2025 09:26:10 +0900

❓What is the difference between LLaDA and BERT?
https://blog.watchstep.site/posts/llada-qa/
Fri, 04 Apr 2025 09:26:10 +0900

<h2 id="how-do-the-masking-of-llada-large-language-diffusion-with-masking-and-bert-differ">How does the “masking” in LLaDA (Large Language Diffusion with Masking) differ from the “masking” in BERT?</h2>
<p>Both <a href="https://arxiv.org/abs/1810.04805">BERT (Bidirectional Encoder Representations from Transformers)</a> and <a href="https://arxiv.org/abs/2502.09992">LLaDA (Large Language Diffusion with Masking)</a> use a “masking” technique.</p>