Personally identifiable information has been found in DataComp CommonPool, one of the largest open-source data sets used to train image generation models. Millions of images of passports, credit cards ...
Zachary del Rosario receives funding from the National Science Foundation and Toyota Research Institute. Nicknamed “Galloping Gertie” for its tendency to bend and undulate, the Tacoma Narrows Bridge ...
A new kind of large language model, developed by researchers at the Allen Institute for AI (Ai2), makes it possible to control how training data is used even after a model has been built.
When AI models fail to meet expectations, the first instinct may be to blame the algorithm. But the real culprit is often the data—specifically, how it’s labeled. Better data annotation—more accurate, ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results