data poisoning

Hello, you have come here looking for the meaning of the word data poisoning. In DICTIOUS you will not only get to know all the dictionary meanings for the word data poisoning, but we will also tell you about its etymology, its characteristics and you will know how to say data poisoning in singular and plural. Everything you need to know about the word data poisoning you have here. The definition of the word data poisoning will help you to be more precise and correct when speaking or writing your texts. Knowing the definition ofdata poisoning, as well as those of other words, enriches your vocabulary and provides you with more and better linguistic resources.

English

Noun

data poisoning (countable and uncountable, plural data poisonings)

  1. (machine learning, computer security) The deliberate use of a training dataset with data designed to increase errors in the output of a machine learning model.
    Synonym: poisoning attack
    • 2019, Simon N. Foley, editor, Data and Applications Security and Privacy XXXIII , Springer, →ISBN, page 4:
      While data sanitization shows promise to defend against data poisoning, it is often impossible to validate every data source .
    • 2022, Alfred Z. Spector, Peter Norvig, Chris Wiggins, Jeannette M. Wing, Data Science in Context: Foundations, Challenges, Opportunities, Cambridge University Press, →ISBN, page 148:
      Similarly, the Tay chatbot suffered from data poisoning. To mitigate data poisoning, it is important not to let any one group contribute too much data to a model.
    • 2023, Katharine Jarmul, Practical Data Privacy, O'Reilly, →ISBN:
      Data poisoning is one type of adversarial attack—where a user or group of users submit false data to influence the model toward a particular or incorrect prediction.
    • 2023, Paul Scharre, Four Battlegrounds: Power in the Age of Artificial Intelligence, W. W. Norton & Company, →ISBN:
      Some forms of data poisoning are undetectable. Attackers can insert adversarial noise into the training data, altering the training data in a way that is hidden to human observers.