Skip to content

Data

  • Data is the core of machine learning where general-purpose methodologies are designed to extract valuable patterns from data.
  • For example, given a large corpus of documents, machine learning methods are used to automatically extract topics from the documents.
  • Data is usually presented in the form of a numeric vector.
  • Data is presented typically in tabular format where each row is an instance and each column is a feature.