System and method of extracting data from structured and unstructured sources of data using automated joins

DRIVE November 8, 2018
A system and method that enable a user to select fields from a database or from semi-structured or unstructured documents, and that automatically join the database tables and documents into a feature vector that can be further processed by machine learning algorithms or by preprocessing routines and filters. The full join starts by producing a graph representation of the links between the data tables and documents, and then restructures that information into the most efficient join tree. The join tree is then used to extract the data in the form of a feature vector.
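The pipeline described above (link graph, then join tree, then feature vector) can be illustrated with a minimal Python sketch. Everything in it is an assumption made for illustration and is not taken from the patent: the function names `build_join_tree` and `extract_feature_vectors`, the toy tables, and the convention that two linked tables share a key column of the same name. A breadth-first tree pruned to the tables the user selected stands in for the "most efficient join tree", and a simple inner hash join stands in for the join itself.

```python
from collections import deque


def build_join_tree(links, root, needed):
    """Return join edges (parent, child, key) that connect root to every needed table.

    links  : iterable of (table_a, table_b, shared_key_column)  (hypothetical format)
    needed : set of table names that hold user-selected fields
    """
    # Undirected graph of links between tables/documents.
    graph = {}
    for a, b, key in links:
        graph.setdefault(a, []).append((b, key))
        graph.setdefault(b, []).append((a, key))

    # Breadth-first search gives each table a shortest-hop path back to root.
    parent = {root: None}
    order = [root]
    queue = deque([root])
    while queue:
        node = queue.popleft()
        for nxt, key in graph.get(node, []):
            if nxt not in parent:
                parent[nxt] = (node, key)
                order.append(nxt)
                queue.append(nxt)

    # Keep only edges on a path from root to a needed table (prunes unused tables).
    keep = {}
    for table in needed:
        if table not in parent:
            raise ValueError(f"no link path from {root!r} to {table!r}")
        while parent[table] is not None and table not in keep:
            keep[table] = parent[table]
            table = parent[table][0]

    # Emit edges in BFS order so a parent is always joined before its children.
    return [(keep[t][0], t, keep[t][1]) for t in order if t in keep]


def extract_feature_vectors(tables, links, root, fields):
    """Join along the tree and project the selected 'table.column' fields."""
    needed = {f.split(".")[0] for f in fields}
    tree = build_join_tree(links, root, needed)

    # Qualify column names with their table so joined rows cannot collide.
    rows = [{f"{root}.{k}": v for k, v in r.items()} for r in tables[root]]
    for parent, child, key in tree:
        # Hash index on the child's key column (inner join; an assumption).
        index = {}
        for r in tables[child]:
            index.setdefault(r[key], []).append(r)
        joined = []
        for row in rows:
            for match in index.get(row[f"{parent}.{key}"], []):
                merged = dict(row)
                merged.update({f"{child}.{k}": v for k, v in match.items()})
                joined.append(merged)
        rows = joined

    return [[row[f] for f in fields] for row in rows]


# Toy data, invented for this example.
tables = {
    "customers": [
        {"customer_id": 1, "age": 34, "region_id": 7},
        {"customer_id": 2, "age": 51, "region_id": 8},
    ],
    "orders": [
        {"order_id": 10, "customer_id": 1, "amount": 20.0},
        {"order_id": 11, "customer_id": 2, "amount": 35.5},
    ],
    "items": [{"order_id": 10, "sku": "A"}, {"order_id": 11, "sku": "B"}],
    "regions": [{"region_id": 7, "name": "north"}, {"region_id": 8, "name": "south"}],
}
links = [
    ("customers", "orders", "customer_id"),
    ("orders", "items", "order_id"),
    ("customers", "regions", "region_id"),
]
fields = ["customers.age", "orders.amount", "items.sku"]

tree = build_join_tree(links, "customers", {"customers", "orders", "items"})
vectors = extract_feature_vectors(tables, links, "customers", fields)
print(tree)
print(vectors)
```

In this sketch the user only selects fields, and the `regions` table is never joined because none of the selected fields live there. A real system would presumably weigh join costs (table sizes, key selectivity) when choosing the tree, which this breadth-first stand-in does not attempt.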
