Datasets

Developing algorithms requires data in the right quantities and containing the right features for the problem being solved. The process of collecting significant quantities of data and applying accurate labels suitable for machine intelligence is typically a labor intensive activity. SRC has embraced the use of synthetically generated data alongside collected data to support advanced machine intelligence algorithm development which operates in highly congested environments.

The following datasets were generated using SRC's data simulation tools called Synthetic Data Incubator for Cognitive and Tactical ELINT (SynDICaTE) and "Data Management and Bootstrapping Processing For Machine Learning and Classification Development" (U.S. Patent No. 11,651,290). They demonstrate some capabilities of the software to generate different environments containing configurable quantities and types of signals along with background interference. These settings are easily tailored giving the algorithm engineer or data scientist a great deal of flexibility in the data they are synthesizing.

A total of 3 datasets have been created which contain a number of file pairs. Each file pair consists of one in-Phase/Quadrature Phase (I/Q) data file written in binary format and one label file written in JSON format. The IQ data file is formatted as 32 bit float interleaved I and Q samples. Each label file consists of an array of rectangular bounding boxes which encompass each pulse along with a set of labels describing the pulse and which emitter it originated from. Example code which will allow you to easily parse and plot the data is also available for download.

Let us help solve your next "impossible" challenge. Contact us today!