The Department of Defense and intelligence community is coming to grips with the realization that it might have too much to make sense of after almost two decades of collecting mass quantities of intelligence data.
This realization was one of the catalysts for DoD’s flagship machine-learning effort dubbed Project Maven; with the hours of imagery data collected from aerial platforms, maybe algorithms could do the processing and unburden analysts.
However, these algorithms rely on data sets that have been prepared to be analyzed, which is called structured or tagged data.
Leaders are now asserting that many of these data sets have not yet been tagged, thus potentially rendering the latest technology advancements useless.
“There’s a lot of unexamined data … that you can apply machine learning to identify objects of interest in the seams,” Dawn Meyerriecks, director of Science and Technology at CIA, said Sept. 4 at the Intelligence and National Security Summit hosted by INSA and AFCEA.
“We have a ton of work to do as a community in terms of tagging, which is absolutely essential to actually be able to do anything interesting with the stuff from an analytical perspective.”
For the IC, large data sets that have already been collected yet not analyzed are ripe, low-hanging fruit, a problem Meyerriecks said the community is working on.
DoD and the IC have been very good at the data gathering piece, Lt. Gen. John Bansemer, assistant director of National Intelligence for Partner Engagement within the Office of the Director of National Intelligence, said Sept. 5 at the same conference. But now the community as a whole must focus more on how to enable analysts to make sense of all this data and get to the essence of decision to provide an advantage to commanders.
After all, the data is useless if it cannot be analyzed and made sense of faster than the adversary.
Some in the military are beginning to tag data immediately in order to allow for faster analysis by machines.
Lt. Gen. Jerry Harris, deputy chief of staff for Strategic Plans and Requirements with Headquarters Air Force, told the House Armed Services Committee in March that the service is working with several groups within its intelligence processing enterprise to ensure they are tagging data.
This tagged data will provide a larger and more robust training set so algorithms can more accurately identify key indictors in images or video.
Mark Pomerleau is a reporter for C4ISRNET, covering information warfare and cyberspace.