5/15/2019

Hadoop Project

Switching gears slightly, working on a Cloudera Hadoop Big Data project.  Process PDF files using OCR, using Spark on HBase, then index and search using Solr, in Azure.

A nice juicy project.


The world of unstructured data.