Data Engineering Stack - collect, transform and visualize geospatial data
I'm making a side project, where I collect geospatial data by web scrapping and from OSM API. I've started with simple Java application, however, I would like to make it as a data flow, purely for learning purposes.
Unfortunately, my knowledge about tools, and mostly connecting them, is, well, low.
What is my goal?
As a final result I want to visualize scrapped geospatial points on the map with the roads connecting them(from OSM).
Current flow:
In standalone Java application I'm scrapping the data for geospatial points. There is a client consuming the OSM API for needed data.
What I think it might be useful:
Use Apache Spark for collecting and transforming the data. Then use somehow GeoSpark, or Geotrellis, and Zeppelin to visualize the data. I was also thinking about using ES + Kibana for geodata, but it looks like the Zeppelin is enough.
I feel comfortable to work with Java, then Scala.
What do you think? Are there any better tools I can use? Did I miss anything?
Topic data-engineering geospatial visualization tools
Category Data Science