
Spark code used for my Master's Thesis. Run on AWS EMR clusters




RDII Apache Spark

All code written for processing D3S detector data collected from a mobile sensor network. The network consists of small radiation detectors that easily fit in a pocket. Each detector is paired via Bluetooth to a Samsung Galaxy S6. The smartphone runs an app created by our research group that sends the data to our Amazon Web Services cloud, where it is stored in S3. The header of the data is as follows:

Detector ID | Latitude | Longitude | Sigma (cps) | n (cps) | Time (ns) | channel 0 | channel 1 | ... | channel 4096

- Detector ID is a unique identification number paired with each detector
- Latitude and Longitude are the GPS coordinates of the detector (within a few meters if outdoors), in degrees
- Sigma is the ionizing radiation counts per second measured by the D3S detector
- n is the neutron counts per second measured by the D3S detector
- Time is the Unix time measured in nanoseconds
- Channels 0-4096 represent the energy spectrum measured by the detector
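
As a rough illustration of how rows with this header can be read into Spark, here is a minimal PySpark loading sketch. The S3 path, column names, data types, and CSV options are assumptions for the example, not the repository's actual code.

```python
# Hypothetical loader sketch: the bucket/prefix and column names are assumptions
# based on the header described above, not the repository's actual code.
from pyspark.sql import SparkSession
from pyspark.sql.types import StructType, StructField, StringType, DoubleType, LongType

spark = SparkSession.builder.appName("d3s-load-example").getOrCreate()

# Fixed columns, followed by the spectrum channels (channel 0 ... channel 4096).
fields = [
    StructField("detector_id", StringType()),
    StructField("latitude", DoubleType()),
    StructField("longitude", DoubleType()),
    StructField("sigma_cps", DoubleType()),
    StructField("n_cps", DoubleType()),
    StructField("time_ns", LongType()),
] + [StructField(f"channel_{i}", DoubleType()) for i in range(4097)]

d3s = (
    spark.read
    .option("header", "true")          # assumes the first line of each file is the header row
    .schema(StructType(fields))
    .csv("s3://example-bucket/d3s/")   # hypothetical S3 location
)
d3s.printSchema()
```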

Additionally, weather data from wunderground.com was used.

GOAL: Sensor networks are increasingly being used for nuclear nonproliferation purposes in urban environments. The goal of this research is to take the GPS- and time-tagged radiation data and generate heat maps using various geostatistical techniques, including MapReduce, kriging, and inverse distance weighting. By accurately characterizing background radiation, we can improve outlier detection and find anomalous radioactive sources in our data set. This project required Elastic MapReduce (EMR) clusters with Apache Spark pre-installed. All scripts were run on subsets of the D3S data set. Some data cleaning and manipulation was required before processing.
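
To make the heat-map idea concrete, the sketch below shows one naive way to express inverse distance weighting with DataFrame operations: every grid point averages nearby sigma readings weighted by 1/distance^p. The grid resolution, IDW exponent, search radius, and the `d3s`/`spark` names (from the loading sketch above) are all assumptions; this is not the thesis implementation, which would also need a smarter spatial partitioning than a full cross join.

```python
# Illustrative IDW sketch; `spark` and `d3s` come from the loading example above.
from pyspark.sql import functions as F

power = 2.0        # IDW exponent (assumed)
radius = 0.001     # search radius in degrees (assumed)
grid_cells = 200   # grid resolution per axis (assumed)

# Bounding box of the measurements.
bounds = d3s.agg(
    F.min("latitude").alias("lat_min"), F.max("latitude").alias("lat_max"),
    F.min("longitude").alias("lon_min"), F.max("longitude").alias("lon_max"),
).first()

lat_step = (bounds.lat_max - bounds.lat_min) / grid_cells
lon_step = (bounds.lon_max - bounds.lon_min) / grid_cells

# Regular lat/lon grid covering the data.
grid = (
    spark.range(grid_cells)
    .selectExpr(f"{bounds.lat_min} + id * {lat_step} AS grid_lat")
    .crossJoin(
        spark.range(grid_cells)
        .selectExpr(f"{bounds.lon_min} + id * {lon_step} AS grid_lon")
    )
)

# For each grid cell, average nearby sigma readings weighted by 1 / distance^power.
heat_map = (
    grid.crossJoin(d3s.select("latitude", "longitude", "sigma_cps"))
    .withColumn(
        "dist",
        F.sqrt(
            (F.col("grid_lat") - F.col("latitude")) ** 2
            + (F.col("grid_lon") - F.col("longitude")) ** 2
        ),
    )
    .where(F.col("dist") < radius)
    .withColumn("weight", 1.0 / F.pow(F.col("dist") + 1e-9, F.lit(power)))
    .groupBy("grid_lat", "grid_lon")
    .agg(
        (F.sum(F.col("weight") * F.col("sigma_cps")) / F.sum("weight"))
        .alias("idw_sigma_cps")
    )
)
```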

TODO:

  1. Describe research project, goals, and accomplishments in README
  2. Add descriptive comment headers to each file...

All Spark code was run in the Amazon EMR environment.

DISCLAIMER: These code examples alternate between using SparkSQL and DataFrame operations (often within the same file). I acknowledge that this might not be the best coding practice; this was done for my own learning purposes.
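
For readers unfamiliar with the distinction, here is a small example of the two styles mentioned above: the same per-detector aggregation written once with SparkSQL and once with DataFrame operations. The column names follow the assumed schema from the loading sketch earlier in this README, not necessarily the repository's files.

```python
# Same query in both styles; `spark` and `d3s` are from the loading sketch above.
from pyspark.sql import functions as F

d3s.createOrReplaceTempView("d3s")

# SparkSQL style
avg_sql = spark.sql("""
    SELECT detector_id, AVG(sigma_cps) AS mean_sigma_cps
    FROM d3s
    GROUP BY detector_id
""")

# Equivalent DataFrame-operation style
avg_df = d3s.groupBy("detector_id").agg(F.avg("sigma_cps").alias("mean_sigma_cps"))
```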
