Skip to content

Commit d174aa9

Browse files
authored
Update README.md
1 parent 54c0980 commit d174aa9

File tree

1 file changed

+3
-3
lines changed

1 file changed

+3
-3
lines changed

README.md

+3-3
Original file line numberDiff line numberDiff line change
@@ -1,15 +1,15 @@
1-
# Apache Spark setup and spatial queries execution
1+
# Large Scale Geo-Spatial Data Analysis using SparkSQL
22

33
## CSE 512 - Distributed and Parallel Database Systems
44

55
## Project description
66

7-
The project was aimed to setup a spark cluster with HDFS and run SparkSQL queries on the it.
7+
The project was aimed to setup a spark cluster with HDFS and run SparkSQL queries (geo-spatial) on the it.
88

99
* Native spark cluster was used as cluster manager.
1010
* Hadoop Distributed File System (HDFS) was used as distributed storage system.
1111
* The setup was done using Amazon EC2 virtual machines as nodes.
12-
* Spatial queries such as range query, range join query, distance query and distance join query were executed.
12+
* Spatial queries such as range query, range join query, distance query, distance join query, hot zone analysis and hot cell analysis were executed.
1313
- Spatial queries were executed by implementing user defined functions such as ST_contains and ST_within in Scala.
1414
- ST_contains takes a point and a rectangle and returns a boolean indicating whether the point is inside the rectangle.
1515
- ST_within takes two points and a distance and returns a boolean indication whether the distance between the points is not more than the distance provided.

0 commit comments

Comments
 (0)