SlideShare une entreprise Scribd logo
1  sur  59
GIS 5103 – Fundamentals of GIS
Lecture 8
3D GIS
Dimensionality
A dimension is the minimum amount of coordinates needed to
define the mathematical space of an object.
Lecture 8 – 3D GIS
2DA figure that has only length & height as its dimensions. 2D
shapes lie on a flat surface; known as plane figures or plane
shapes. While they have areas, 2D shapes have no volume.2D
figures are plotted on two axes, the x- and y-axes.
Lecture 8 – 3D GIS
3DLength + Height + WidthX + Y + Z Volume + Surface
AreaExamples include: 3D multi-patch building, roof, interior
floors, and foundation would all contain different z-values for
the same 2D coordinate. Aircraft's 3D position or a walking
trail up a mountain, would only have a single z-value for each
X,Y location.
Lecture 8 – 3D GIS
2.5DHighly used in GIS to represent Z data that is not
continuous = 3DZ data does not need to be elevation. Pollution,
# of cases, etc.
Lecture 8 – 3D GIS
3D file types .3dd.sxdWhen you import an ArcGlobe or
ArcScene document:First the .3dd file opens by default in
global mode Secondly the .sxd file opens in local mode. Any
new blank scene view defaults to global mode.
Lecture 8 – 3D GIS
Data TypesVector FeaturesPoint / Line / Polygon Surface
typesTIN (Triangular Irregular Networks)DEM’s (Digital
Elevation Models)Raster SurfaceLAS Dataset
Lecture 8 – 3D GIS
3D StylesPointsGeometric
ShapesModelsMarkersImportedLinesTextured
SymbolsGeometric ShapesPolygonsTexture FillBasic Colors
Lecture 8 – 3D GIS
Lecture 8 – 3D GIS
Perspective ViewPerspective drawing is the most common
drawing mode in 3D, where features in the foreground are
shown larger than those off in the distance. This matches the
way we see the world in our day-to-day lives, and the result is a
realistic representation of 3D content. All scenes open in
perspective viewing mode. You can switch between Perspective
Perspective View and Parallel Isometric View viewing modes
using the Drawing Mode drop-down menu in the Scene group on
the View tab.
Lecture 8 – 3D GIS
Parallel / Isometric ViewParallel drawing renders the 3D view
using a parallel projection, where features of the same physical
size are rendered on-screen identically, regardless of their
distance from the viewing camera. Parallel drawing is useful for
architectural drawings, as well as for representing statistical
data in a 3D view, such as extruded shapes symbolizing numeric
values.
Lecture 8 – 3D GIS
3D Analyst ToolsData Conversion/PreparationTxt / Binary /
Feature class / Raster / TINSurface CreationInterpolation /
LASD CreationSurface AnalysisAspect / Slope / Contour /
Feature Interpolation3D Operator & VisibilitySkyline / Inter-
visibility / Sun Shadow Analysis
Lecture 8 – 3D GIS
There are five exploratory analysis tools:The Line of Sight tool
creates sight lines to determine if one or more targets are
visible from a given observer location.The View Dome tool
determines the parts of a sphere that are visible from an
observer located at the center.The Viewshed tool determines the
visible surface area from a given observer location through a
defined viewing angle.The Slice tool visually cuts through the
view's display to reveal hidden content.The Cut and Fill tool
provides volume calculations while visually demonstrating the
surface area where the ground will be filled or removed.
Lecture 8 – 3D GIS
Exploratory Analysis ToolsThe first three analysis tools
determine the visible sight lines, field of view, or spherical
visibility coverage. The Slice tool temporarily removes
displayed content using clipping planes as volumes.The Cut and
Fill tool calculates volumetric displacement using a planar
surface. In all cases, the results are temporary and must be
saved as feature classes if you want to reload the interactive
objects again.
Lecture 8 – 3D GIS
illumination propertiesA unique element to authoring scenes is
that you can also define illumination properties. This includes
properties such as the time of day, whether or not the sun casts
a shadow, and how much ambient light is used. In global mode,
you can also simulate atmospheric diffusion.
Lecture 8 – 3D GIS
By the end of this week you should be able to:
Understand dimensionality List the differences between 2D vs
2.5D vs 3D Know the data types, Views, & Scene Types for 3D
GIS Understand Surfaces & How to create/find themFamiliarize
yourself with the Exploratory & Analyst toolsList the unique
properties of 3D GIS vs Traditional 2D mapsComprehend the
issues of 3D GIS
Lecture 8 – 3D GIS
GIS 5103 – Fundamentals of GIS
Lecture 7
Intro to R-Studio & Python for GIS
Week 7
Winter 2019
What is R-Studio & why should I use it?Free open-source
integrated coding language.Powerful computing/programing &
simple UIAlso very popular
Intro to R & Python - Lecture 7
Applications &
CapabilitiesStatisticsGraphicsProgramingCalculatorGISNeural
NetworksEtc.
Intro to R & Python - Lecture 7
Intro to R & Python - Lecture 7
Installinghttps://courses.edx.org/courses/UTAustinX/UT.7.01x/
3T2014/56c5437b88fa43cf828bff5371c6a924/
Set working DirectoryDefining the working area / confining
thinking space / Files of interest located here
Intro to R & Python - Lecture 7
Import DataFiles of interest located in working directory.
Intro to R & Python - Lecture 7
Manipulate & Organize Data
Intro to R & Python - Lecture 7
Various functions for parameters of interest
Able to work with data and create data.
Convert, Transpose, data with single commands.
Plots / DiagramsplotMF(object.cls)plotMF(object)
Intro to R & Python - Lecture 7
Most Popular Uses of RData ManagementConversion, sub-
setting, merging, transformationsStatistical & Mathematical
AnalysisANOVA’s, Modeling, RegressionsProgramming /
Operations / System InterfaceAutomation, Special
ConditionsGraph / Data Display / ConfigurationData
Visualization, High CustomizationSimulationsHigh level
scenario modeling
Intro to R & Python - Lecture 7
Variables
Can define characters/numbers as nearly anything
x = 5
First_Name = Mike
f = (sin(y) / tan(sum(z)) * variance(y/z)
Name – Type – Size – Value
Created Title– character/matrix/list – Bytes/KB/MB – Output
Intro to R & Python - Lecture 7
Functions & Syntax
Mean(x, trim = 0, na.rm = FALSE, …)
Function(input, argument 1, argument 2, …)
*Commas used to separate arguments/inputs*
Intro to R & Python - Lecture 7
Intro to R & Python - Lecture 7
Help Dialogue
Intro to R & Python - Lecture 7
What is Python & why should I use it?Object-orientated, high
level programming language with dynamic semanticBuilt in data
structures, dynamic typing/bindingUsed as glue to connect
programs / components together.Supports modules / packages /
code re-use
Intro to R & Python - Lecture 7
Python Libraries = R-Studio PackagesGIS
LibrariesGDALOGRPyprojShapelyMapnikMySQLPostGISSpati
aLite
Intro to R & Python - Lecture 7
How to download Pythonhttps://realpython.com/installing-
python/
How to Pip install
librarieshttps://packaging.python.org/tutorials/installing-
packages/
Intro to R & Python - Lecture 7
Intro to R & Python - Lecture 7
How we will use python & Libraries
Example of ArcGIS & python
https://developers.arcgis.com/python/sample-
notebooks/chennai-floods-analysis/
Intro to R & Python - Lecture 7
GIS AutomationPythonR-StudioModel Builder
Intro to R & Python - Lecture 7
Common Automated processesAddress systemsData
UpdatingMap TemplatingSpatial JoinsDigitizing
CalculationsModelsWork Flows
Intro to R & Python - Lecture 7
After reviewing this lecture you should be able to:
List applications/uses of a coding languagesKnow many of the
popular input file typesUnderstand/Replicate functions, syntax,
styleKnow the names of essential packages/librariesList
commonly automated tasksBe able to download & Install the
softwares
Intro to R & Python - Lecture 7
2
Foundations of Geographic Information Science
GIS 5103
Fall 2019 - Week Six
Vector Analysis
Objectives:
By the end of week seven you will be able to:
Describe Measurements in GIS;
Perform data queries in ArcGIS Pro
Explain techniques used in reclassification of data
Describe Buffering Functions and map overlays
Earlier in this course we covered some of the questions that can
be answered by GIS. Over the past few weeks we have been
discussing different types of data, data formats, data models,
data creation and editing, and finding/correcting errors in data.
Once we have our software, computer hardware, personnel, and
data in place we can start to answer questions with our system.
This week we will take a look at some of the questions a GIS
can answer and also
some of the tools GIS uses to answer the questions like the ones
listed below:
Where are particular features found?
What geographical patterns exist?
Where have changes occurred over a given period?
Where do certain conditions apply?
What will the spatial implications be if an organization takes a
certain action?
Common questions answered by GIS
Questions for GIS Analysis
Of course “where” is the predominant question but there are
some additional questions
beyond just where. Below are some examples of the questions
beyond where.
Location: what is at or where is it? E.g., where is the nearest
Mexican restaurant to my house?
Patterns: what spatial patterns exist? E.g., where do most of the
students live in Boston?
Trends: what has changed since? Has the number of deer
incidents on roads
reduced since commuters have used the deer whistles?
Conditions: where can I find something close to, within or by?
Modeling/Implications: what if? An example may include: if a
new baseball park were built adjacent to Gillette stadium, how
would traffic flow be affected?
Our data analysis procedures assist us in answering these
questions. Let us look at them more closely now.
Making Measurements in GIS
Calculating lengths, perimeters and areas are common
applications in GIS. For example, measuring the length of a
road from a digital map is pretty straightforward. However, a
GIS can make measurements on other types of features as well.
There are a number of tools at your disposal and depending on
the type of data you are using (raster or vector). The next few
slides will introduce you to some of those measuring tools and
techniques.
a. Length (distance); b. Perimeter; c. Area
Vector GIS measurements: (a) distance;
(b) area; and (c)perimeter can be measured as the sum of
straight line lengths.
The figure to the right shows how these
quantities are calculated by the GIS, but as users it is much
easier for us. All that is needed is to click on the measure tool
in the toolbar of the ArcMap GIS software. The GIS does the
rest.
In Vector GIS, length, perimeter and area data can be stored as
attributes in a database. By storing the lengths and areas as
attributes we can select features meeting a certain criteria and
sum the areas or lengths. For example how many acres of
wetlands do we have in a city?
Measurements in GIS
The figure above from Esri shows 3 and 5 minute drive time
polygons from a downtown area. Drivetime polygons can be
measured using either a straight line distance or by using road
networks and traffic conditions.
Example of a Measurement
Queries in GIS
Queries are the main data retrieval operations that are useful at
all stages of GIS analysis for everything from data quality
checking to a final analysis.
Features may be selected by two primary methods:
Select by Location - select features based on their location
relative to features in another layer. For instance, if you want
to know how many homes were affected by a recent flood, you
could select all the homes that fall within the flood boundary.
You can use a variety of selection methods to select the point,
line, or polygon features in one layer that are near or overlap
the features in the same or another layer. (Esri, 2017)
Select by Attribute - Select By Attributes allows you to provide
a SQL query expression that is used to
select features that match the selection criteria. (Esri, 2017).
An example procedure for selecting features by attribute might
include:
A prospective buyer wants to know where all the 3 bedroom
houses are for under $175,000 (apparently it’s possible, just
not in Boston). The ArcGIS user can call the Selection menu in
the software and then by pressing the “select by attributes”
option. This will provide a new window with
fields, unique values and numerical operators within it. This
window allows
the attribute table to be queried. A command line query question
may then look like:
status=Y + Bedroom=3+ saleprice=$175000.
Where: status refers to house availability; 3 refers to the desired
number of bedrooms; and sale price refers to… well, the sale
price,and a dream sale price at that for any buyer in Boston.
Select by Attribute
Selecting by attributes involves the use of Boolean Operators
(Pg. 380) in which use the conditions OR, AND, or NOT. In
Esri’s ArcMap software we can use the “Select by Attributes”
dialog box to build an expression. The dialog box to the right
lists the layer “zebra_mussel” we want to select features from,
the attribute fields “STATE” and “YEAR”. The buttons and
dialog box
can be used to select all of the points from th. e feature layer
“zebra_mussel” that are in the state of New York and are from
2005.
We can use this dialog box to build very complex expressions
and even save the expressions for future use. The next slide
will show an example of how more complex expressions can be
built.
Select by Attributes
Queries Continued
This image illustrates how an attributes or non-spatial query is
executed in ArcGIS. In previous chapters we had
discussed how SQL Queries can be used to select features from
an attribute table. Notice the query “SELECT * FROM
Parcel_Poly_BT_2008_LOCID WHERE: BEDROOMS = 3
AND FULLBATHS= 2 AND TOTALVALUE<175,000.
Essentially, the query is telling the software to find all of the
homes in Waltham that have 2 full baths, 3 Bedrooms, that cost
less than $175,000.By using operators such as “AND”,
“OR”,“NOT”,etc. we can build queries based on whatever
criteria we choose. I can execute this query by clicking on the
“Apply” button. The parcels that are selected can then be
exported to create a new layer or table. We can also modify our
query to refine our selection. You will have an opportunity to
gain some hands-on experience working with these queries in
the co-requisite with this course; GIS5102.
An example procedure of selecting features by location might
include:
An analyst wants to know where all the osprey nests are within
a distance of
0.5 miles to a major highway. The ArcGIS user can call the
Selection menu in the software and then by pressing the
“selecting by location” tool.
This will allow the user to use the operators: “are within a
distance
of” (0.5miles to freeway) and “contain” (osprey nests) in order
to carry out the query.
In the following slide, an example is included of selecting
particular soil types by query as part of a step to produce a soil
erosion model. It is important to realize that these techniques
can be used sequentially and do not have to be standalone.
Queries Continued
Example of a typical spatial query
A spatial query is used to find features that are adjacent, within
a distance of, are contained by, crossed by, etc. The graphic to
the right looks similar to the “Select by Attributes” dialog box
on the previous slides but the main difference is that instead of
using SQL queries or Boolean operators we are able to build
expressions to find features that are near, within, are within a
distance of, touching, or intersecting another feature.
The following set of slides illustrates how a “Spatial Query”is
executed. In this following example we will be selecting all of
the parcels within 300 feet of our selected parcel (highlighted
in red).
Example of a typical spatial query
We can select features adjacent to or within a specified
distance of a selected feature using “Select by Location”.
Example of a typical spatial query
Notice the “Select by Location” toolbox that now appears on
the screen. The first dropdownbox allows me to specify which
operation I would like to use. In the second box, the
“Parcel_Poly” layer is checked off. By checking off the
“Parcel_Poly” layer I am specifying which layer I will be
querying. I could also use the select by locations dialog to
query features that are in separate layers, for instance I could
also create a query that would find all of the hydrants within
300 feet of the selected property. The drop-down menu at the
bottom of the dialog box allows me specify the type of location
query I would like to use. In this example I will use the “are
within a distance of” option.
Example of a typical spatial query
At the bottom of the dialog box I can specify a distance that I
would like to use. In this case I will use 300 feet. Once I have
built the select by location query I will click on the apply
button and all of the parcels within 300 feet of the subject
property are now highlighted in red.
Earlier in the chapter we looked at ways of finding features that
are within a certain distance of a selected feature or features.
This distance is commonly referred to as a buffer. Buffers are
drawn based on a user specified distance and vary in shape
based upon whether the feature being buffered is a point (a), a
line (b), or a polygon (c). The diagram below illustrates how
these shapes are drawn. If multiple features are being buffered
and the buffer polygons overlap you can dissolve the buffer
boundaries if you choose to create a continuous buffer polygon.
Buffering and neighborhood functions
a b
c
We discussed one way to use a buffer is to designate an area for
special protection. In this example, 1000 foot buffer zones
(purple) around the rail network (black) are established to
define an oil spill impact area. This is what buffers around line
features look like.
An Example of a Buffer
The graphic above illustrates a buffer polygon that was created
to show an area 300 feet around a parcel polygon. The buffer is
shown in blue. When buffering a polygon, the buffer that is
created emulates the shape of the object being buffered. In this
case a polygon.
An Example of a Buffer
Map Overlay: Data Integration
One of the most basic questions asked of a GIS is "What's on
top of what?" For example:
What land use is on top of what soil type?
What parcels are within the 100-year floodplain? ("Within" is
just another way of saying "on top of.")
What roads are within what counties?
What wells are within abandoned military bases?
To answer such questions before the days of GIS, cartographers
would create maps on clear plastic sheets and overlay these
sheets on a light table (remember Week 1) to create a new map
of the overlaid data.
Because overlay yields such valuable information, it was
paramount to the development of GIS. There are two types of
overlays in GIS Vector and Raster
Map Overlay: Data Integration
An overlay operation is much more than a simple merging of
line work; all the attributes of the features taking part in the
overlay are carried through, as shown in the
example to the right, where parcels (polygons) and flood zones
(polygons) are overlaid
(using the Union tool) to create a new polygon dataset. The
parcels are split where they are crossed by the flood zone
boundary, and new polygons created. The FID_flood value
indicates whether polygons are outside (-1) or inside the flood
zone, and all polygons retain their original land-use category
values. Esri 2017.
Map Overlay: Data Integration
The total area of each land-use type in the flood zone can be
calculated by selecting all polygons within the flood zone
(using the Select Layer By Attribute tool, for example) and
summarizing the area by land- use type (using the Frequency
tool). Following is a chart illustrating the result of this
calculation. (Esri 2018).
Map Overlay: Data Integration
There are a number of vector overlay tools that are common to
most commercially available GIS softwares. The graphic to the
right lists some of these tools. The tools you use vary based on
the output you desire but they generally involve creating or
updating new datasets.
You will have a chance to use some of these tools in your
analysis class and also in GIS5201 – Advanced
Spatial Analysis.
We have now looked at some of the common tools and
techniques used to conduct a
GIS analysis. Now that we have described some of the
techniques and tools how and when do we apply them? Also,
how and where do we begin? Below are five points to consider
when conducting a GIS analysis.
Frame the question – Begin by defining the objectives of your
analysis. What are the primary questions you are trying to
answer?
Explore and prepare data – Do you have the data you need to
conduct the analysis? Do you have to create new data or
acquire it from another source? These are critical questions.
You may not be able to acquire or develop the data you need
from the sources you have access to within the time frame or
cost associated with the analysis.
Choose analysis methods and tools – What tools or methods do
you need to perform your analysis?
Perform the analysis – Your analysis should be conducted a
number of times to make sure your
results make sense. You should have a good working knowledge
of the subject matter.
Examine and refine results – Examine your results and refine
your output products. Are your results repeatable? Your maps
and tables should be designed to be easily understood by your
intended audience.
Common questions answered by GIS
Foundations of Geographic Information Science
GIS5103 – Fall 2019
Week 4 Database Management
Outline
Differentiate between relational database management systems
(RDBMS), databases, and data files.
Explain why we select the database approach for GIS
Define what a relational database is
Describe how databases are created
Describe GIS database applications
Database Management System (DBMS)
What is a Database Management System (DBMS)?
A DBMS is a system [or software] designed to manage a
database and allow user-
selected operations to be performed on the data.
1.
2.
3.
4.
It is a logical construct for the storage and retrieval of
information. It puts the database into a geographic coordinate
system.
Attribute data models are needed for the DBMS.
It contains data definition language; a data-entry module; a data
update module; a report generator; and query language.
What is the difference between and database and a database
management system?
A database is a simply a collection of related data. This can
include non-computerized data such as those found in a ‘phone
or address book. It can also include information contained in
spreadsheets or also in relational databases or databases
comprised of multiple tables that are connected or related to
each other using common fields or
“keys” found in each of the tables.
Data in a computer database, however, are managed and
accessed through a database management system (DBMS). The
database management system is used to manage, access the
database
The Database is distinct from the DBMS
In order for there to be a DBMS, there must be a database. In
order for there to be a database there must be data files
Bolstad’s Figure 8-4. It shows the database approach to data
handling data.
First comes the Data File, then the database and then the DBMS
So, what is the difference between a data file and a database?
Heywood discusses this on p.110. For simplicity, below is a
comparison in tabular form.
The image above from Screencast illustrated the differences
between a flat file (like a spreadsheet) and the relational
database model structure (RDBMS).
Creating the Database, 337-339
To design a database, the user must:
Identify the features (i.e. entities) to be represented (e.g.,
forests, trails, and recreational features. These are all nouns);
Identify the appropriate tables for a relational database. What
is the ID# that could allow a relational join for e.g. (Forest-
ID)?
Identify the feature’s attributes
(e.g., Trail Difficulty)
Identify relationships between the features (e.g., what
activities, trails, can be explored at each of the forests? What
trails are in each forest and how difficult area they?);
Figure 8-6: Forest data in a relational database structure.
Creating the Database
Keys or Key ID’s are extremely important in a relational
database. They allow us to combine and display data from other
tables and also allow us to store, query, and edit data more
efficiently. There are two types of keys, primary and foreign.
A primary key, also called a primary keyword, is a key in a
relational database that is unique for each record. It is a
unique identifier, such as a driver license number, telephone
number (including area code), or vehicle identification number
(VIN). In the example to the left, the Forest-ID would be the
primary key. A relational database must always have one and
only one primary key.
A foreign key is a set of one or more columns in a table that
refers to the primary key in another table. There isn’t any
special code, configurations, or table definitions you need to
place to officially “designate” a foreign key. For example,
trail name.
Cardinality
In the context of databases, cardinality refers to the uniqueness
of data values contained in a column.
High cardinality columns are those with very unique or
uncommon data values. For example, in a
database table that stores bank account numbers, the “Account
Number” column should have very high
cardinality – by definition, every item of data in this column
should be totally unique.
Normal cardinality columns are those with a somewhat unique
percentage of data values. For instance, if a table holds
customer information, the “Last Name” column would have
normal cardinality. Not every last name will be unique (for
example, there will likely be several occurrences of “Smith”)
but on the whole, the data is fairly non-repetitive.
Low cardinality columns are those with very few unique values.
In a customer table, a low cardinality column would be the
“Gender” column. This column will likely only have “M” and
“F” as the range of values to choose from, and all the
thousands or millions of records in the table can only pick one
of these two values for this column.
Cardinality relationships between tables can take the form of
one-to-one, one-to-many (whose reversal is many-to-one) or
many-to-many. These terms simply refer to the relationships of
data between the tables. For example, the relationship between
the “Customers” table and the “Bank Accounts” table is one-to-
many, that is, one customer can have several accounts, but one
account cannot belong to more than one customer. That is, of
course, assuming this bank has never heard of joint accounts!
Cardinality
Cardinality is commonly expressed in the following ways:
Databases
In many tabular databases (e.g., Dbase, SQL Server, ORACLE,
Excel etc.), it is possible to list data, create queries, and sort
the results.
Linking a tabular database to a GIS, however, provides
additional “spatial”
capabilities that enhance the value of the data to provide useful
information.
The “geographic search” is the secret to GIS data retrieval, in
addition to the
ability to get back on demand data that were previously stored.
Let’s take a look some examples of GIS software and the
“spatial” database
facilities they provide.
Multitier Architecture
.
Figure 8-5: Data are stored and accessed from the databases on
the bottom tier and requests to view the data (queries) are made
from the clients on the top tier consisting of desktop and web
based products. Application servers act as the middle-men,
processing transactions from the clients to the databases on the
bottom tier. In well designed systems there are rules or policies
setup that control how the data is accessed and by whom. We
will look at this in more detail layer in the lecture.
Spatial Database facilities in ArcGIS
ESRI’s ArcGIS software is packaged with tutorials for
ArcCatalog, ArcMap, ArcToolbox, and any extensions that you
purchase. ESRI maintains on-line resources for teaching GIS in
higher education, which includes links to lab exercises. The
software you are using in the analysis course is, of course, ESRI
software.
GIS and image processing software developed at Clark Labs -
an educational and research institution at Clark University in
Worcester, Massachusetts - is called Idrisi. It is easy to use
while providing professional-level GIS, image processing, and
spatial statistics analysis on Windows computers. It originally
focused on handling raster data.
Database facilities in Idrisi http://www.clarklabs.org/
The historical approach to data handling is that a different
version of the visitor’s details may be stored in separate
databases. Additionally, data may be duplicated, errors may
occur during transcription, and data storage mechanisms are
often inflexible etc.
Why Choose a Database Approach?
Card index record from ski school manual database
It is also very difficult to update files using the traditional
approach, as
illustrated in this example for a hardcopy index card held for an
individual.
We are striving for the database approach for data handling
Key Points for databases:
They are collections of related material shared by different
users;
Data is organized for easy reference with no duplication (e.g.,
think of a phone book);
Computer databases are superior to hard- copy ones because
they are easier to update and data can be adapted to different
uses (e.g., unlike the phonebook).
Relational Databases
Example of a relational attribute table data for hotels in Happy
Valley, including the
terminology
/ ID number
Earlier this term you were introduced to flat file and
hierarchical attribute tables. Relational attribute files are
different to flat files and hierarchical files (see next slide for
comparison). As we can see here, by organizing the data into 2D
tables that contain records for each entity / feature, it is
possible to “query” these data using a GIS, to find out which
hotels are the cheapest or which is 5 star etc.
Examples will be given in later slides.
New attribute tables can be created using the relational ID
number.
This is 1 example of the flexibility of computer databases over
hardcopy forms, in that they can be adapted to different
purposes to illustrate different relationships.
You will be doing these types of exercises during your
analysis course.
Creating New Attribute Tables from Existing Ones
Retrieval Operations
Retrieval operations may be applied to either spatial or attribute
data. For example, if searches are made to attribute data then
the user would find or browse an entity.
As we saw in the last slide, and as you have experienced in your
analysis course, this can be used for data reorganization by
selecting, renumbering, and sorting the data.
Similarly, new attributes can be calculated by computation of
selected values.
Spatial Retrieval Operations
In a spatial (map) database, records are features or themes.
The spatial equivalent of an “attribute find” is locate. The GIS
then
highlights the result.
Spatial equivalents of the DBMS queries result in locating sets
of features or building new GIS layers.
Queries
Functions, query language and applying algorithms could be
difficult requiring a
command line approach. For example:
find in states where state_name = ‘California’
<1 record in result>
use states
calculate in states population_density = population / area
<50 records in result>
restrict in states where population_density > 1000
<20 records selected in result>
So you are coming into this field at a good time because it
recent years it has become A LOT more user friendly.
Querying via SQL
At the highest level, a query in SQL is comprised of three key
components or operations: SELECT, FROM, and WHERE.
Intuitively, these operations used together tell your DBMS/GIS
to SELECT some subset or fields of data FROM some data set
WHERE some such condition is met.
Retrieval User Interface
Now the Retrieval User Interface simplifies the process. Most
GIS packages use the GUI (Graphical User Interface) of the
computer’s operating system to support both a menu-type query
interface and a macro or programming language.
SQL (Standard Query Language) is a standard interface that
works with relational databases and is supported by many GIS
software packages.
So, let us examine a few things we can do operations with the
GUI and SQL in the next slides. You will also look at this in
more detail in week 8 (chapter 9) and you will be applying
these techniques extensively in the analysis course.
I will start with spatial retrieval operations. Then go on to
illustrate an application from my own work that uses complex
retrieval operations. Finally, I will finish this week’s lecture
component by discussing GIS applications using the web
interface.
Examples of Attribute Searches and Queries
Identify– The image below shows a commonly used tool in GIS.
The user clicks on the “identify” tool (circled in red) and then
clicks on a feature in the map which will then display all of the
attributes associated with that feature. This tool provides a
quick
“snapshot” of attributes associated with a feature and does not
allow the user to save or
work with the popup.
Examples of Attribute Searches and Queries
Select by Attributes – The “select by attributes” tool allows the
user to build a SQL query and then view the results of the
query on the map display. The user can also see the results in
the layer’s attribute table, further refine the query, or export
the data into a new table. The example below
shows features from the 2013 EPA_Temperature_2013 layer
greater than January 2nd 2013 and less than or equal to April
1st 2013. Another advantage of using this tool is that more
complex queries can be saved an re-used at a later date.
Examples of Spatial Searches and Queries
– Select by Location
In GIS there will be instances where you will be required to
find features that are near or that intersect other features.
While the select by attributes tool we saw on the previous slide
is a powerful tool it is limited in that it can only use SQL
language to select features. The Select by Location is an
example of a tool that can be used to select features from one
layer that intersect, touch, are within a distance of, etc. a
feature or features in another layer. The example to the right
retrieves all of the 2009 seismic events that are within a mile
of the selected countries. Once the data is selected in can be
exported to another feature layer or viewed as an attribute table
which can be exported.
Foundations of Geographic Information Science
GIS5103 – Fall 2019
Week 5
Data Input, Throughput, & Quality
By the end of this week you should be able to:
Describe methods of data input
Describe Digitizing sources ‐ the cartographic base
Contrast Digitizing methods within a GIS (Tablet vs. “Heads
Up”)
Describe Data editing (correcting digitizing errors,
re‐projections
and generalizations)
Define accuracy and precision
Describe the importance of data standards and metadata
List sources of errors in a GIS project
Differentiate spatial and attribute errors
Describe techniques for coping with errors in Data
The Data Stream
The focus of this week’s lecture component is on creating GIS
or spatial data from a variety of
sources. We will also be looking at how GIS data is updated.
Digitizing GIS Data
As we have discussed over the past several weeks one of the
primary features of a GIS is its ability to integrate data from a
variety of sources. Depending on how the data is to be used the
need to convert the data or extract features from it may arise.
Fortunately there are a number of techniques and tools available
to assist in acquiring data for
a new GIS is straightforward these days. Some of the sources
for this data can include some of the following:
GPS (Global Positioning Systems) has become a major source
of new GIS data. Data acquired through GPS can be highly
spatially accurate (1cm horizontal) depending upon the type of
receiver used.
Digital map images such as scanned maps, LiDAR , satellite
data, and aerial photographs (Ortho photos) are now often used
as a cartographic base for digitizing; this is another way to
create new GIS data layers.
Remotely sensed satellite data are becoming an important
source of GIS data as the cost of data falls, free data becomes
available and new types of data emerge.
The following examples show the different spatial scales
commonly found
with remote sensing data. Examples are also provided that show
how they can be used to digitize temporal changes or changes
to features over time (urbanization) or after an event (Natural
Disaster).
Satellite data are raster data. The amount of spatial detail
represented by one pixel of a raster
image determines its spatial resolution. So, an image with one
meter spatial resolution means
that each pixel in the image represents one square meter on the
ground. With improvements in
cameras and digital image acquisition the cost of acquiring
aerial images has decreased. It is now possible to obtain
imagery with a 6” pixel resolution for a few thousand dollars.
30 meters
10 meters
5 meters
The Different Spatial Resolutions
Image Resolutions Continued
2 meters
1 foot
In cartographic applications, higher spatial resolutions are often
favored. However, for applications such as weather forecasting
or to monitor seasonal plant growth on a
yearly basis for North America, coarser spatial resolutions are
favored. While this type of data
can be extremely useful there are data storage considerations
and large file sizes to consider which may impact the
accessibility of these images especially in web based
applications.
1 meter
Where to obtain remote sensing data
There are a number of useful websites from which remote
sensing data (satellite images) can be downloaded. Here are
some examples:
Free satellite image data @ University of Maryland:
http://glcf.umd.edu/data/
It is also possible to BUY data! e.g.
http://gs.mdacorporation.com/products/index.asp
Free Digital Terrain data:
http://www.webgis.com/terraindata.html
Digitizing within a GIS
Sometimes digitizing is necessary if:
We need new to input new features.
Map features are incorrectly mapped.
Updates are needed for existing features.
And so, just what is digitizing?
Digitizing is the process of capturing map data in a GIS layer
by tracing points, lines, or polygons from a map or image by
using a mouse or a “puck” on on a digitizing tablet.
It can be done by either using a digitizing tablet, or by
digitizing directly from a satellite image or a scanned map on
screen.
On‐screen or “heads up” digitizing creates a spatial dataset by
tracing over features displayed on a computer monitor with a
mouse. The newly created dataset picks up the spatial reference
of the source document and results in a string of points with (x,
y) values. You will learn how to digitize in tutorial 6 of your
analysis class but we will examine how we can do this now as
well.
Digitizing table and PC workstation
A few years ago, the digitizing table was widely used in GIS.
This method allows features to be input into a GIS
database as points, lines or polygons but often requires
extensive data clean-up and editing. With the updates made to
GIS software this method of data entry is essentially obsolete
and has been replaced by on-screen digitizing.
Digitizing directly on screen
Digitizing directly on screen or “heads up” digitizing, is the
approach I am most familiar with and it is what you will be
doing in the analysis class.
In this case, you use your computer mouse to digitize paper
maps, aerial photos, or other images displayed in the GIS. We
can refer to this layer as a stable base map (that is geocoded) ‐
recall from slide 15 that the newly created dataset picks up
the spatial reference of the source document? Below is an
example, where the outline of fluvial features (blue) and
contour lines (brown) have been digitized. Only the fluvial
features can have been digitized directly from the image on the
left, however.
Selecting points to digitize
A vertex
Digitizing is not difficult to do. You can choose to digitize
either a point, a line (snap to feature) or a polygon (snap to
end). Snapping is a routine embedded within the GIS that
makes sure the separate vertices in a line or an area are
connected. That is, it ensures there are no gaps. This is
important if calculations or correlations are going to be
performed on the new layers. In these cases,
gaps would cause errors. The next slide shows how snapping
can be achieved –it uses topology
Geocoding Address Data
What is it?
Geocoding address data is the process of relating an address to a
geographic location (such as latitude/longitude coordinates) or
geographic area (such as census tract, block group, block, or
ZIP code). The address itself then is used to determine the
geographical coordinates.
Geocoding can be affected by the quality of data, e.g., incorrect
spelling and the use of different abbreviations (e.g., for Street
and Avenue). Therefore, the use of standards is relevant to
geocoding address data.
Address Matching
Address matching is the process of geocoding street addresses
to a street network.
(Modified) ESRI’s definition of address matching:
A process that compares an address or a table of addresses to
the address attributes of a reference dataset (e.g., locations may
be determined based on address ranges stored for each street
segment). This determines whether a particular address falls
within an address range associated with a feature in the
reference dataset.
If it does, it is considered a match and a location can be
returned. The next slide provides an example of how an address
can be converted to a point feature and drawn along a line
segment based upon an address range.
Address Matching
In most instances a geocoding service or engine is built upon a
street network in which line segments are assigned an address
number range on the left and right side of the streets, a city or
twin name, and a zip code. When a user enters an address the
geocoding engine finds a match along a line segment and places
the location of the address (a point) based upon where it falls
along the street line segment.
GIS Services: Address Matching Resources
Creating a geocoding engine can be a time consuming process.
However, there are some commercially available geocoding
services that allow us to enter a table of addresses for
geocoding
and then download the results and add them to our GIS
database. Some geocoding services are free and some charge a
fee.
http://www.batchgeo.com
https://geomap.ffiec.gov/FFIECGeocMap/GeocodeMap1.aspx
https://www.census.gov/geo/maps-data/data/geocoder.html
Error, Accuracy, and Precision
Until quite recently, people involved in developing and using
GIS paid little attention to the problems caused by error,
inaccuracy, and imprecision in spatial datasets. Surely, a GIS is
too powerful for error?
Not true! It is now recognized that error, inaccuracy, and
imprecision, if left unchecked, can make the results of a GIS
analysis almost worthless, it is like putting garbage in means
getting garbage out.
But where do these errors come from?
Since a GIS can collate and cross‐reference many types of data
by location and can integrate many discrete datasets (which is
the heart of its power), it can also inherit error from the
imported datasets.
17
Error, Accuracy, and Precision
We can discuss error in terms of data quality.
Data quality refers to the relative accuracy and precision of a
GIS database. These facts are often documented in data quality
reports.
Errors may exist both in map data (which can be reduced by
maintaining topological integrity) and also attribute data.
Next we will discuss the differences between accuracy and
precision in GIS data.
18
Data Accuracy
Accuracy is the degree to which information on a map, or in a
database, matches true values. The level of accuracy required
for particular applications and GIS analyses varies greatly.
Highly accurate data can be very difficult and costly to produce
and maintain often requiring highly accurate GPS and/or survey
data or the acquisition of LiDAR or remotely sends data
(satellite imagery, aerial photographs, etc.).
In discussing a GIS database, it is possible to consider
horizontal and vertical (spatial) accuracy with respect to
geographic position (i.e. how close is the mapped feature to its
actual location on the earths surface) as well as attribute
accuracy (i.e. do the values in the Attribute table match the real
world values).
19
Data Accuracy
1:1,200 ± 1 m
1:2,400 ± 2 m
1:4,800 ± 4 m
1:10,000 ± 8.5 m
1:12,000 ± 10 m
1:24,000 ± 12 m
1:63,360 ± 32 m
1:100,000 ± 50 m
This means that when we see a point on a map we have its
"probable" location within a certain area. For example, a sewer
manhole shown on a 1:1,200 scale map should be within 1
meter (3 ft) of it’s true location on the ground to be considered
within the 1:1,200 accuracy standard. It’s important to
understand the limitations of your data as well. While a layer
of sewer manholes within the 1:1200 data standard may be
suitable for broad planning or maintenance activities, using this
data at scales it was not intended (engineering purposes for
example) for can have negative and potentially costly
consequences.
20
Defining Precision
Precision refers to the level of measurement and exactness of
description in a GIS database.
The level of precision required for particular applications varies
greatly. Engineering projects such as road and utility
construction require very precise information measured
to the millimeter. Demographic analyses of marketing or
electoral trends can often make do
with less. For example to the closest zip code or precinct
boundary. Highly precise data can also be very difficult and
costly to collect.
Precise data ‐ no matter how carefully measured ‐ may be
inaccurate and vice-versa. Let’s try to understand this better by
using Bolstad’s diagram on page 625, which I have included in
the next slide.
21
Accuracy and Precision
Points (yellow circles) are digitized to represent the center of
the cloverleaf intersection. Average accuracy is high when the
average of the points falls near the true location, as in the
panels on the left side of the figure. Precision is high when the
points are all clustered near each other (top panels). A group of
points may be accurate, but not precise (lower left), or precise,
but not accurate (upper right). We typically strive for a
process that provides both accuracy and precision (upper left),
and avoid low accuracy and low precision (lower right).
22
Sources of Inaccuracy and Precision
Let us take a minute to discuss attribute accuracy and precision,
that is the non‐spatial data linked to lo
cation.
Inaccuracies may result from mistakes of many sorts, including
basic data entry. With respect to precision, precise attribute
information describes phenomena in great detail. For example,
a precise description of a person living at a particular address
might include gender, age, income, occupation, level of
education, and many other characteristics. A less precise
description might include just income or gender. It is the
application that will determine whether precise or less precise
data are needed.
There are many sources of error that may affect the quality of a
GIS dataset. Some are quite obvious, but others can be difficult
to discern. Few of these will be automatically identified by the
GIS itself.
It is the user's responsibility to prevent them.
Sources of error can be divided into 3 main categories:
Conceptual errors;
Errors arising through data processing; and
Errors arising from source data
23
Errors Arising from Source Data
Age of Data
Data sources may be too old to be useful or relevant to current
GIS projects. Past collection standards may be unknown,
non‐existent, or currently acceptable. For instance, John
Wesley Powell's nineteenth century survey data of the Grand
Canyon lacks the precision of data that can be developed and
used today. Additionally, erosion, deposition, and other
geomorphic processes will have modified the landscape.
Therefore, reliance on old data could skew, bias, or negate
results.
Density of Observations
The number of observations within an area is a guide to data
reliability and they should be known by the map user. An
insufficient number of observations may not provide the level
of resolution required to adequately perform spatial analysis
and determine the patterns GIS projects seek to resolve or
define.
24
Errors Arising from Source Data
There are four ways we describe errors in spatial data
Positional accuracy – describes how close the locations of
objects represented in a digital dataset correspond to the true
locations of the real-world entities.
Attribute accuracy – summarizes how different the attributes are
from their real world values.
Logical Consistency – reflects the presence, absence, or
frequency of inconsistent data. Tests for logical consistency
often require comparisons among themes (i.e. all buildings
must be on dry land).
Completeness – Describes how well a layer reflects all of the
real world features it is supposed to represent.
The next slide shows a figure from the text (Figure 14-3, Page
624) that depicts examples of these types of errors.
25
Errors Arising from Source Data
26
Data Standards
Regular checks and tests should be employed during a project to
make sure that standards are being followed. This allows a
designer to pinpoint difficulties at an early stage and correct
them. Establishing data standards helps with data exchange –
unfortunately, the history of GIS data exchange has been
chaotic and has been wasteful in the past.
Examples of good data standards include:
USGS, National Mapping Program Standards,
http://nationalmap.gov/standards//
Spatial Data Transfer Standard
http://mcmcweb.er.usgs.gov/sdts/
27
2
Foundations of Geographic Information Science
GIS 5103
Fall 2019 - Week 3
Spatial Data Modeling
By the end of Week Three you should be able to:
Provide a definition of a ‘spatial data model’
Explain how spatial entities are used to create a data model
Distinguish between rasters and vectors
Describe a spatial data structure
Explain what topology is
List the advantages and disadvantages of different types of
spatial models
File Types
Files in a GIS may be binary data files. They can be read by
computers (a program or a hardware understands in advance
exactly how it is formatted) but not people.
All executable programs and most numeric data files are stored
in binary files.
Or they can be ASCII text. ASCII files can be read by humans.
Binary is faster to read and smaller.
Raster and Vector Data
A file must have a mental model of how the physical data
represent a geographic feature.
GIS traditionally use either raster or vector models to store
data.
Rasters represent phenomenon distributed continuously over a
large area.
Vectors are object based (point, line, polygon).
Raster Data Example
Satellite data has a RASTER data structure (GRID
CELL), composed of matrix of pixels;
Each pixel records the “radiation intensity” measured by the
remote sensing instrument called its radiometric resolution.
The intensity of radiation is represented by a data number
(DN), that is, the “brightness value”.
One grid cell (pixel) has one attribute that represents its data
number/ brightness value (even if 0); a cell also has a spatial
resolution, given as the cell size in ground units.
Grid Structure of Raster Data, p.81-82
Vector Data Example
Spatial feature files that are object (vector) based:
Have identifiable boundaries or spatial extent;
Relevant to an intended application;
They have “ATTRIBUTES” – descriptive characteristics of a
feature.
For example, a tax parcel can have a rectangular shape and be
represented by a polygon, that is the feature’s geometry. The
owner name, parcel id, property value, etc. would be
considered the
polygon’s attributes.
Attribute data are stored in tables comprised of a matrix of
numbers and values
and are stored in rows and columns, like a spreadsheet, e.g.
attribute tables.
OBJECTS may be:
Exact, e.g., man-made objects; or
Inexact (fuzzy), e.g., soil types and forest stands etc.
Recall: spatial objects represented by vector files may be
represented either as
points, lines or polygons (areas) [slides 9-11, class 2].
Vectors vs Rasters:
Vectors can store information about topology (relational space,
p. 51-52).
GIS Data File Formats
Most GIS systems can import different data formats, or use
utility programs to convert them;
Data formats can be industry standard or commonly accepted
standard;
Example vector formats: DLG (Digital Line Graph) and TIGER
(Topologically Integrated Geographic Encoding and
Referencing system), shp. (ESRI)
Personal, File, and Enterprise GeoDatabase (ESRI);
Most raster formats are digital image formats, e.g. Tiff (Tagged
Image File
Format) or DEMs (Digital Elevation Models).
Data Models
Once you have determined which features you would like to
map or create data
layers for you next have to decide how you wish to portray
those features
and also consider the following:
How the geographic features are to be represented for each
theme (for example, as points, lines, polygons, or rasters) along
with their tabular attributes
How the data will be organized into datasets, such as feature
classes, attributes, raster datasets, and so forth
What additional spatial and database elements will be needed
for integrity rules, for implementing rich GIS behavior (such as
topologies, networks, and raster catalogs), and defining spatial
and attribute relationships between datasets.
Representation
Each GIS database design begins with a decision as to what the
geographic representations will be for each dataset. Individual
geographic entities can be represented as:
Feature classes (sets of points, lines, and polygons)
Imagery and rasters (aerial photography, grid datasets)
Continuous surfaces that can be represented using features
(such as contours), rasters (digital elevation models [DEM]), or
triangulated irregular networks (TINs) using terrain datasets
Attribute tables for descriptive data – these tables can be stand-
alone tables that contain information related to a feature in a
feature class – such as an inspection, an activity (catch basin
cleaning, tree maintenance, etc.)
Data Themes
Geographic representations are organized in a series of data
themes (sometimes referred to as thematic layers). A key
concept in a GIS is one of data layers, or themes. A data theme
is a collection of common geographic elements such as a road
network, a collection of parcel boundaries, soil types, an
elevation surface, satellite imagery for a certain date, well
locations, and so on.
The concept of a thematic layer was one of the early notions in
GIS. Practitioners thought about how the geographic
information in maps could be partitioned into logical
information layers—as more than a random collection of
individual objects (such as a road, a bridge, a hill, a house, a
peninsula). These early GIS users organized information in
thematic layers that described the distribution of a phenomenon
and how it should be portrayed across a geographic extent.
These layers also provided a protocol (capture rules) for
collecting the representations (as feature sets, raster layers,
attribute tables, and so on).
In GIS, thematic layers are one of the main organizing
principles for GIS database design.
Each GIS will contain multiple themes for a common
geographic area. The collection of themes acts as layers in a
stack. Each theme can be managed as an information set
independent of other themes. Each has its own representations
(points, lines, polygons, surfaces, rasters, and so on). Because
the various independent themes are spatially referenced, they
overlay one another and can be combined in a common map
display. Plus, GIS analysis operations, such as overlay, can fuse
information between themes.
GIS datasets are collections of representations for a data
theme
Geographic data collections can be represented as feature
classes and raster-based datasets in a GIS database.
Many themes are represented by a single collection of
homogeneous features such as a feature class of soil type
polygons and a point feature class of well locations. Other
themes, such as a transportation framework, are represented by
multiple datasets (such as a set of spatially related feature
classes for streets, intersections, bridges, highway ramps, and
so on).
Raster datasets are used to represent continuous surfaces, such
as elevation, slope, and aspect, as well as to hold satellite
imagery, aerial photography, and other gridded datasets (such
as land cover and vegetation types).
GIS datasets are collections of representations for a data
theme – Cont.
Both the intended use and existing data sources influence
spatial representations in a GIS. When designing a GIS
database, users have a set of applications in mind. They
understand what questions will be asked of the GIS. Defining
these uses helps to determine the content specification for each
theme and how each is to be represented geographically. For
example, there are numerous alternatives for representing
surface elevation: as contour lines and spot height locations
(such as hilltops, peaks), as a continuous terrain surface (a
TIN), or as shaded relief. Any or all of these may be relevant
for each particular GIS database design. The intended uses of
the data will help to determine which of these representations
will be required.
Frequently, the geographic representations will be
predetermined to some degree by the available data sources for
the theme. If a preexisting data source was collected at a
particular scale and representation, it will often be necessary to
adapt your design to use it.
Individual GIS datasets often are collected in concert with other
data
layers
While each GIS dataset can be used independently of other GIS
data, it is often quite important to collect datasets in concert
with other information layers so that the fundamental spatial
behavior and spatial relationships are maintained and consistent
between the related GIS data layers. Here are a few examples
that help to illustrate this concept:
The spatial relationships between elevation, landform, soil type,
slope, vegetation, surficial geology, and other terrain
properties are typically compiled in unison to characterize
environmental resource units. Understanding the science behind
these spatial relationships helps to build a consistent, logical
database where features from each data layer are consistent
with each other.
Topographic basemap information is compiled in an integrated
manner. Hydrography, transportation, structures, administrative
boundaries, and other topographic map layers are compiled in
unison. These cartographic representations in the map display
are built in an integrated manner to communicate clearly and
accurately and draw attention to key map locations.
Individual GIS datasets often are collected in concert with other
data
layers – Cont.
In each of these cases, a data model defines a collection of
related data themes that fit into an overall information
framework. Each framework is essentially a collection of
related data themes that are best captured in unison with each
other. The data capture guidelines follow sound scientific
principles about their spatial behavior and relationships. Each
theme plays an important part in the holistic characterization of
a particular landscape. For example:
Terrain landscape. Topographic maps, elevation, drainage
network, transportation network, map features, cross-country
movement, and so forth
Urban landscape. Buildings, critical infrastructure, and so forth
Imagery landscape. Satellite and aerial, local, regional, and
national assets, and so forth
Human landscape. Demographics (population characteristics),
cultural centers, citizens, administrative districts and zones and
so forth
Workforce landscape. Mobile workforce tracking, service
centers, traffic
conditions, warehouses, and so forth
Sensor landscape. Camera locations, devices, and so forth
Operations and plans landscape. Zones of control, planned
movements,
response, and so forth
This concept of collecting integrated data themes in unison is
one of the key design principles used in each of the ArcGIS
Data Models.
Topology
Topology is a collection of rules that, coupled with a set of
editing tools and techniques, enables the geodatabase to more
accurately model geometric relationships. ArcGIS implements
topology through a set of rules that define how features may
share a geographic space and a set of editing tools that work
with features that share geometry in an integrated fashion. A
topology is stored in a geodatabase as one or more
relationships that define how the features in one or more
feature classes share geometry. The features participating in a
topology are still simple feature classes—rather than modifying
the definition of the feature class, a topology serves as a
description of how the features can be spatially related.
Why Topology?
Topology has long been a key GIS requirement for data
management and integrity. In general, a topological data model
manages spatial relationships by representing spatial objects
(point, line, and area features) as an underlying graph of
topological primitives—nodes, faces, and edges. These
primitives, together with their relationships to one another and
to the features whose boundaries they represent, are defined by
representing the feature geometries in a planar graph of
topological elements.
Why topology?
Topology is a collection of rules that, coupled with a set of
editing tools and techniques, enables the geodatabase to more
accurately model geometric relationships. ArcGIS implements
topology through a set of rules that define how features may
share a geographic space and a set of editing tools that work
with features that share geometry in an integrated fashion. A
topology is stored in a geodatabase as one or more relationships
that define how the features in one or more feature classes share
geometry. The features participating in a topology are still
simple feature classes—rather than modifying the definition of
the feature class, a topology serves as a description of how the
features can be spatially related.
Why topology?
Topology is fundamentally used to ensure data quality of the
spatial relationships and to aid in data compilation. Topology
is also used for analyzing spatial relationships in many
situations, such as dissolving the boundaries between adjacent
polygons with the same attribute values or traversing a network
of the elements in a topology graph.
Topology can also be used to model how the geometry from a
number of feature classes can be integrated. Some refer to this
as vertical integration of feature classes.
Ways that features share geometry within a topology
Features can share geometry within a topology. Here are some
examples among adjacent
features:
Area features can share boundaries (polygon topology).
Line features can share endpoints (edge-node topology).
In addition, shared geometry can be managed between feature
classes using a geodatabase topology. For example:
Line features can share segments with other line features.
Area features can be coincident with other area features. For
example, parcels can
nest within blocks.
Line features can share endpoint vertices with other point
features (node topology).
Point features can be coincident with line features (point
events).
The next slide shows some commonly used topology rules found
in ArcGIS.
28
What are the advantages of topology?
The results are that topological primitives (nodes, edges, and
faces) and their relationships to one another and their features
can be efficiently discovered and assembled. This has several
advantages:
Simple feature geometry storage is used for features. This
storage model is open, efficient, and scales to large sizes and
numbers of users.
This simple features data model is transactional and is
multiuser. By contrast, the older topological storage models
will not scale and have difficulties supporting multiple editor
transactions and numerous other GIS data management
workflows.
Geodatabase topologies fully support all the long transaction
and versioning capabilities of the geodatabase. Geodatabase
topologies need not be tiled, and many users can
simultaneously edit the topological database—even their
individual versions of the same features if necessary.
Feature classes can grow to any size (hundreds of millions of
features) with very strong performance.
What are the advantages of topology?
Topology can be added this to an existing schema of spatially
related feature classes. The alternative is that you must
redefine and convert all your existing feature classes to new
data schemas holding topological primitives.
There need only be one data model for geometry editing and
data use, not two or more.
It is interoperable because all feature geometry storage adheres
to simple features specifications from the Open Geospatial
Consortium and ISO.
Data modeling is more natural because it is based on user
features (such as parcels, streets, soil types, and watersheds)
instead of topological primitives (such as nodes, edges, and
faces). Users will begin to think about the integrity rules and
behavior of their actual features instead of the integrity rules of
the topological primitives. For example, how do parcels
behave? This will enable stronger modeling for all kinds of
geographic features. It will improve our thinking about streets,
soils types, census units, watersheds, rail systems, geology,
forest stands, land forms, physical features, and on and on.
What are the advantages of topology?
Geodatabase topologies provide the same information content as
maintained topological implementations—either you store a
topological line graph and discover the feature geometry (like
coverages) or you store the feature geometry and discover the
topological elements and relationships (like geodatabases).
In cases where users want to store the topological primitives, it
is easy to create and post topologies and their relationships to
tables for various analytic and interoperability purposes (such
as users who want to post their features into an Oracle Spatial
warehouse that stores tables of topological primitives).
At a pragmatic level, the ArcGIS topology implementation
works. It scales to extremely large geodatabases and multiuser
systems without loss of performance. It includes validation and
editing tools for building and maintaining topologies in
geodatabases. It includes rich and flexible data modeling tools
that enable users to assemble practical, working systems on file
systems, in any relational database, and on any number of
schemas.

Contenu connexe

Similaire à GIS 5103 – Fundamentals of GISLecture 83D GIS.docx

09165566d individual
09165566d individual09165566d individual
09165566d individual
lsgi4321
 
Best practices for_managing_geospatial_data1
Best practices for_managing_geospatial_data1Best practices for_managing_geospatial_data1
Best practices for_managing_geospatial_data1
Leng Kim Leng
 
What we've done so far with mago3D, an open source based 'Digital Twin' platf...
What we've done so far with mago3D, an open source based 'Digital Twin' platf...What we've done so far with mago3D, an open source based 'Digital Twin' platf...
What we've done so far with mago3D, an open source based 'Digital Twin' platf...
SANGHEE SHIN
 
EGU General Assembly 2013: The new g.infer add-on for GRASS GIS
EGU General Assembly 2013: The new g.infer add-on for GRASS GISEGU General Assembly 2013: The new g.infer add-on for GRASS GIS
EGU General Assembly 2013: The new g.infer add-on for GRASS GIS
Peter Löwe
 
Martin Lander 14 Resit Project – Globally Conscious Engineer .docx
Martin Lander 14  Resit Project – Globally Conscious Engineer .docxMartin Lander 14  Resit Project – Globally Conscious Engineer .docx
Martin Lander 14 Resit Project – Globally Conscious Engineer .docx
infantsuk
 
Data Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup GroupData Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup Group
Doug Needham
 
F21SC Industrial Programming CW2 Data Analytics (35) 20192.docx
F21SC Industrial Programming CW2 Data Analytics (35) 20192.docxF21SC Industrial Programming CW2 Data Analytics (35) 20192.docx
F21SC Industrial Programming CW2 Data Analytics (35) 20192.docx
lmelaine
 

Similaire à GIS 5103 – Fundamentals of GISLecture 83D GIS.docx (20)

09165566d individual
09165566d individual09165566d individual
09165566d individual
 
Multiplaform Solution for Graph Datasources
Multiplaform Solution for Graph DatasourcesMultiplaform Solution for Graph Datasources
Multiplaform Solution for Graph Datasources
 
GIS in Nutshell
GIS in NutshellGIS in Nutshell
GIS in Nutshell
 
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
Exploration and 3D GIS Software - MapInfo Professional Discover3D 2015
 
Portfolio
PortfolioPortfolio
Portfolio
 
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...SD-miner System to Retrieve Probabilistic Neighborhood Points  in Spatial Dat...
SD-miner System to Retrieve Probabilistic Neighborhood Points in Spatial Dat...
 
A Uniform Specification Of Mixed Reality Interface Components
A Uniform Specification Of Mixed Reality Interface ComponentsA Uniform Specification Of Mixed Reality Interface Components
A Uniform Specification Of Mixed Reality Interface Components
 
D3.JS Tips & Tricks (export to svg, crossfilter, maps etc.)
D3.JS Tips & Tricks (export to svg, crossfilter, maps etc.)D3.JS Tips & Tricks (export to svg, crossfilter, maps etc.)
D3.JS Tips & Tricks (export to svg, crossfilter, maps etc.)
 
A Review on data visualization tools used for Big Data
A Review on data visualization tools used for Big DataA Review on data visualization tools used for Big Data
A Review on data visualization tools used for Big Data
 
International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)International Journal of Engineering Research and Development (IJERD)
International Journal of Engineering Research and Development (IJERD)
 
Best practices for_managing_geospatial_data1
Best practices for_managing_geospatial_data1Best practices for_managing_geospatial_data1
Best practices for_managing_geospatial_data1
 
What we've done so far with mago3D, an open source based 'Digital Twin' platf...
What we've done so far with mago3D, an open source based 'Digital Twin' platf...What we've done so far with mago3D, an open source based 'Digital Twin' platf...
What we've done so far with mago3D, an open source based 'Digital Twin' platf...
 
A NOVEL APPROACH FOR PROCESSING BIG DATA
A NOVEL APPROACH FOR PROCESSING BIG DATAA NOVEL APPROACH FOR PROCESSING BIG DATA
A NOVEL APPROACH FOR PROCESSING BIG DATA
 
Concepts and Methods of Embedding Statistical Data into Maps
Concepts and Methods of Embedding Statistical Data into MapsConcepts and Methods of Embedding Statistical Data into Maps
Concepts and Methods of Embedding Statistical Data into Maps
 
Concepts and Methods of Embedding Statistical Data into Maps
Concepts and Methods of Embedding Statistical Data into Maps Concepts and Methods of Embedding Statistical Data into Maps
Concepts and Methods of Embedding Statistical Data into Maps
 
EGU General Assembly 2013: The new g.infer add-on for GRASS GIS
EGU General Assembly 2013: The new g.infer add-on for GRASS GISEGU General Assembly 2013: The new g.infer add-on for GRASS GIS
EGU General Assembly 2013: The new g.infer add-on for GRASS GIS
 
Martin Lander 14 Resit Project – Globally Conscious Engineer .docx
Martin Lander 14  Resit Project – Globally Conscious Engineer .docxMartin Lander 14  Resit Project – Globally Conscious Engineer .docx
Martin Lander 14 Resit Project – Globally Conscious Engineer .docx
 
Data Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup GroupData Science Challenge presentation given to the CinBITools Meetup Group
Data Science Challenge presentation given to the CinBITools Meetup Group
 
Cloudera Data Science Challenge
Cloudera Data Science ChallengeCloudera Data Science Challenge
Cloudera Data Science Challenge
 
F21SC Industrial Programming CW2 Data Analytics (35) 20192.docx
F21SC Industrial Programming CW2 Data Analytics (35) 20192.docxF21SC Industrial Programming CW2 Data Analytics (35) 20192.docx
F21SC Industrial Programming CW2 Data Analytics (35) 20192.docx
 

Plus de shericehewat

You have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docxYou have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docx
shericehewat
 
You have been hired asa cons.docx
You have been hired asa cons.docxYou have been hired asa cons.docx
You have been hired asa cons.docx
shericehewat
 
You choose one and I will upload the materials for u.Choose 1 of.docx
You choose one and I will upload the materials for u.Choose 1 of.docxYou choose one and I will upload the materials for u.Choose 1 of.docx
You choose one and I will upload the materials for u.Choose 1 of.docx
shericehewat
 
You are to interview a woman 50 and older and write up the interview.docx
You are to interview a woman 50 and older and write up the interview.docxYou are to interview a woman 50 and older and write up the interview.docx
You are to interview a woman 50 and older and write up the interview.docx
shericehewat
 
You are the vice president of a human resources department and Susan.docx
You are the vice president of a human resources department and Susan.docxYou are the vice president of a human resources department and Susan.docx
You are the vice president of a human resources department and Susan.docx
shericehewat
 
You are the environmental compliance officer at a company that is .docx
You are the environmental compliance officer at a company that is .docxYou are the environmental compliance officer at a company that is .docx
You are the environmental compliance officer at a company that is .docx
shericehewat
 

Plus de shericehewat (20)

You have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docxYou have been asked to explain the differences between certain categ.docx
You have been asked to explain the differences between certain categ.docx
 
You have been asked to help secure the information system and users .docx
You have been asked to help secure the information system and users .docxYou have been asked to help secure the information system and users .docx
You have been asked to help secure the information system and users .docx
 
You have been asked to participate in a local radio program to add.docx
You have been asked to participate in a local radio program to add.docxYou have been asked to participate in a local radio program to add.docx
You have been asked to participate in a local radio program to add.docx
 
You have been hired asa cons.docx
You have been hired asa cons.docxYou have been hired asa cons.docx
You have been hired asa cons.docx
 
You have been appointed as a system analyst in the IT department of .docx
You have been appointed as a system analyst in the IT department of .docxYou have been appointed as a system analyst in the IT department of .docx
You have been appointed as a system analyst in the IT department of .docx
 
You choose one and I will upload the materials for u.Choose 1 of.docx
You choose one and I will upload the materials for u.Choose 1 of.docxYou choose one and I will upload the materials for u.Choose 1 of.docx
You choose one and I will upload the materials for u.Choose 1 of.docx
 
You are Incident Commander and principal planner for the DRNC even.docx
You are Incident Commander and principal planner for the DRNC even.docxYou are Incident Commander and principal planner for the DRNC even.docx
You are Incident Commander and principal planner for the DRNC even.docx
 
You DecideCryptographic Tunneling and the OSI ModelWrite a p.docx
You DecideCryptographic Tunneling and the OSI ModelWrite a p.docxYou DecideCryptographic Tunneling and the OSI ModelWrite a p.docx
You DecideCryptographic Tunneling and the OSI ModelWrite a p.docx
 
You are working as a behavioral health specialist in a neurological .docx
You are working as a behavioral health specialist in a neurological .docxYou are working as a behavioral health specialist in a neurological .docx
You are working as a behavioral health specialist in a neurological .docx
 
You are to write up a reflection (longer than 2 pages) that discusse.docx
You are to write up a reflection (longer than 2 pages) that discusse.docxYou are to write up a reflection (longer than 2 pages) that discusse.docx
You are to write up a reflection (longer than 2 pages) that discusse.docx
 
You can only take this assignment if you have the book Discovering t.docx
You can only take this assignment if you have the book Discovering t.docxYou can only take this assignment if you have the book Discovering t.docx
You can only take this assignment if you have the book Discovering t.docx
 
You are to interview a woman 50 and older and write up the interview.docx
You are to interview a woman 50 and older and write up the interview.docxYou are to interview a woman 50 and older and write up the interview.docx
You are to interview a woman 50 and older and write up the interview.docx
 
You are to complete TWO essays and answer the following questions.  .docx
You are to complete TWO essays and answer the following questions.  .docxYou are to complete TWO essays and answer the following questions.  .docx
You are to complete TWO essays and answer the following questions.  .docx
 
You are the vice president of a human resources department and Susan.docx
You are the vice president of a human resources department and Susan.docxYou are the vice president of a human resources department and Susan.docx
You are the vice president of a human resources department and Susan.docx
 
You are the purchasing manager of a company that has relationships w.docx
You are the purchasing manager of a company that has relationships w.docxYou are the purchasing manager of a company that has relationships w.docx
You are the purchasing manager of a company that has relationships w.docx
 
You are to briefly describe how the Bible is related to the topics c.docx
You are to briefly describe how the Bible is related to the topics c.docxYou are to briefly describe how the Bible is related to the topics c.docx
You are to briefly describe how the Bible is related to the topics c.docx
 
You are the manager of an accounting department and would like to hi.docx
You are the manager of an accounting department and would like to hi.docxYou are the manager of an accounting department and would like to hi.docx
You are the manager of an accounting department and would like to hi.docx
 
You are the new chief financial officer (CFO) hired by a company. .docx
You are the new chief financial officer (CFO) hired by a company. .docxYou are the new chief financial officer (CFO) hired by a company. .docx
You are the new chief financial officer (CFO) hired by a company. .docx
 
You are the manager of a team of six proposal-writing professionals..docx
You are the manager of a team of six proposal-writing professionals..docxYou are the manager of a team of six proposal-writing professionals..docx
You are the manager of a team of six proposal-writing professionals..docx
 
You are the environmental compliance officer at a company that is .docx
You are the environmental compliance officer at a company that is .docxYou are the environmental compliance officer at a company that is .docx
You are the environmental compliance officer at a company that is .docx
 

Dernier

Dernier (20)

How to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POSHow to Manage Global Discount in Odoo 17 POS
How to Manage Global Discount in Odoo 17 POS
 
Graduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - EnglishGraduate Outcomes Presentation Slides - English
Graduate Outcomes Presentation Slides - English
 
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
80 ĐỀ THI THỬ TUYỂN SINH TIẾNG ANH VÀO 10 SỞ GD – ĐT THÀNH PHỐ HỒ CHÍ MINH NĂ...
 
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
Sensory_Experience_and_Emotional_Resonance_in_Gabriel_Okaras_The_Piano_and_Th...
 
Interdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptxInterdisciplinary_Insights_Data_Collection_Methods.pptx
Interdisciplinary_Insights_Data_Collection_Methods.pptx
 
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptxHMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
HMCS Max Bernays Pre-Deployment Brief (May 2024).pptx
 
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
TỔNG ÔN TẬP THI VÀO LỚP 10 MÔN TIẾNG ANH NĂM HỌC 2023 - 2024 CÓ ĐÁP ÁN (NGỮ Â...
 
Sociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning ExhibitSociology 101 Demonstration of Learning Exhibit
Sociology 101 Demonstration of Learning Exhibit
 
Wellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptxWellbeing inclusion and digital dystopias.pptx
Wellbeing inclusion and digital dystopias.pptx
 
General Principles of Intellectual Property: Concepts of Intellectual Proper...
General Principles of Intellectual Property: Concepts of Intellectual  Proper...General Principles of Intellectual Property: Concepts of Intellectual  Proper...
General Principles of Intellectual Property: Concepts of Intellectual Proper...
 
FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024FSB Advising Checklist - Orientation 2024
FSB Advising Checklist - Orientation 2024
 
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdfUGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
UGC NET Paper 1 Mathematical Reasoning & Aptitude.pdf
 
REMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptxREMIFENTANIL: An Ultra short acting opioid.pptx
REMIFENTANIL: An Ultra short acting opioid.pptx
 
Python Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docxPython Notes for mca i year students osmania university.docx
Python Notes for mca i year students osmania university.docx
 
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptxHMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
HMCS Vancouver Pre-Deployment Brief - May 2024 (Web Version).pptx
 
Key note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdfKey note speaker Neum_Admir Softic_ENG.pdf
Key note speaker Neum_Admir Softic_ENG.pdf
 
How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17How to Add New Custom Addons Path in Odoo 17
How to Add New Custom Addons Path in Odoo 17
 
Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)Jamworks pilot and AI at Jisc (20/03/2024)
Jamworks pilot and AI at Jisc (20/03/2024)
 
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
Beyond_Borders_Understanding_Anime_and_Manga_Fandom_A_Comprehensive_Audience_...
 
On National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan FellowsOn National Teacher Day, meet the 2024-25 Kenan Fellows
On National Teacher Day, meet the 2024-25 Kenan Fellows
 

GIS 5103 – Fundamentals of GISLecture 83D GIS.docx

  • 1. GIS 5103 – Fundamentals of GIS Lecture 8 3D GIS Dimensionality A dimension is the minimum amount of coordinates needed to define the mathematical space of an object. Lecture 8 – 3D GIS 2DA figure that has only length & height as its dimensions. 2D shapes lie on a flat surface; known as plane figures or plane shapes. While they have areas, 2D shapes have no volume.2D figures are plotted on two axes, the x- and y-axes. Lecture 8 – 3D GIS 3DLength + Height + WidthX + Y + Z Volume + Surface AreaExamples include: 3D multi-patch building, roof, interior floors, and foundation would all contain different z-values for the same 2D coordinate. Aircraft's 3D position or a walking
  • 2. trail up a mountain, would only have a single z-value for each X,Y location. Lecture 8 – 3D GIS 2.5DHighly used in GIS to represent Z data that is not continuous = 3DZ data does not need to be elevation. Pollution, # of cases, etc. Lecture 8 – 3D GIS 3D file types .3dd.sxdWhen you import an ArcGlobe or ArcScene document:First the .3dd file opens by default in global mode Secondly the .sxd file opens in local mode. Any new blank scene view defaults to global mode. Lecture 8 – 3D GIS Data TypesVector FeaturesPoint / Line / Polygon Surface typesTIN (Triangular Irregular Networks)DEM’s (Digital Elevation Models)Raster SurfaceLAS Dataset Lecture 8 – 3D GIS 3D StylesPointsGeometric ShapesModelsMarkersImportedLinesTextured SymbolsGeometric ShapesPolygonsTexture FillBasic Colors Lecture 8 – 3D GIS Lecture 8 – 3D GIS Perspective ViewPerspective drawing is the most common
  • 3. drawing mode in 3D, where features in the foreground are shown larger than those off in the distance. This matches the way we see the world in our day-to-day lives, and the result is a realistic representation of 3D content. All scenes open in perspective viewing mode. You can switch between Perspective Perspective View and Parallel Isometric View viewing modes using the Drawing Mode drop-down menu in the Scene group on the View tab. Lecture 8 – 3D GIS Parallel / Isometric ViewParallel drawing renders the 3D view using a parallel projection, where features of the same physical size are rendered on-screen identically, regardless of their distance from the viewing camera. Parallel drawing is useful for architectural drawings, as well as for representing statistical data in a 3D view, such as extruded shapes symbolizing numeric values. Lecture 8 – 3D GIS 3D Analyst ToolsData Conversion/PreparationTxt / Binary / Feature class / Raster / TINSurface CreationInterpolation / LASD CreationSurface AnalysisAspect / Slope / Contour / Feature Interpolation3D Operator & VisibilitySkyline / Inter- visibility / Sun Shadow Analysis Lecture 8 – 3D GIS There are five exploratory analysis tools:The Line of Sight tool creates sight lines to determine if one or more targets are visible from a given observer location.The View Dome tool determines the parts of a sphere that are visible from an observer located at the center.The Viewshed tool determines the visible surface area from a given observer location through a
  • 4. defined viewing angle.The Slice tool visually cuts through the view's display to reveal hidden content.The Cut and Fill tool provides volume calculations while visually demonstrating the surface area where the ground will be filled or removed. Lecture 8 – 3D GIS Exploratory Analysis ToolsThe first three analysis tools determine the visible sight lines, field of view, or spherical visibility coverage. The Slice tool temporarily removes displayed content using clipping planes as volumes.The Cut and Fill tool calculates volumetric displacement using a planar surface. In all cases, the results are temporary and must be saved as feature classes if you want to reload the interactive objects again. Lecture 8 – 3D GIS illumination propertiesA unique element to authoring scenes is that you can also define illumination properties. This includes properties such as the time of day, whether or not the sun casts a shadow, and how much ambient light is used. In global mode, you can also simulate atmospheric diffusion. Lecture 8 – 3D GIS By the end of this week you should be able to: Understand dimensionality List the differences between 2D vs 2.5D vs 3D Know the data types, Views, & Scene Types for 3D GIS Understand Surfaces & How to create/find themFamiliarize yourself with the Exploratory & Analyst toolsList the unique properties of 3D GIS vs Traditional 2D mapsComprehend the issues of 3D GIS Lecture 8 – 3D GIS
  • 5. GIS 5103 – Fundamentals of GIS Lecture 7 Intro to R-Studio & Python for GIS Week 7 Winter 2019 What is R-Studio & why should I use it?Free open-source integrated coding language.Powerful computing/programing & simple UIAlso very popular Intro to R & Python - Lecture 7 Applications & CapabilitiesStatisticsGraphicsProgramingCalculatorGISNeural NetworksEtc. Intro to R & Python - Lecture 7 Intro to R & Python - Lecture 7 Installinghttps://courses.edx.org/courses/UTAustinX/UT.7.01x/ 3T2014/56c5437b88fa43cf828bff5371c6a924/ Set working DirectoryDefining the working area / confining thinking space / Files of interest located here
  • 6. Intro to R & Python - Lecture 7 Import DataFiles of interest located in working directory. Intro to R & Python - Lecture 7 Manipulate & Organize Data Intro to R & Python - Lecture 7 Various functions for parameters of interest Able to work with data and create data. Convert, Transpose, data with single commands. Plots / DiagramsplotMF(object.cls)plotMF(object) Intro to R & Python - Lecture 7 Most Popular Uses of RData ManagementConversion, sub- setting, merging, transformationsStatistical & Mathematical AnalysisANOVA’s, Modeling, RegressionsProgramming / Operations / System InterfaceAutomation, Special ConditionsGraph / Data Display / ConfigurationData Visualization, High CustomizationSimulationsHigh level scenario modeling Intro to R & Python - Lecture 7
  • 7. Variables Can define characters/numbers as nearly anything x = 5 First_Name = Mike f = (sin(y) / tan(sum(z)) * variance(y/z) Name – Type – Size – Value Created Title– character/matrix/list – Bytes/KB/MB – Output Intro to R & Python - Lecture 7 Functions & Syntax Mean(x, trim = 0, na.rm = FALSE, …) Function(input, argument 1, argument 2, …) *Commas used to separate arguments/inputs* Intro to R & Python - Lecture 7 Intro to R & Python - Lecture 7 Help Dialogue
  • 8. Intro to R & Python - Lecture 7 What is Python & why should I use it?Object-orientated, high level programming language with dynamic semanticBuilt in data structures, dynamic typing/bindingUsed as glue to connect programs / components together.Supports modules / packages / code re-use Intro to R & Python - Lecture 7 Python Libraries = R-Studio PackagesGIS LibrariesGDALOGRPyprojShapelyMapnikMySQLPostGISSpati aLite Intro to R & Python - Lecture 7 How to download Pythonhttps://realpython.com/installing- python/ How to Pip install librarieshttps://packaging.python.org/tutorials/installing- packages/ Intro to R & Python - Lecture 7 Intro to R & Python - Lecture 7 How we will use python & Libraries Example of ArcGIS & python https://developers.arcgis.com/python/sample-
  • 9. notebooks/chennai-floods-analysis/ Intro to R & Python - Lecture 7 GIS AutomationPythonR-StudioModel Builder Intro to R & Python - Lecture 7 Common Automated processesAddress systemsData UpdatingMap TemplatingSpatial JoinsDigitizing CalculationsModelsWork Flows Intro to R & Python - Lecture 7 After reviewing this lecture you should be able to: List applications/uses of a coding languagesKnow many of the popular input file typesUnderstand/Replicate functions, syntax, styleKnow the names of essential packages/librariesList commonly automated tasksBe able to download & Install the softwares Intro to R & Python - Lecture 7 2 Foundations of Geographic Information Science GIS 5103 Fall 2019 - Week Six Vector Analysis
  • 10. Objectives: By the end of week seven you will be able to: Describe Measurements in GIS; Perform data queries in ArcGIS Pro Explain techniques used in reclassification of data Describe Buffering Functions and map overlays Earlier in this course we covered some of the questions that can be answered by GIS. Over the past few weeks we have been discussing different types of data, data formats, data models, data creation and editing, and finding/correcting errors in data. Once we have our software, computer hardware, personnel, and data in place we can start to answer questions with our system. This week we will take a look at some of the questions a GIS can answer and also some of the tools GIS uses to answer the questions like the ones listed below: Where are particular features found? What geographical patterns exist? Where have changes occurred over a given period? Where do certain conditions apply? What will the spatial implications be if an organization takes a certain action? Common questions answered by GIS
  • 11. Questions for GIS Analysis Of course “where” is the predominant question but there are some additional questions beyond just where. Below are some examples of the questions beyond where. Location: what is at or where is it? E.g., where is the nearest Mexican restaurant to my house? Patterns: what spatial patterns exist? E.g., where do most of the students live in Boston? Trends: what has changed since? Has the number of deer incidents on roads reduced since commuters have used the deer whistles? Conditions: where can I find something close to, within or by? Modeling/Implications: what if? An example may include: if a new baseball park were built adjacent to Gillette stadium, how would traffic flow be affected? Our data analysis procedures assist us in answering these questions. Let us look at them more closely now. Making Measurements in GIS Calculating lengths, perimeters and areas are common applications in GIS. For example, measuring the length of a road from a digital map is pretty straightforward. However, a GIS can make measurements on other types of features as well. There are a number of tools at your disposal and depending on the type of data you are using (raster or vector). The next few slides will introduce you to some of those measuring tools and techniques.
  • 12. a. Length (distance); b. Perimeter; c. Area Vector GIS measurements: (a) distance; (b) area; and (c)perimeter can be measured as the sum of straight line lengths. The figure to the right shows how these quantities are calculated by the GIS, but as users it is much easier for us. All that is needed is to click on the measure tool in the toolbar of the ArcMap GIS software. The GIS does the rest. In Vector GIS, length, perimeter and area data can be stored as attributes in a database. By storing the lengths and areas as attributes we can select features meeting a certain criteria and sum the areas or lengths. For example how many acres of wetlands do we have in a city? Measurements in GIS The figure above from Esri shows 3 and 5 minute drive time polygons from a downtown area. Drivetime polygons can be measured using either a straight line distance or by using road networks and traffic conditions. Example of a Measurement Queries in GIS Queries are the main data retrieval operations that are useful at
  • 13. all stages of GIS analysis for everything from data quality checking to a final analysis. Features may be selected by two primary methods: Select by Location - select features based on their location relative to features in another layer. For instance, if you want to know how many homes were affected by a recent flood, you could select all the homes that fall within the flood boundary. You can use a variety of selection methods to select the point, line, or polygon features in one layer that are near or overlap the features in the same or another layer. (Esri, 2017) Select by Attribute - Select By Attributes allows you to provide a SQL query expression that is used to select features that match the selection criteria. (Esri, 2017). An example procedure for selecting features by attribute might include: A prospective buyer wants to know where all the 3 bedroom houses are for under $175,000 (apparently it’s possible, just not in Boston). The ArcGIS user can call the Selection menu in the software and then by pressing the “select by attributes” option. This will provide a new window with fields, unique values and numerical operators within it. This window allows the attribute table to be queried. A command line query question may then look like: status=Y + Bedroom=3+ saleprice=$175000. Where: status refers to house availability; 3 refers to the desired number of bedrooms; and sale price refers to… well, the sale price,and a dream sale price at that for any buyer in Boston. Select by Attribute
  • 14. Selecting by attributes involves the use of Boolean Operators (Pg. 380) in which use the conditions OR, AND, or NOT. In Esri’s ArcMap software we can use the “Select by Attributes” dialog box to build an expression. The dialog box to the right lists the layer “zebra_mussel” we want to select features from, the attribute fields “STATE” and “YEAR”. The buttons and dialog box can be used to select all of the points from th. e feature layer “zebra_mussel” that are in the state of New York and are from 2005. We can use this dialog box to build very complex expressions and even save the expressions for future use. The next slide will show an example of how more complex expressions can be built. Select by Attributes Queries Continued This image illustrates how an attributes or non-spatial query is executed in ArcGIS. In previous chapters we had discussed how SQL Queries can be used to select features from an attribute table. Notice the query “SELECT * FROM Parcel_Poly_BT_2008_LOCID WHERE: BEDROOMS = 3 AND FULLBATHS= 2 AND TOTALVALUE<175,000. Essentially, the query is telling the software to find all of the homes in Waltham that have 2 full baths, 3 Bedrooms, that cost less than $175,000.By using operators such as “AND”, “OR”,“NOT”,etc. we can build queries based on whatever criteria we choose. I can execute this query by clicking on the “Apply” button. The parcels that are selected can then be
  • 15. exported to create a new layer or table. We can also modify our query to refine our selection. You will have an opportunity to gain some hands-on experience working with these queries in the co-requisite with this course; GIS5102. An example procedure of selecting features by location might include: An analyst wants to know where all the osprey nests are within a distance of 0.5 miles to a major highway. The ArcGIS user can call the Selection menu in the software and then by pressing the “selecting by location” tool. This will allow the user to use the operators: “are within a distance of” (0.5miles to freeway) and “contain” (osprey nests) in order to carry out the query. In the following slide, an example is included of selecting particular soil types by query as part of a step to produce a soil erosion model. It is important to realize that these techniques can be used sequentially and do not have to be standalone. Queries Continued Example of a typical spatial query A spatial query is used to find features that are adjacent, within a distance of, are contained by, crossed by, etc. The graphic to the right looks similar to the “Select by Attributes” dialog box on the previous slides but the main difference is that instead of using SQL queries or Boolean operators we are able to build expressions to find features that are near, within, are within a distance of, touching, or intersecting another feature.
  • 16. The following set of slides illustrates how a “Spatial Query”is executed. In this following example we will be selecting all of the parcels within 300 feet of our selected parcel (highlighted in red). Example of a typical spatial query We can select features adjacent to or within a specified distance of a selected feature using “Select by Location”. Example of a typical spatial query Notice the “Select by Location” toolbox that now appears on the screen. The first dropdownbox allows me to specify which operation I would like to use. In the second box, the “Parcel_Poly” layer is checked off. By checking off the “Parcel_Poly” layer I am specifying which layer I will be querying. I could also use the select by locations dialog to query features that are in separate layers, for instance I could also create a query that would find all of the hydrants within 300 feet of the selected property. The drop-down menu at the bottom of the dialog box allows me specify the type of location query I would like to use. In this example I will use the “are within a distance of” option. Example of a typical spatial query At the bottom of the dialog box I can specify a distance that I
  • 17. would like to use. In this case I will use 300 feet. Once I have built the select by location query I will click on the apply button and all of the parcels within 300 feet of the subject property are now highlighted in red. Earlier in the chapter we looked at ways of finding features that are within a certain distance of a selected feature or features. This distance is commonly referred to as a buffer. Buffers are drawn based on a user specified distance and vary in shape based upon whether the feature being buffered is a point (a), a line (b), or a polygon (c). The diagram below illustrates how these shapes are drawn. If multiple features are being buffered and the buffer polygons overlap you can dissolve the buffer boundaries if you choose to create a continuous buffer polygon. Buffering and neighborhood functions a b c We discussed one way to use a buffer is to designate an area for special protection. In this example, 1000 foot buffer zones (purple) around the rail network (black) are established to define an oil spill impact area. This is what buffers around line features look like. An Example of a Buffer
  • 18. The graphic above illustrates a buffer polygon that was created to show an area 300 feet around a parcel polygon. The buffer is shown in blue. When buffering a polygon, the buffer that is created emulates the shape of the object being buffered. In this case a polygon. An Example of a Buffer Map Overlay: Data Integration One of the most basic questions asked of a GIS is "What's on top of what?" For example: What land use is on top of what soil type? What parcels are within the 100-year floodplain? ("Within" is just another way of saying "on top of.") What roads are within what counties? What wells are within abandoned military bases? To answer such questions before the days of GIS, cartographers would create maps on clear plastic sheets and overlay these sheets on a light table (remember Week 1) to create a new map of the overlaid data. Because overlay yields such valuable information, it was paramount to the development of GIS. There are two types of overlays in GIS Vector and Raster Map Overlay: Data Integration
  • 19. An overlay operation is much more than a simple merging of line work; all the attributes of the features taking part in the overlay are carried through, as shown in the example to the right, where parcels (polygons) and flood zones (polygons) are overlaid (using the Union tool) to create a new polygon dataset. The parcels are split where they are crossed by the flood zone boundary, and new polygons created. The FID_flood value indicates whether polygons are outside (-1) or inside the flood zone, and all polygons retain their original land-use category values. Esri 2017. Map Overlay: Data Integration The total area of each land-use type in the flood zone can be calculated by selecting all polygons within the flood zone (using the Select Layer By Attribute tool, for example) and summarizing the area by land- use type (using the Frequency tool). Following is a chart illustrating the result of this calculation. (Esri 2018). Map Overlay: Data Integration There are a number of vector overlay tools that are common to most commercially available GIS softwares. The graphic to the right lists some of these tools. The tools you use vary based on the output you desire but they generally involve creating or updating new datasets. You will have a chance to use some of these tools in your analysis class and also in GIS5201 – Advanced Spatial Analysis.
  • 20. We have now looked at some of the common tools and techniques used to conduct a GIS analysis. Now that we have described some of the techniques and tools how and when do we apply them? Also, how and where do we begin? Below are five points to consider when conducting a GIS analysis. Frame the question – Begin by defining the objectives of your analysis. What are the primary questions you are trying to answer? Explore and prepare data – Do you have the data you need to conduct the analysis? Do you have to create new data or acquire it from another source? These are critical questions. You may not be able to acquire or develop the data you need from the sources you have access to within the time frame or cost associated with the analysis. Choose analysis methods and tools – What tools or methods do you need to perform your analysis? Perform the analysis – Your analysis should be conducted a number of times to make sure your results make sense. You should have a good working knowledge of the subject matter. Examine and refine results – Examine your results and refine your output products. Are your results repeatable? Your maps and tables should be designed to be easily understood by your intended audience. Common questions answered by GIS
  • 21. Foundations of Geographic Information Science GIS5103 – Fall 2019 Week 4 Database Management Outline Differentiate between relational database management systems (RDBMS), databases, and data files. Explain why we select the database approach for GIS Define what a relational database is Describe how databases are created Describe GIS database applications Database Management System (DBMS) What is a Database Management System (DBMS)? A DBMS is a system [or software] designed to manage a database and allow user- selected operations to be performed on the data. 1. 2. 3. 4. It is a logical construct for the storage and retrieval of information. It puts the database into a geographic coordinate system. Attribute data models are needed for the DBMS. It contains data definition language; a data-entry module; a data update module; a report generator; and query language.
  • 22. What is the difference between and database and a database management system? A database is a simply a collection of related data. This can include non-computerized data such as those found in a ‘phone or address book. It can also include information contained in spreadsheets or also in relational databases or databases comprised of multiple tables that are connected or related to each other using common fields or “keys” found in each of the tables. Data in a computer database, however, are managed and accessed through a database management system (DBMS). The database management system is used to manage, access the database The Database is distinct from the DBMS In order for there to be a DBMS, there must be a database. In order for there to be a database there must be data files Bolstad’s Figure 8-4. It shows the database approach to data handling data. First comes the Data File, then the database and then the DBMS So, what is the difference between a data file and a database? Heywood discusses this on p.110. For simplicity, below is a comparison in tabular form. The image above from Screencast illustrated the differences between a flat file (like a spreadsheet) and the relational database model structure (RDBMS).
  • 23. Creating the Database, 337-339 To design a database, the user must: Identify the features (i.e. entities) to be represented (e.g., forests, trails, and recreational features. These are all nouns); Identify the appropriate tables for a relational database. What is the ID# that could allow a relational join for e.g. (Forest- ID)? Identify the feature’s attributes (e.g., Trail Difficulty) Identify relationships between the features (e.g., what activities, trails, can be explored at each of the forests? What trails are in each forest and how difficult area they?); Figure 8-6: Forest data in a relational database structure. Creating the Database Keys or Key ID’s are extremely important in a relational database. They allow us to combine and display data from other tables and also allow us to store, query, and edit data more efficiently. There are two types of keys, primary and foreign. A primary key, also called a primary keyword, is a key in a relational database that is unique for each record. It is a unique identifier, such as a driver license number, telephone number (including area code), or vehicle identification number (VIN). In the example to the left, the Forest-ID would be the primary key. A relational database must always have one and only one primary key. A foreign key is a set of one or more columns in a table that refers to the primary key in another table. There isn’t any special code, configurations, or table definitions you need to
  • 24. place to officially “designate” a foreign key. For example, trail name. Cardinality In the context of databases, cardinality refers to the uniqueness of data values contained in a column. High cardinality columns are those with very unique or uncommon data values. For example, in a database table that stores bank account numbers, the “Account Number” column should have very high cardinality – by definition, every item of data in this column should be totally unique. Normal cardinality columns are those with a somewhat unique percentage of data values. For instance, if a table holds customer information, the “Last Name” column would have normal cardinality. Not every last name will be unique (for example, there will likely be several occurrences of “Smith”) but on the whole, the data is fairly non-repetitive. Low cardinality columns are those with very few unique values. In a customer table, a low cardinality column would be the “Gender” column. This column will likely only have “M” and “F” as the range of values to choose from, and all the thousands or millions of records in the table can only pick one of these two values for this column. Cardinality relationships between tables can take the form of one-to-one, one-to-many (whose reversal is many-to-one) or many-to-many. These terms simply refer to the relationships of data between the tables. For example, the relationship between the “Customers” table and the “Bank Accounts” table is one-to-
  • 25. many, that is, one customer can have several accounts, but one account cannot belong to more than one customer. That is, of course, assuming this bank has never heard of joint accounts! Cardinality Cardinality is commonly expressed in the following ways: Databases In many tabular databases (e.g., Dbase, SQL Server, ORACLE, Excel etc.), it is possible to list data, create queries, and sort the results. Linking a tabular database to a GIS, however, provides additional “spatial” capabilities that enhance the value of the data to provide useful information. The “geographic search” is the secret to GIS data retrieval, in addition to the ability to get back on demand data that were previously stored. Let’s take a look some examples of GIS software and the “spatial” database facilities they provide. Multitier Architecture . Figure 8-5: Data are stored and accessed from the databases on the bottom tier and requests to view the data (queries) are made from the clients on the top tier consisting of desktop and web based products. Application servers act as the middle-men, processing transactions from the clients to the databases on the
  • 26. bottom tier. In well designed systems there are rules or policies setup that control how the data is accessed and by whom. We will look at this in more detail layer in the lecture. Spatial Database facilities in ArcGIS ESRI’s ArcGIS software is packaged with tutorials for ArcCatalog, ArcMap, ArcToolbox, and any extensions that you purchase. ESRI maintains on-line resources for teaching GIS in higher education, which includes links to lab exercises. The software you are using in the analysis course is, of course, ESRI software. GIS and image processing software developed at Clark Labs - an educational and research institution at Clark University in Worcester, Massachusetts - is called Idrisi. It is easy to use while providing professional-level GIS, image processing, and spatial statistics analysis on Windows computers. It originally focused on handling raster data. Database facilities in Idrisi http://www.clarklabs.org/ The historical approach to data handling is that a different version of the visitor’s details may be stored in separate databases. Additionally, data may be duplicated, errors may occur during transcription, and data storage mechanisms are often inflexible etc. Why Choose a Database Approach?
  • 27. Card index record from ski school manual database It is also very difficult to update files using the traditional approach, as illustrated in this example for a hardcopy index card held for an individual. We are striving for the database approach for data handling Key Points for databases: They are collections of related material shared by different users; Data is organized for easy reference with no duplication (e.g., think of a phone book); Computer databases are superior to hard- copy ones because they are easier to update and data can be adapted to different uses (e.g., unlike the phonebook). Relational Databases Example of a relational attribute table data for hotels in Happy Valley, including the terminology / ID number Earlier this term you were introduced to flat file and hierarchical attribute tables. Relational attribute files are
  • 28. different to flat files and hierarchical files (see next slide for comparison). As we can see here, by organizing the data into 2D tables that contain records for each entity / feature, it is possible to “query” these data using a GIS, to find out which hotels are the cheapest or which is 5 star etc. Examples will be given in later slides. New attribute tables can be created using the relational ID number. This is 1 example of the flexibility of computer databases over hardcopy forms, in that they can be adapted to different purposes to illustrate different relationships. You will be doing these types of exercises during your analysis course. Creating New Attribute Tables from Existing Ones Retrieval Operations Retrieval operations may be applied to either spatial or attribute data. For example, if searches are made to attribute data then the user would find or browse an entity. As we saw in the last slide, and as you have experienced in your analysis course, this can be used for data reorganization by selecting, renumbering, and sorting the data. Similarly, new attributes can be calculated by computation of selected values.
  • 29. Spatial Retrieval Operations In a spatial (map) database, records are features or themes. The spatial equivalent of an “attribute find” is locate. The GIS then highlights the result. Spatial equivalents of the DBMS queries result in locating sets of features or building new GIS layers. Queries Functions, query language and applying algorithms could be difficult requiring a command line approach. For example: find in states where state_name = ‘California’ <1 record in result> use states calculate in states population_density = population / area <50 records in result> restrict in states where population_density > 1000 <20 records selected in result> So you are coming into this field at a good time because it recent years it has become A LOT more user friendly. Querying via SQL At the highest level, a query in SQL is comprised of three key components or operations: SELECT, FROM, and WHERE.
  • 30. Intuitively, these operations used together tell your DBMS/GIS to SELECT some subset or fields of data FROM some data set WHERE some such condition is met. Retrieval User Interface Now the Retrieval User Interface simplifies the process. Most GIS packages use the GUI (Graphical User Interface) of the computer’s operating system to support both a menu-type query interface and a macro or programming language. SQL (Standard Query Language) is a standard interface that works with relational databases and is supported by many GIS software packages. So, let us examine a few things we can do operations with the GUI and SQL in the next slides. You will also look at this in more detail in week 8 (chapter 9) and you will be applying these techniques extensively in the analysis course. I will start with spatial retrieval operations. Then go on to illustrate an application from my own work that uses complex retrieval operations. Finally, I will finish this week’s lecture component by discussing GIS applications using the web interface. Examples of Attribute Searches and Queries Identify– The image below shows a commonly used tool in GIS. The user clicks on the “identify” tool (circled in red) and then clicks on a feature in the map which will then display all of the attributes associated with that feature. This tool provides a quick “snapshot” of attributes associated with a feature and does not allow the user to save or work with the popup.
  • 31. Examples of Attribute Searches and Queries Select by Attributes – The “select by attributes” tool allows the user to build a SQL query and then view the results of the query on the map display. The user can also see the results in the layer’s attribute table, further refine the query, or export the data into a new table. The example below shows features from the 2013 EPA_Temperature_2013 layer greater than January 2nd 2013 and less than or equal to April 1st 2013. Another advantage of using this tool is that more complex queries can be saved an re-used at a later date. Examples of Spatial Searches and Queries – Select by Location In GIS there will be instances where you will be required to find features that are near or that intersect other features. While the select by attributes tool we saw on the previous slide is a powerful tool it is limited in that it can only use SQL language to select features. The Select by Location is an example of a tool that can be used to select features from one layer that intersect, touch, are within a distance of, etc. a feature or features in another layer. The example to the right retrieves all of the 2009 seismic events that are within a mile of the selected countries. Once the data is selected in can be exported to another feature layer or viewed as an attribute table which can be exported.
  • 32. Foundations of Geographic Information Science GIS5103 – Fall 2019 Week 5 Data Input, Throughput, & Quality By the end of this week you should be able to: Describe methods of data input Describe Digitizing sources ‐ the cartographic base Contrast Digitizing methods within a GIS (Tablet vs. “Heads Up”) Describe Data editing (correcting digitizing errors, re‐projections and generalizations) Define accuracy and precision Describe the importance of data standards and metadata List sources of errors in a GIS project Differentiate spatial and attribute errors Describe techniques for coping with errors in Data The Data Stream
  • 33. The focus of this week’s lecture component is on creating GIS or spatial data from a variety of sources. We will also be looking at how GIS data is updated. Digitizing GIS Data As we have discussed over the past several weeks one of the primary features of a GIS is its ability to integrate data from a variety of sources. Depending on how the data is to be used the need to convert the data or extract features from it may arise. Fortunately there are a number of techniques and tools available to assist in acquiring data for a new GIS is straightforward these days. Some of the sources for this data can include some of the following: GPS (Global Positioning Systems) has become a major source of new GIS data. Data acquired through GPS can be highly spatially accurate (1cm horizontal) depending upon the type of receiver used. Digital map images such as scanned maps, LiDAR , satellite data, and aerial photographs (Ortho photos) are now often used as a cartographic base for digitizing; this is another way to create new GIS data layers. Remotely sensed satellite data are becoming an important source of GIS data as the cost of data falls, free data becomes available and new types of data emerge. The following examples show the different spatial scales commonly found with remote sensing data. Examples are also provided that show how they can be used to digitize temporal changes or changes to features over time (urbanization) or after an event (Natural Disaster).
  • 34. Satellite data are raster data. The amount of spatial detail represented by one pixel of a raster image determines its spatial resolution. So, an image with one meter spatial resolution means that each pixel in the image represents one square meter on the ground. With improvements in cameras and digital image acquisition the cost of acquiring aerial images has decreased. It is now possible to obtain imagery with a 6” pixel resolution for a few thousand dollars. 30 meters 10 meters 5 meters The Different Spatial Resolutions Image Resolutions Continued 2 meters 1 foot In cartographic applications, higher spatial resolutions are often favored. However, for applications such as weather forecasting or to monitor seasonal plant growth on a yearly basis for North America, coarser spatial resolutions are favored. While this type of data can be extremely useful there are data storage considerations and large file sizes to consider which may impact the accessibility of these images especially in web based applications. 1 meter Where to obtain remote sensing data
  • 35. There are a number of useful websites from which remote sensing data (satellite images) can be downloaded. Here are some examples: Free satellite image data @ University of Maryland: http://glcf.umd.edu/data/ It is also possible to BUY data! e.g. http://gs.mdacorporation.com/products/index.asp Free Digital Terrain data: http://www.webgis.com/terraindata.html Digitizing within a GIS Sometimes digitizing is necessary if: We need new to input new features. Map features are incorrectly mapped. Updates are needed for existing features. And so, just what is digitizing? Digitizing is the process of capturing map data in a GIS layer by tracing points, lines, or polygons from a map or image by using a mouse or a “puck” on on a digitizing tablet. It can be done by either using a digitizing tablet, or by digitizing directly from a satellite image or a scanned map on screen. On‐screen or “heads up” digitizing creates a spatial dataset by tracing over features displayed on a computer monitor with a mouse. The newly created dataset picks up the spatial reference of the source document and results in a string of points with (x,
  • 36. y) values. You will learn how to digitize in tutorial 6 of your analysis class but we will examine how we can do this now as well. Digitizing table and PC workstation A few years ago, the digitizing table was widely used in GIS. This method allows features to be input into a GIS database as points, lines or polygons but often requires extensive data clean-up and editing. With the updates made to GIS software this method of data entry is essentially obsolete and has been replaced by on-screen digitizing. Digitizing directly on screen Digitizing directly on screen or “heads up” digitizing, is the approach I am most familiar with and it is what you will be doing in the analysis class. In this case, you use your computer mouse to digitize paper maps, aerial photos, or other images displayed in the GIS. We can refer to this layer as a stable base map (that is geocoded) ‐ recall from slide 15 that the newly created dataset picks up the spatial reference of the source document? Below is an example, where the outline of fluvial features (blue) and contour lines (brown) have been digitized. Only the fluvial features can have been digitized directly from the image on the left, however. Selecting points to digitize A vertex Digitizing is not difficult to do. You can choose to digitize
  • 37. either a point, a line (snap to feature) or a polygon (snap to end). Snapping is a routine embedded within the GIS that makes sure the separate vertices in a line or an area are connected. That is, it ensures there are no gaps. This is important if calculations or correlations are going to be performed on the new layers. In these cases, gaps would cause errors. The next slide shows how snapping can be achieved –it uses topology Geocoding Address Data What is it? Geocoding address data is the process of relating an address to a geographic location (such as latitude/longitude coordinates) or geographic area (such as census tract, block group, block, or ZIP code). The address itself then is used to determine the geographical coordinates. Geocoding can be affected by the quality of data, e.g., incorrect spelling and the use of different abbreviations (e.g., for Street and Avenue). Therefore, the use of standards is relevant to geocoding address data. Address Matching Address matching is the process of geocoding street addresses to a street network. (Modified) ESRI’s definition of address matching: A process that compares an address or a table of addresses to the address attributes of a reference dataset (e.g., locations may be determined based on address ranges stored for each street segment). This determines whether a particular address falls
  • 38. within an address range associated with a feature in the reference dataset. If it does, it is considered a match and a location can be returned. The next slide provides an example of how an address can be converted to a point feature and drawn along a line segment based upon an address range. Address Matching In most instances a geocoding service or engine is built upon a street network in which line segments are assigned an address number range on the left and right side of the streets, a city or twin name, and a zip code. When a user enters an address the geocoding engine finds a match along a line segment and places the location of the address (a point) based upon where it falls along the street line segment. GIS Services: Address Matching Resources Creating a geocoding engine can be a time consuming process. However, there are some commercially available geocoding services that allow us to enter a table of addresses for geocoding and then download the results and add them to our GIS database. Some geocoding services are free and some charge a fee. http://www.batchgeo.com https://geomap.ffiec.gov/FFIECGeocMap/GeocodeMap1.aspx https://www.census.gov/geo/maps-data/data/geocoder.html Error, Accuracy, and Precision
  • 39. Until quite recently, people involved in developing and using GIS paid little attention to the problems caused by error, inaccuracy, and imprecision in spatial datasets. Surely, a GIS is too powerful for error? Not true! It is now recognized that error, inaccuracy, and imprecision, if left unchecked, can make the results of a GIS analysis almost worthless, it is like putting garbage in means getting garbage out. But where do these errors come from? Since a GIS can collate and cross‐reference many types of data by location and can integrate many discrete datasets (which is the heart of its power), it can also inherit error from the imported datasets. 17 Error, Accuracy, and Precision We can discuss error in terms of data quality. Data quality refers to the relative accuracy and precision of a GIS database. These facts are often documented in data quality reports. Errors may exist both in map data (which can be reduced by maintaining topological integrity) and also attribute data. Next we will discuss the differences between accuracy and precision in GIS data. 18 Data Accuracy Accuracy is the degree to which information on a map, or in a
  • 40. database, matches true values. The level of accuracy required for particular applications and GIS analyses varies greatly. Highly accurate data can be very difficult and costly to produce and maintain often requiring highly accurate GPS and/or survey data or the acquisition of LiDAR or remotely sends data (satellite imagery, aerial photographs, etc.). In discussing a GIS database, it is possible to consider horizontal and vertical (spatial) accuracy with respect to geographic position (i.e. how close is the mapped feature to its actual location on the earths surface) as well as attribute accuracy (i.e. do the values in the Attribute table match the real world values). 19 Data Accuracy 1:1,200 ± 1 m 1:2,400 ± 2 m 1:4,800 ± 4 m 1:10,000 ± 8.5 m 1:12,000 ± 10 m 1:24,000 ± 12 m 1:63,360 ± 32 m 1:100,000 ± 50 m This means that when we see a point on a map we have its "probable" location within a certain area. For example, a sewer manhole shown on a 1:1,200 scale map should be within 1 meter (3 ft) of it’s true location on the ground to be considered within the 1:1,200 accuracy standard. It’s important to understand the limitations of your data as well. While a layer of sewer manholes within the 1:1200 data standard may be suitable for broad planning or maintenance activities, using this
  • 41. data at scales it was not intended (engineering purposes for example) for can have negative and potentially costly consequences. 20 Defining Precision Precision refers to the level of measurement and exactness of description in a GIS database. The level of precision required for particular applications varies greatly. Engineering projects such as road and utility construction require very precise information measured to the millimeter. Demographic analyses of marketing or electoral trends can often make do with less. For example to the closest zip code or precinct boundary. Highly precise data can also be very difficult and costly to collect. Precise data ‐ no matter how carefully measured ‐ may be inaccurate and vice-versa. Let’s try to understand this better by using Bolstad’s diagram on page 625, which I have included in the next slide. 21 Accuracy and Precision Points (yellow circles) are digitized to represent the center of the cloverleaf intersection. Average accuracy is high when the average of the points falls near the true location, as in the panels on the left side of the figure. Precision is high when the points are all clustered near each other (top panels). A group of
  • 42. points may be accurate, but not precise (lower left), or precise, but not accurate (upper right). We typically strive for a process that provides both accuracy and precision (upper left), and avoid low accuracy and low precision (lower right). 22 Sources of Inaccuracy and Precision Let us take a minute to discuss attribute accuracy and precision, that is the non‐spatial data linked to lo cation. Inaccuracies may result from mistakes of many sorts, including basic data entry. With respect to precision, precise attribute information describes phenomena in great detail. For example, a precise description of a person living at a particular address might include gender, age, income, occupation, level of education, and many other characteristics. A less precise description might include just income or gender. It is the application that will determine whether precise or less precise data are needed. There are many sources of error that may affect the quality of a GIS dataset. Some are quite obvious, but others can be difficult to discern. Few of these will be automatically identified by the GIS itself. It is the user's responsibility to prevent them. Sources of error can be divided into 3 main categories: Conceptual errors; Errors arising through data processing; and Errors arising from source data 23
  • 43. Errors Arising from Source Data Age of Data Data sources may be too old to be useful or relevant to current GIS projects. Past collection standards may be unknown, non‐existent, or currently acceptable. For instance, John Wesley Powell's nineteenth century survey data of the Grand Canyon lacks the precision of data that can be developed and used today. Additionally, erosion, deposition, and other geomorphic processes will have modified the landscape. Therefore, reliance on old data could skew, bias, or negate results. Density of Observations The number of observations within an area is a guide to data reliability and they should be known by the map user. An insufficient number of observations may not provide the level of resolution required to adequately perform spatial analysis and determine the patterns GIS projects seek to resolve or define. 24 Errors Arising from Source Data There are four ways we describe errors in spatial data Positional accuracy – describes how close the locations of objects represented in a digital dataset correspond to the true locations of the real-world entities.
  • 44. Attribute accuracy – summarizes how different the attributes are from their real world values. Logical Consistency – reflects the presence, absence, or frequency of inconsistent data. Tests for logical consistency often require comparisons among themes (i.e. all buildings must be on dry land). Completeness – Describes how well a layer reflects all of the real world features it is supposed to represent. The next slide shows a figure from the text (Figure 14-3, Page 624) that depicts examples of these types of errors. 25 Errors Arising from Source Data 26 Data Standards Regular checks and tests should be employed during a project to make sure that standards are being followed. This allows a designer to pinpoint difficulties at an early stage and correct them. Establishing data standards helps with data exchange – unfortunately, the history of GIS data exchange has been chaotic and has been wasteful in the past. Examples of good data standards include: USGS, National Mapping Program Standards, http://nationalmap.gov/standards//
  • 45. Spatial Data Transfer Standard http://mcmcweb.er.usgs.gov/sdts/ 27 2 Foundations of Geographic Information Science GIS 5103 Fall 2019 - Week 3 Spatial Data Modeling By the end of Week Three you should be able to: Provide a definition of a ‘spatial data model’ Explain how spatial entities are used to create a data model Distinguish between rasters and vectors Describe a spatial data structure Explain what topology is List the advantages and disadvantages of different types of spatial models
  • 46. File Types Files in a GIS may be binary data files. They can be read by computers (a program or a hardware understands in advance exactly how it is formatted) but not people. All executable programs and most numeric data files are stored in binary files. Or they can be ASCII text. ASCII files can be read by humans. Binary is faster to read and smaller. Raster and Vector Data A file must have a mental model of how the physical data represent a geographic feature. GIS traditionally use either raster or vector models to store data. Rasters represent phenomenon distributed continuously over a large area. Vectors are object based (point, line, polygon).
  • 47. Raster Data Example Satellite data has a RASTER data structure (GRID CELL), composed of matrix of pixels; Each pixel records the “radiation intensity” measured by the remote sensing instrument called its radiometric resolution. The intensity of radiation is represented by a data number (DN), that is, the “brightness value”. One grid cell (pixel) has one attribute that represents its data number/ brightness value (even if 0); a cell also has a spatial resolution, given as the cell size in ground units. Grid Structure of Raster Data, p.81-82 Vector Data Example Spatial feature files that are object (vector) based: Have identifiable boundaries or spatial extent; Relevant to an intended application; They have “ATTRIBUTES” – descriptive characteristics of a feature. For example, a tax parcel can have a rectangular shape and be represented by a polygon, that is the feature’s geometry. The owner name, parcel id, property value, etc. would be considered the
  • 48. polygon’s attributes. Attribute data are stored in tables comprised of a matrix of numbers and values and are stored in rows and columns, like a spreadsheet, e.g. attribute tables. OBJECTS may be: Exact, e.g., man-made objects; or Inexact (fuzzy), e.g., soil types and forest stands etc. Recall: spatial objects represented by vector files may be represented either as points, lines or polygons (areas) [slides 9-11, class 2]. Vectors vs Rasters: Vectors can store information about topology (relational space, p. 51-52). GIS Data File Formats Most GIS systems can import different data formats, or use utility programs to convert them; Data formats can be industry standard or commonly accepted standard; Example vector formats: DLG (Digital Line Graph) and TIGER (Topologically Integrated Geographic Encoding and Referencing system), shp. (ESRI) Personal, File, and Enterprise GeoDatabase (ESRI); Most raster formats are digital image formats, e.g. Tiff (Tagged Image File Format) or DEMs (Digital Elevation Models).
  • 49. Data Models Once you have determined which features you would like to map or create data layers for you next have to decide how you wish to portray those features and also consider the following: How the geographic features are to be represented for each theme (for example, as points, lines, polygons, or rasters) along with their tabular attributes How the data will be organized into datasets, such as feature classes, attributes, raster datasets, and so forth What additional spatial and database elements will be needed for integrity rules, for implementing rich GIS behavior (such as topologies, networks, and raster catalogs), and defining spatial and attribute relationships between datasets. Representation Each GIS database design begins with a decision as to what the geographic representations will be for each dataset. Individual geographic entities can be represented as: Feature classes (sets of points, lines, and polygons) Imagery and rasters (aerial photography, grid datasets) Continuous surfaces that can be represented using features (such as contours), rasters (digital elevation models [DEM]), or triangulated irregular networks (TINs) using terrain datasets Attribute tables for descriptive data – these tables can be stand- alone tables that contain information related to a feature in a
  • 50. feature class – such as an inspection, an activity (catch basin cleaning, tree maintenance, etc.) Data Themes Geographic representations are organized in a series of data themes (sometimes referred to as thematic layers). A key concept in a GIS is one of data layers, or themes. A data theme is a collection of common geographic elements such as a road network, a collection of parcel boundaries, soil types, an elevation surface, satellite imagery for a certain date, well locations, and so on. The concept of a thematic layer was one of the early notions in GIS. Practitioners thought about how the geographic information in maps could be partitioned into logical information layers—as more than a random collection of individual objects (such as a road, a bridge, a hill, a house, a peninsula). These early GIS users organized information in thematic layers that described the distribution of a phenomenon and how it should be portrayed across a geographic extent. These layers also provided a protocol (capture rules) for collecting the representations (as feature sets, raster layers, attribute tables, and so on). In GIS, thematic layers are one of the main organizing principles for GIS database design. Each GIS will contain multiple themes for a common geographic area. The collection of themes acts as layers in a stack. Each theme can be managed as an information set
  • 51. independent of other themes. Each has its own representations (points, lines, polygons, surfaces, rasters, and so on). Because the various independent themes are spatially referenced, they overlay one another and can be combined in a common map display. Plus, GIS analysis operations, such as overlay, can fuse information between themes. GIS datasets are collections of representations for a data theme Geographic data collections can be represented as feature classes and raster-based datasets in a GIS database. Many themes are represented by a single collection of homogeneous features such as a feature class of soil type polygons and a point feature class of well locations. Other themes, such as a transportation framework, are represented by multiple datasets (such as a set of spatially related feature classes for streets, intersections, bridges, highway ramps, and so on). Raster datasets are used to represent continuous surfaces, such as elevation, slope, and aspect, as well as to hold satellite imagery, aerial photography, and other gridded datasets (such as land cover and vegetation types). GIS datasets are collections of representations for a data theme – Cont. Both the intended use and existing data sources influence spatial representations in a GIS. When designing a GIS
  • 52. database, users have a set of applications in mind. They understand what questions will be asked of the GIS. Defining these uses helps to determine the content specification for each theme and how each is to be represented geographically. For example, there are numerous alternatives for representing surface elevation: as contour lines and spot height locations (such as hilltops, peaks), as a continuous terrain surface (a TIN), or as shaded relief. Any or all of these may be relevant for each particular GIS database design. The intended uses of the data will help to determine which of these representations will be required. Frequently, the geographic representations will be predetermined to some degree by the available data sources for the theme. If a preexisting data source was collected at a particular scale and representation, it will often be necessary to adapt your design to use it. Individual GIS datasets often are collected in concert with other data layers While each GIS dataset can be used independently of other GIS data, it is often quite important to collect datasets in concert with other information layers so that the fundamental spatial behavior and spatial relationships are maintained and consistent between the related GIS data layers. Here are a few examples that help to illustrate this concept: The spatial relationships between elevation, landform, soil type, slope, vegetation, surficial geology, and other terrain properties are typically compiled in unison to characterize environmental resource units. Understanding the science behind these spatial relationships helps to build a consistent, logical database where features from each data layer are consistent
  • 53. with each other. Topographic basemap information is compiled in an integrated manner. Hydrography, transportation, structures, administrative boundaries, and other topographic map layers are compiled in unison. These cartographic representations in the map display are built in an integrated manner to communicate clearly and accurately and draw attention to key map locations. Individual GIS datasets often are collected in concert with other data layers – Cont. In each of these cases, a data model defines a collection of related data themes that fit into an overall information framework. Each framework is essentially a collection of related data themes that are best captured in unison with each other. The data capture guidelines follow sound scientific principles about their spatial behavior and relationships. Each theme plays an important part in the holistic characterization of a particular landscape. For example: Terrain landscape. Topographic maps, elevation, drainage network, transportation network, map features, cross-country movement, and so forth Urban landscape. Buildings, critical infrastructure, and so forth Imagery landscape. Satellite and aerial, local, regional, and national assets, and so forth Human landscape. Demographics (population characteristics), cultural centers, citizens, administrative districts and zones and so forth
  • 54. Workforce landscape. Mobile workforce tracking, service centers, traffic conditions, warehouses, and so forth Sensor landscape. Camera locations, devices, and so forth Operations and plans landscape. Zones of control, planned movements, response, and so forth This concept of collecting integrated data themes in unison is one of the key design principles used in each of the ArcGIS Data Models. Topology Topology is a collection of rules that, coupled with a set of editing tools and techniques, enables the geodatabase to more accurately model geometric relationships. ArcGIS implements topology through a set of rules that define how features may share a geographic space and a set of editing tools that work with features that share geometry in an integrated fashion. A topology is stored in a geodatabase as one or more relationships that define how the features in one or more feature classes share geometry. The features participating in a topology are still simple feature classes—rather than modifying the definition of the feature class, a topology serves as a description of how the features can be spatially related. Why Topology? Topology has long been a key GIS requirement for data
  • 55. management and integrity. In general, a topological data model manages spatial relationships by representing spatial objects (point, line, and area features) as an underlying graph of topological primitives—nodes, faces, and edges. These primitives, together with their relationships to one another and to the features whose boundaries they represent, are defined by representing the feature geometries in a planar graph of topological elements. Why topology? Topology is a collection of rules that, coupled with a set of editing tools and techniques, enables the geodatabase to more accurately model geometric relationships. ArcGIS implements topology through a set of rules that define how features may share a geographic space and a set of editing tools that work with features that share geometry in an integrated fashion. A topology is stored in a geodatabase as one or more relationships that define how the features in one or more feature classes share geometry. The features participating in a topology are still simple feature classes—rather than modifying the definition of the feature class, a topology serves as a description of how the features can be spatially related. Why topology? Topology is fundamentally used to ensure data quality of the spatial relationships and to aid in data compilation. Topology is also used for analyzing spatial relationships in many
  • 56. situations, such as dissolving the boundaries between adjacent polygons with the same attribute values or traversing a network of the elements in a topology graph. Topology can also be used to model how the geometry from a number of feature classes can be integrated. Some refer to this as vertical integration of feature classes. Ways that features share geometry within a topology Features can share geometry within a topology. Here are some examples among adjacent features: Area features can share boundaries (polygon topology). Line features can share endpoints (edge-node topology). In addition, shared geometry can be managed between feature classes using a geodatabase topology. For example: Line features can share segments with other line features. Area features can be coincident with other area features. For example, parcels can nest within blocks. Line features can share endpoint vertices with other point features (node topology). Point features can be coincident with line features (point events). The next slide shows some commonly used topology rules found in ArcGIS. 28
  • 57. What are the advantages of topology? The results are that topological primitives (nodes, edges, and faces) and their relationships to one another and their features can be efficiently discovered and assembled. This has several advantages: Simple feature geometry storage is used for features. This storage model is open, efficient, and scales to large sizes and numbers of users. This simple features data model is transactional and is multiuser. By contrast, the older topological storage models will not scale and have difficulties supporting multiple editor transactions and numerous other GIS data management workflows. Geodatabase topologies fully support all the long transaction and versioning capabilities of the geodatabase. Geodatabase topologies need not be tiled, and many users can simultaneously edit the topological database—even their individual versions of the same features if necessary. Feature classes can grow to any size (hundreds of millions of features) with very strong performance. What are the advantages of topology? Topology can be added this to an existing schema of spatially related feature classes. The alternative is that you must redefine and convert all your existing feature classes to new data schemas holding topological primitives. There need only be one data model for geometry editing and
  • 58. data use, not two or more. It is interoperable because all feature geometry storage adheres to simple features specifications from the Open Geospatial Consortium and ISO. Data modeling is more natural because it is based on user features (such as parcels, streets, soil types, and watersheds) instead of topological primitives (such as nodes, edges, and faces). Users will begin to think about the integrity rules and behavior of their actual features instead of the integrity rules of the topological primitives. For example, how do parcels behave? This will enable stronger modeling for all kinds of geographic features. It will improve our thinking about streets, soils types, census units, watersheds, rail systems, geology, forest stands, land forms, physical features, and on and on. What are the advantages of topology? Geodatabase topologies provide the same information content as maintained topological implementations—either you store a topological line graph and discover the feature geometry (like coverages) or you store the feature geometry and discover the topological elements and relationships (like geodatabases). In cases where users want to store the topological primitives, it is easy to create and post topologies and their relationships to tables for various analytic and interoperability purposes (such as users who want to post their features into an Oracle Spatial warehouse that stores tables of topological primitives). At a pragmatic level, the ArcGIS topology implementation works. It scales to extremely large geodatabases and multiuser systems without loss of performance. It includes validation and editing tools for building and maintaining topologies in geodatabases. It includes rich and flexible data modeling tools that enable users to assemble practical, working systems on file
  • 59. systems, in any relational database, and on any number of schemas.