Comparative Archaeology Database | Center for Comparative Archaeology | University of Pittsburgh
Rosario Valley Architectural Gini Coefficient and Neighborhood Dataset
The data files below provide the house-group volume measurements for Ojo de Agua. The house groups are listed from smallest to largest volume, accompanied by their Lorenz curve calculations. In the data, "mounds" are synonymous with "structures." Structures are the basic unit of analysis and consist of heavily eroded platforms (i.e., mounds) upon which domestic superstructures would have been built.
[Comma delimited UTF-8 format]
[Excel format]
Each line in the .CSV file represents one house group. There are 1957 lines, each with 13 variables separated by commas. The variables are listed in the following order:
1 | Site Name -- a number used to identify sites. |
2 | Volume of House-Group in Cubic Meters |
3 | Wide Method: f' - An index of change, this is the difference between the prior and subsequent observations, divided by two and rounded down. |
4 | Wide Method: f'' - An index of acceleration, this is the difference between the f' values of prior and subsequent observations, divided by two and rounded down. |
5 | Narrow Method: f' - An index of change, this is the difference between the observation and the one that follows, divided by two and rounded down. |
6 | Narrow Method: f'' - An index of acceleration, this is the difference between the f' values of the observation and of its prior counterpart, divided by two and rounded down. |
7 | Individual # -- a number used to count sites used for House-Group Area calculations |
8 | Pop. frac: The share of the total population that the house-group has; i.e. 1/[number of observations] |
9 | Income frac: The proportion of the population's total volume that the mound has; i.e. [Mound Volume]/[Sum of Volume from all observations] |
10 | Line of Equality sum pop.: The cumulative value of each “population fraction” value, adding up to 1 on the final observation. |
11 | Lorenz Curve sum income: The cumulative value of each “income fraction” value, adding up to 1 on the final observation |
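12 | G(i)*F(i+1): The product of the cumulative income fraction at observation i (G) and the cumulative population fraction at observation i+1 (F). Its sum over all observations, compared with the sum of column 13, is used to estimate the area between the line of equality and the Lorenz curve and consequently the Gini coefficient. |
13 | G(i+1)*F(i): The product of the cumulative income fraction at observation i+1 (G) and the cumulative population fraction at observation i (F). Its sum over all observations, compared with the sum of column 12, is used to estimate the area between the line of equality and the Lorenz curve and consequently the Gini coefficient. |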
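As a rough illustration of how columns 8 through 13 relate to one another, the following Python sketch rebuilds them from a list of volumes. It assumes the Gini coefficient is taken as the sum of column 13 minus the sum of column 12 (the trapezoid-rule form of the area calculation). The function name `lorenz_columns` and the dictionary keys are hypothetical labels, and the published spreadsheet may apply steps not shown here.

```python
from itertools import accumulate


def lorenz_columns(volumes):
    """Rebuild columns 8-13 from house-group volumes (illustrative sketch)."""
    vols = sorted(volumes)           # observations run from smallest to largest
    n = len(vols)
    if n == 0:
        raise ValueError("at least one volume is required")
    total = sum(vols)

    pop_frac = [1 / n] * n                      # column 8: equal share per house group
    income_frac = [v / total for v in vols]     # column 9: share of total volume
    F = list(accumulate(pop_frac))              # column 10: cumulative population
    G = list(accumulate(income_frac))           # column 11: cumulative volume

    # Columns 12 and 13: cross products of neighbouring cumulative values.
    # The last observation has no successor, so it contributes no term.
    g_f_next = [G[i] * F[i + 1] for i in range(n - 1)]
    g_next_f = [G[i + 1] * F[i] for i in range(n - 1)]

    # Assumption: Gini = sum(column 13) - sum(column 12).
    gini = sum(g_next_f) - sum(g_f_next)
    return {
        "sum_pop": F,
        "sum_income": G,
        "g_f_next": g_f_next,
        "g_next_f": g_next_f,
        "gini": gini,
    }
```

Under these assumptions, four equal volumes give a coefficient of 0, and four house groups where one holds all the volume give (n-1)/n, here 0.75.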
The first line of the .CSV file, for example, looks like this:
2029,0.64,,,0.16,,1,0.000510986,7.0062E-06,0.000510986,7.0062E-06,7.16014E-09,8.95018E-09
This means that site 2029 has a volume of 0.64 cubic meters. The two variables that follow (Wide Method f' and f'') show how quickly the volume changes between the prior and subsequent observations; they have no values here because this house group begins the sequence of observations and has no prior observation. It has a value of 0.16 for Narrow Method f' and no value for Narrow Method f'', which also requires a prior observation. Site 2029 is the first observation ("Individual #"=1) of this analysis. It makes up about 0.05% of the total population ("Pop. frac"=0.000510986) and has 7.0062E-06 of the population's total volume ("Income frac"=7.0062E-06). The cumulative share of the population counted with this site is about 0.05% ("sum pop."=0.000510986), and the cumulative share of the volume counted with this site is 7.0062E-06 ("sum income"=7.0062E-06). The two product terms used to estimate the area between the line of equality and the Lorenz curve are 7.16014E-09 ("G(i)*F(i+1)") and 8.95018E-09 ("G(i+1)*F(i)").
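The example row can also be checked against the variable definitions. The Python sketch below splits the row into named fields, treats empty cells as missing, and derives a few values the row implies about the dataset and the next observation. The field names are shorthand invented here, not headers from the file. The sketch assumes every row carries the same population fraction and that the Narrow Method f' value is not rounded to a whole number.

```python
# Hypothetical short names for the 13 variables, in file order.
FIELD_NAMES = [
    "site", "volume", "wide_f1", "wide_f2", "narrow_f1", "narrow_f2",
    "individual", "pop_frac", "income_frac", "sum_pop", "sum_income",
    "g_f_next", "g_next_f",
]

# The first line of the .CSV file, as quoted in the text.
FIRST_ROW = ("2029,0.64,,,0.16,,1,0.000510986,7.0062E-06,"
             "0.000510986,7.0062E-06,7.16014E-09,8.95018E-09")


def parse_row(line):
    """Split one CSV line into a dict; empty cells become None."""
    cells = line.strip().split(",")
    if len(cells) != len(FIELD_NAMES):
        raise ValueError("expected 13 comma-separated values")
    row = {}
    for name, cell in zip(FIELD_NAMES, cells):
        if name == "site":
            row[name] = cell                      # keep the identifier as text
        else:
            row[name] = float(cell) if cell else None
    return row


def implied_values(row):
    """Derive what the row implies about the dataset and the next row."""
    n_obs = round(1 / row["pop_frac"])                    # pop_frac = 1/n
    next_sum_pop = row["sum_pop"] + row["pop_frac"]       # F(i+1)
    next_sum_income = row["g_next_f"] / row["sum_pop"]    # G(i+1) = col 13 / F(i)
    next_income_frac = next_sum_income - row["sum_income"]
    # Income fractions are proportional to volumes.
    next_volume = row["volume"] * next_income_frac / row["income_frac"]
    return {
        "n_obs": n_obs,
        "next_sum_pop": next_sum_pop,
        "next_sum_income": next_sum_income,
        "next_volume": next_volume,
    }
```

Applied to the example row, the implied number of observations is 1957, matching the stated line count, and the product of "sum income" with the next cumulative population share reproduces the "G(i)*F(i+1)" value. The implied volume of the next house group is about 0.96 cubic meters; half the difference from 0.64 is about 0.16, which agrees with the Narrow Method f' value.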