A prediction model for Xiangyang Neolithic sites based on a random forest algorithm

Bibliographic Details
Title: A prediction model for Xiangyang Neolithic sites based on a random forest algorithm
Authors: Li Linzhi, Chen Xingyu, Sun Deliang, Wen Haijia
Source: Open Geosciences, Vol 15, Iss 1, Pp 1-5 (2023)
Publisher Information: De Gruyter, 2023.
Publication Year: 2023
Collection: LCC:Geology
Subject Terms: archaeological site prediction, random forest model, xiangyang city, hubei, Geology, QE1-996.5
More Details: An archaeological site prediction model can accurately identify areas likely to contain archaeological sites, enabling a better understanding of the processes of human civilization and patterns of social development. Taking the Xiangyang area as the study area, data on a total of 129 Neolithic sites in the region were collected. An eight-factor index system was constructed, comprising elevation, slope, slope direction, micro-geomorphology, distance to water, slope position, planar curvature, and profile curvature. A geospatial database with a resolution of 30 m × 30 m was established. The full sample set was built with a 1:1 ratio of archaeological to non-archaeological sites and used to train the model and obtain the prediction results. The average Gini coefficient was used to evaluate the influence of the various archaeological site factors. The results revealed that the area under the curve values of the receiver operating characteristic curves were 1.000, 0.994, and 0.867 for the training, complete, and test datasets, respectively. Moreover, 60% of the historical archaeological sites were located in the high-probability zone, which accounted for 12% of the study area. The prediction model proposed in this study matched the spatial distribution characteristics of archaeological site locations. With the model assessed using the best samples, the results were categorized into three classes: low, average, and high. The proportions of the low-, average-, and high-probability zones decreased in that order. The high-probability zones were mainly located near the second and third tributaries and distributed across the low eastern hills and central hillocks. The random forest (RF) model was used to rank the importance of the archaeological site variables. Elevation, slope, and micro-geomorphology were identified as the three most important variables.
The RF model for archaeological site prediction showed good stability and predictive ability in the case study area; the model offers a new research method for archaeological site prediction and a reference for revealing the relationship between archaeological activities and the natural environment.
Document Type: article
File Description: electronic resource
Language: English
ISSN: 2391-5447
Relation: https://doaj.org/toc/2391-5447
DOI: 10.1515/geo-2022-0467
Access URL: https://doaj.org/article/dde95471c85a492a8b94d051d98c456b
Accession Number: edsdoj.95471c85a492a8b94d051d98c456b
Database: Directory of Open Access Journals