Event prediction is concerned with modeling where, when, and why events occur. The estimated locations and times of future events can be informed by characteristics and attributes of the spatio-temporal environment (i.e., feature space) as well as by the patterns in the event history. A Hawkes, or self-exciting point process, model is a formalized approach to combine information from both the feature space and the event history while automatically determining how much weight to give to recent events. These models have been used to represent processes such as: earthquakes, crime and conflict, traffic incidents, social media activity, financial markets, new product adoption, social network communication, neural spiking, and disease spread.
Topics include: point processes, Hawkes models, self-exciting models, kernel density estimation, spatial statistics, time series, predictive analytics, machine learning.
Recent Papers:
Recent Funding:
Pattern and event detection is concerned with discovering patterns and anomalies in data. This often involves problems where the quick detection of anomalies or unusual observations from a data stream is crucial (e.g., disease and crime outbreak detection, activation of terrorist sleeper cell). But in addition to being able to detect the changes quickly, we should also be able to identify the cause of the event - like the spatial location where the outbreak started or the network neighborhood responsible for fraudulent financial activities. Clustering is another approach to making sense of complex data. This is the task of grouping observations that share features or have similar patterns. Similarly, data mining and machine learning approaches can be used to identify the features most useful for prediction or understanding the data generating process. Recent projects involve detecting anomalies in large dynamic networks and spatial event data.
Topics include: anomaly detection, dynamic network analysis, spatial hotspot detection, clustering, data mining, NMF.
Recent Papers:
Recent Funding:
Data linkage is concerned with matching data that correspond to the same entity. Our work in data linkage is, in essence, the assessment of statistical evidence in large disparate data. One example is crime linkage, the process of grouping together the unsolved crimes that were committed by the same offender(s). We use the behavioral patterns in crime data that have the most power to discriminate between offenders to create statistical measures of evidence. This helps crime analysts make the complex comparisons of not only the similarity, but also the distinctiveness of variables related to an offender's site selection behaviors, crime scene behaviors, and possibly even eyewitness descriptions. Besides applications in statistical forensics, concepts of data linkage can also be used for record linkage, the process of linking data records that correspond to the same entity. The recent political claims of voter fraud fit into this category as do medical record linkage to improve health outcomes and detect dangerous drug interactions. We are also developing network based approaches to road conflation, the task of merging multiple maps to better understand the traversable roads and paths of a region.
Topics include: crime linkage, statistical forensics, record linkage, road conflation, clustering, Bayesian.
Recent Papers:
Recent Funding: