
Department of Mathematics

RTG Biweekly Seminars

Time: Friday 3:40pm–4:40pm

Zoom Link: https://us06web.zoom.us/j/83134968156?pwd=VXRXWVphMEZyTXA3cEE5Y3crMHQvUT09

Meeting ID: 831 3496 8156

Passcode: 375864

 

Abstract: Approximating high-dimensional functions is challenging due to the curse of dimensionality. In this talk, we will discuss dimension reduction via learning level sets for function approximation. The approach has two major components: a pseudo-reversible neural network module that effectively transforms the high-dimensional input variables into low-dimensional active variables, and a synthesized regression module that approximates function values from the transformed data in the low-dimensional space. This is joint work with Prof. Lili Ju, our graduate student Mr. Yuankai Teng, Dr. Anthony Gruber (Sandia), and Dr. Guannan Zhang (ORNL).
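As a rough illustration of this two-module structure, here is a minimal PyTorch-style sketch, assuming a simple encoder/decoder pair and a small regression head; the class names, layer sizes, and loss weighting below are hypothetical placeholders, not the architecture presented in the talk.

```python
import torch
import torch.nn as nn

class PseudoReversibleNet(nn.Module):
    """Hypothetical pseudo-reversible map R^d -> R^k (k << d),
    trained jointly with an approximate inverse R^k -> R^d."""
    def __init__(self, d, k, width=64):
        super().__init__()
        self.encode = nn.Sequential(nn.Linear(d, width), nn.Tanh(), nn.Linear(width, k))
        self.decode = nn.Sequential(nn.Linear(k, width), nn.Tanh(), nn.Linear(width, d))

    def forward(self, x):
        z = self.encode(x)       # low-dimensional active variables
        x_rec = self.decode(z)   # approximate reconstruction of the input
        return z, x_rec

class Regressor(nn.Module):
    """Regression module acting on the low-dimensional variables."""
    def __init__(self, k, width=64):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(k, width), nn.Tanh(), nn.Linear(width, 1))

    def forward(self, z):
        return self.net(z)

# One (hypothetical) joint training step: regression loss plus a
# reconstruction penalty enforcing pseudo-reversibility.
d, k = 20, 2
prnn, reg = PseudoReversibleNet(d, k), Regressor(k)
opt = torch.optim.Adam(list(prnn.parameters()) + list(reg.parameters()), lr=1e-3)
x = torch.randn(128, d)
y = torch.sin(x[:, :1])          # stand-in for target values f(x)
z, x_rec = prnn(x)
loss = nn.functional.mse_loss(reg(z), y) + nn.functional.mse_loss(x_rec, x)
opt.zero_grad(); loss.backward(); opt.step()
```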

Abstract: Anomalously diffusive transport, which exhibits power-law decaying behavior, occurs in many applications, along with many other power-law processes. In this talk we will go over related modeling and analysis issues, in comparison to normal Fickian diffusive transport, which exhibits exponentially decaying behavior. We will show why fractional calculus, in which the order of differentiation may be a function of space, time, the unknown variable, or even a distribution, provides a more appropriate modeling tool for these problems than conventional integer-order models do.
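As a schematic of the contrast (a standard constant-order textbook example, not the variable-order models of the talk), Fourier modes of classical diffusion decay exponentially in time, while for time-fractional diffusion of order 0 < α < 1 they decay like a power law; here E_α denotes the Mittag-Leffler function.

```latex
\partial_t u = \Delta u
  \;\Rightarrow\; \widehat{u}(\xi,t) = e^{-|\xi|^2 t}\,\widehat{u}_0(\xi),
\qquad
\partial_t^{\alpha} u = \Delta u
  \;\Rightarrow\; \widehat{u}(\xi,t) = E_{\alpha}\!\bigl(-|\xi|^2 t^{\alpha}\bigr)\,\widehat{u}_0(\xi)
  \;\sim\; t^{-\alpha} \ \text{as } t \to \infty.
```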

Abstract:  Networks in ecology can take many forms, describing interactions between species, dispersal pathways between different habitat patches in space, or associations between different classes of species (e.g., host and parasite species). In this talk, we will explore the different uses and issues present in the analysis of ecological networks and the prediction of potentially missing links in networks. In doing so, we will identify some frontiers in which graph theory may be applied to ecological networks using existing data, model simulations, and laboratory experiments.
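For a concrete flavor of link prediction on a small network, here is a sketch using NetworkX's built-in similarity indices; the toy graph and the choice of the Jaccard coefficient are illustrative assumptions, not methods attributed to the talk.

```python
import networkx as nx

# Toy interaction network; nodes could be species or habitat patches.
G = nx.Graph([("heron", "fish"), ("fish", "algae"),
              ("otter", "fish"), ("otter", "crab"), ("heron", "crab")])

# Score each non-edge by the Jaccard coefficient of its endpoints'
# neighborhoods; high-scoring pairs are candidate missing links.
candidates = nx.jaccard_coefficient(G, nx.non_edges(G))
for u, v, score in sorted(candidates, key=lambda t: -t[2]):
    print(f"{u} -- {v}: {score:.2f}")
```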

Abstract: This talk is about the intrinsic obstructions encountered when approximating or recovering functions of a large number of variables, commonly subsumed under the term “Curse of Dimensionality”. Problems of this type are ubiquitous in Uncertainty Quantification and machine learning. In particular, we highlight the role of deep neural networks (DNNs) in this context. A new sparsity notion, namely compositional dimension sparsity, is introduced, which is shown to favor efficient approximation by DNNs. It is also indicated that this notion is suited for function classes consisting of solutions to operator equations. This is quantified for solution manifolds of parametric families of transport equations. We focus on this scenario because (i) it cannot be treated well by currently known concepts and (ii) it has interesting ramifications for related, more general settings.

Abstract: Solid tumors are heterogeneous in composition. Cancer stem cells (CSCs) are a highly tumorigenic cell type found in developmentally diverse tumors; they are believed to be resistant to standard chemotherapeutic drugs and responsible for tumor recurrence. Understanding tumor growth kinetics is therefore critical for the development of novel strategies for cancer treatment. In this talk, I shall introduce mathematical models of HER2 signaling for the dynamical interaction between cancer stem cells (CSCs) and non-stem cancer cells. Our findings reveal that two negative feedback loops are critical in controlling the balance between the population of CSCs and that of non-stem cancer cells. Furthermore, the model with negative feedback suggests that over-expression of the oncogene HER2 leads to an increase of CSCs by regulating the division mode or proliferation rate of CSCs.
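A minimal two-compartment caricature of such stem/non-stem dynamics with one negative feedback loop is sketched below; the equations, parameter values, and feedback form are hypothetical illustrations, not the model from the talk.

```python
from scipy.integrate import solve_ivp

def rhs(t, y, r=1.0, p0=0.6, k=2.0, d=0.3):
    """Toy model: S = cancer stem cells, N = non-stem cancer cells.
    Negative feedback: the probability p of symmetric self-renewing
    CSC division decreases as the non-stem population N grows."""
    S, N = y
    p = p0 / (1.0 + k * N)           # hypothetical feedback on division mode
    dS = r * (2.0 * p - 1.0) * S     # net CSC growth from division outcomes
    dN = 2.0 * r * (1.0 - p) * S - d * N
    return [dS, dN]

sol = solve_ivp(rhs, (0.0, 50.0), [0.01, 0.0])
print("final (S, N):", sol.y[:, -1])
```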

Abstract: We derive mean-field information Hessian matrices on finite graphs. Here "information" refers to entropy functions on the probability simplex, and "mean-field" refers to nonlinear weight functions of probabilities supported on graphs. These two concepts define a mean-field optimal transport type metric. In this metric space, we first derive Hessian matrices of energies on graphs, including linear energies, interaction energies, and entropies. We interpret their smallest eigenvalues as mean-field Ricci curvature bounds on graphs. We next provide examples on two-point spaces and graph products. Finally, we present several applications of the proposed matrices; for example, we prove discrete Costa entropy power inequalities on a two-point space.

Abstract:

Issues in studying real-world time-series data, ranging from data preprocessing to model selection, will be discussed. In particular, specially designed LSTM RNNs and neural ODEs will be used to model the given patient data after preprocessing. Preliminary results show that relative errors in short-time predictions are clinically acceptable.
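As a generic illustration of the kind of sequence model mentioned (not the specially designed architecture from the talk), a minimal PyTorch LSTM forecaster for one-step-ahead prediction might look like this; the window length, hidden size, and feature count are arbitrary assumptions.

```python
import torch
import torch.nn as nn

class Forecaster(nn.Module):
    """Toy LSTM mapping a window of past measurements to the next value."""
    def __init__(self, n_features=1, hidden=32):
        super().__init__()
        self.lstm = nn.LSTM(n_features, hidden, batch_first=True)
        self.head = nn.Linear(hidden, n_features)

    def forward(self, x):             # x: (batch, time, features)
        out, _ = self.lstm(x)
        return self.head(out[:, -1])  # predict one step ahead

model = Forecaster()
x = torch.randn(8, 24, 1)             # 8 series, 24 past time steps each
print(model(x).shape)                  # torch.Size([8, 1])
```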

Abstract:

This talk is about the problem of learning an unknown function f from given data about f. The learning problem is to produce an approximation f^ to f that predicts the values of f away from the data. There are numerous settings for this learning problem, depending on:

(i) what additional information we have about f (known as a model class assumption);

(ii) how we measure how well f^ predicts f;

(iii) what is known about the data and data sites;

(iv) whether the data observations are polluted by noise.

A mathematical description of the optimal performance possible (the smallest possible error of recovery) is known in the presence of a model class assumption. Under standard model class assumptions, we show that a near optimal f^ can be found by solving a certain discrete over-parameterized optimization problem with a penalty term. Here, near optimal means that the error is bounded by a fixed constant times the optimal error. This explains the advantage of over-parameterization, which is commonly used in modern machine learning. The main results of this talk prove that over-parameterized learning with an appropriate loss function gives a near optimal approximation f^ of the function f from which the data is collected. Quantitative bounds are given for how much over-parameterization needs to be employed and how the penalization needs to be scaled in order to guarantee a near optimal recovery of f. An extension of these results to the case where the data is polluted by additive deterministic noise is also given.

This is a joint research project with Andrea Bonito, Ronald DeVore, and Guergana Petrova from Texas A&M University.
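Schematically (the symbols below are generic placeholders; the talk's quantitative results concern the precise loss, the penalty h, and the scaling λ), the discrete over-parameterized problem has the form

```latex
\hat{f} \;=\; \operatorname*{argmin}_{\theta \ \text{over-parameterized}}
\;\sum_{i=1}^{m} \bigl(f_\theta(x_i) - y_i\bigr)^{2} \;+\; \lambda\, h(\theta),
```

where (x_i, y_i) are the data, f_θ ranges over the network class, and h is a penalty whose scaling governs near optimal recovery.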

Abstract:
State estimation, or data assimilation, is about estimating "physical states" of interest from two sources of partial information: data produced by external sensors, and a (typically incomplete or uncalibrated) background model, given in terms of a partial differential equation. In this talk we focus on states that ideally satisfy a parabolic equation with known right-hand side but unknown initial values. Additional partial information is given in terms of data that represent the unknown state in a subdomain of the whole space-time cylinder up to a fixed time horizon. Recovering the state from this information is known to be a (mildly) ill-posed problem. Earlier contributions employ mesh-dependent regularizations in a fully discrete setting, bypassing a continuous problem formulation. Other contributions, closer to the approach discussed in this talk, consider a regularized least squares formulation first on an infinite-dimensional level. The essential difference in the present talk is that the least squares formulation exploits the "natural mapping properties" of the underlying forward problem. The main consequences delineating our results from previous work are:

(i) no excess regularity is needed, thereby mitigating the level of ill-posedness;

(ii) one obtains stronger a priori estimates that are uniform with respect to the regularization parameter;

(iii) error estimates no longer require consistent data;

(iv) one obtains rigorous computable a posteriori bounds that provide stopping criteria for iterative solvers and allow one to estimate data inconsistency and model bias.

The price is to deal with dual norms and their efficient evaluation. We sketch the main concepts and illustrate the results by numerical experiments.
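In rough schematic form (the notation is a generic placeholder, not the talk's precise formulation): given observations g of the state on a subdomain ω of the space-time cylinder and right-hand side f, a regularized least squares formulation minimizes

```latex
\min_{u}\;\; \bigl\|\partial_t u - \Delta u - f\bigr\|_{V'}^{2}
\;+\; \bigl\|u - g\bigr\|_{L^{2}(\omega)}^{2},
```

where the equation residual is measured in a dual norm V' matching the natural mapping properties of the parabolic forward operator; evaluating such dual norms efficiently is the price mentioned above.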

Abstract:

We present a systematic framework for Nesterov's accelerated gradient flows and Newton flows in spaces of probabilities endowed with general information metrics. Two metrics are considered: the Fisher-Rao metric and the Wasserstein-2 metric. In the Wasserstein-2 case, we prove convergence properties of the accelerated gradient flows and introduce their formulations in Gaussian families. Furthermore, we propose a practical discrete-time algorithm in particle implementations with an adaptive restart technique. We also formulate a novel bandwidth selection method, which learns the Wasserstein-2 gradient direction from Brownian-motion samples. Experimental results, including Bayesian inference, show the strength of the current approach compared with the state of the art. Finally, we discuss some further connections between inverse problems and data/neural network optimization techniques.
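For orientation, in the Euclidean setting Nesterov acceleration corresponds to a damped second-order ODE (the continuous-time limit identified by Su, Boyd, and Candès); the framework of the talk lifts dynamics of this type to probability spaces equipped with information metrics.

```latex
\ddot{X}_t \;+\; \frac{3}{t}\,\dot{X}_t \;+\; \nabla f(X_t) \;=\; 0.
```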

Abstract:

We present some concepts for the construction of nonlinear reduced deep neural network models for parameter dependent families of PDEs. The proposed methodology is based on combining stable variational formulations for the PDE models and regression concepts in machine learning. Central objectives concern:
- avoiding the Curse of Dimensionality in high parameter dimensionality regimes;
- rigorous accuracy quantification of the resulting estimators, based on devising variationally correct training risks that avoid the variational crimes often encountered with Physics-Informed Neural Network (PINN) formulations.
 
We highlight the role of optimization strategies involving dynamic network expansion, currently in progress in our group at UofSC, and high-dimensional sparsity concepts.

Abstract: It has been observed that many real-world networks, such as the Internet, social networks, biological networks, and collaboration graphs, have so-called power law degree distributions. A graph is called a power law graph if the fraction of vertices with degree k is approximately proportional to k^{-b} for some constant b. The classical Erdős-Rényi random graph model G(n,p) is not suitable for modeling these power law graphs, and many random graph models have been developed. Among these models, we directly generalize G(n,p) to "random graphs with given expected degree sequences". We consider several graph properties, such as the size and volume of the giant component, the average distance and the diameter, and the spectra. Some theoretical results will be compared to real data.
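The "given expected degrees" model (often called the Chung-Lu model, in which edge {i, j} appears independently with probability proportional to w_i w_j) is available in NetworkX; below is a small sketch sampling a graph with a heavy-tailed expected degree sequence. The size, exponent, and scaling are arbitrary illustration values.

```python
import networkx as nx

n, b = 1000, 2.5
# Heavy-tailed expected degrees w_i ~ i^{-1/(b-1)}, which yields an
# (approximate) power law degree distribution with exponent b.
w = [10.0 * (i + 1) ** (-1.0 / (b - 1.0)) for i in range(n)]

G = nx.expected_degree_graph(w, selfloops=False, seed=42)

giant = max(nx.connected_components(G), key=len)
print("giant component size:", len(giant))
```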



