Research interests: Data Science and Computational statistics, Bayesian hierarchial modelling, biological applications, Bayesian Machine Learning
Emneord:
Statistikk,
Statistikk og biostatistikk
Publikasjoner

Gramuglia, Emanuele; Storvik, Geir Olve & Stakkeland, Morten (2021). Clustering and automatic labelling within time series of cate gorical observations  with an application to marine log messages. The Journal of the Royal Statistical Society, Series C (Applied Statistics).
ISSN 00359254.
70(3), s 714 732 . doi: https://doi.org/10.1111/rssc.12483
Fulltekst i vitenarkiv.
Vis sammendrag
System logs or log files containing textual messages with associated time stamps are generated by many technologies and systems. The clustering technique proposed in this paper provides a tool to discover and identify patterns or macrolevel events in this data. The motivating application is logs generated by frequency converters in the propulsion system on a ship, while the general setting is fault identification and classification in complex industrial systems. The paper introduces an offline approach for dividing a time series of log messages into a series of discrete segments of random lengths. These segments are clustered into a limited set of states. A state is assumed to correspond to a specific operation or condition of the system, and can be a fault mode or a normal operation. Each of the states can be associated with a specific, limited set of messages, where messages appear in a random or semi‐structured order within the segments. These structures are in general not defined a priori. We propose a Bayesian hierarchical model where the states are characterised both by the temporal frequency and the type of messages within each segment. An algorithm for inference based on reversible jump MCMC is proposed. The performance of the method is assessed by both simulations and operational data.

Hubin, Aliaksandr; Storvik, Geir Olve; Grini, Paul Eivind & Butenko, Melinka Alonso (2020). A Bayesian binomial regression model with latent gaussian processes for modelling DNA methylation. Austrian Journal of Statistics.
ISSN 1026597X.
49(4), s 46 56 . doi:
10.17713/ajs.v49i4.1124
Fulltekst i vitenarkiv.
Vis sammendrag
Epigenetic observations are represented by the total number of reads from a given pool of cells and the number of methylated reads, making it reasonable to model this data by a binomial distribution. There are numerous factors that can influence the probability of success in a particular region. Moreover, there is a strong spatial (alongside the genome) dependence of these probabilities. We incorporate dependence on the covariates and the spatial dependence of the methylation probability for observations from a pool of cells by means of a binomial regression model with a latent Gaussian field and a logit link function. We apply a Bayesian approach including prior specifications on model configurations. We run a mode jumping Markov chain Monte Carlo algorithm (MJMCMC) across different choices of covariates in order to obtain the joint posterior distribution of parameters and models. This also allows finding the best set of covariates to model methylation probability within the genomic region of interest and individual marginal inclusion probabilities of the covariates.

Grytten, Ivar; Rand, Knut Dagestad; Nederbragt, Alexander Johan; Storvik, Geir Olve; Glad, Ingrid Kristine & Sandve, Geir Kjetil (2019). Graph Peak Caller: Calling ChIPseq peaks on graphbased reference genomes. PLoS Computational Biology.
ISSN 1553734X.
15(2), s 1 13 . doi:
10.1371/journal.pcbi.1006731
Fulltekst i vitenarkiv.

Hubin, Aliaksandr; Storvik, Geir Olve; Grini, Paul Eivind & Butenko, Melinka Alonso (2019). Bayesian binomial regression model with a latent Gaussian field for analysis of epigenetic data, In Y Kharin & Peter Filzmoser (ed.),
Proceedings of Computer Data Analysis and Modeling: Stochastics and Data Science 2019.
Belarusian State University Press.
ISBN 9789855668115.
1.
s 167
 171

Mysterud, Atle; Bleka, Øyvind; Nielsen, Anders; Steinheim, Geir; Yoccoz, Nigel Gilles & Storvik, Geir Olve (2019). Climate and synchrony of lamb body mass, In Jon Olaf Olaussen (ed.),
Contributions in natural resource economics. Festschrift to Anders Skonhoft.
Fagbokforlaget.
ISBN 9788245024715.
9.
s 183
 197
Se alle arbeider i Cristin

Hubin, Aliaksandr; Storvik, Geir Olve & Frommlet, Florian (2020). A novel algorithmic approach to Bayesian Logic Regression.
Vis sammendrag
Logic regression was developed more than a decade ago as a tool to construct predictors from Boolean combinations of binary covariates. It has been mainly used to model epistatic effects in genetic association studies, which is very appealing due to the intuitive interpretation of logic expressions to describe the interaction between genetic variations. Nevertheless logic regression has (partly due to computational challenges) remained less well known than other approaches to epistatic association mapping. Here we will adapt an advanced evolutionary algorithm called GMJMCMC (Genetically modified Mode Jumping Markov Chain Monte Carlo) to perform Bayesian model selection in the space of logic regression models. After describing the algorithmic details of GMJMCMC we perform a comprehensive simulation study that illustrates its performance given logic regression terms of various complexity. Specifically GMJMCMC is shown to be able to identify threeway and even fourway interactions with relatively large power, a level of complexity which has not been achieved by previous implementations of logic regression. We apply GMJMCMC to reanalyze QTL (quantitative trait locus) mapping data for Recombinant Inbred Lines in Arabidopsis thaliana and from a backcross population in Drosophila where we identify several interesting epistatic effects. The method is implemented in an R package which is available on github.

Hubin, Aliaksandr; Storvik, Geir Olve & Frommlet, Florian (2020). Rejoinder for the discussion of the paper "A Novel Algorithmic Approach to Bayesian Logic Regression". Bayesian Analysis.
ISSN 19360975.
15(1), s 312 333 . doi:
10.1214/18ba1141
Vis sammendrag
Logic regression was developed more than a decade ago as a tool to construct predictors from Boolean combinations of binary covariates. It has been mainly used to model epistatic effects in genetic association studies, which is very appealing due to the intuitive interpretation of logic expressions to describe the interaction between genetic variations. Nevertheless logic regression has (partly due to computational challenges) remained less well known than other approaches to epistatic association mapping. Here we will adapt an advanced evolutionary algorithm called GMJMCMC (Genetically modified Mode Jumping Markov Chain Monte Carlo) to perform Bayesian model selection in the space of logic regression models. After describing the algorithmic details of GMJMCMC we perform a comprehensive simulation study that illustrates its performance given logic regression terms of various complexity. Specifically GMJMCMC is shown to be able to identify threeway and even fourway interactions with relatively large power, a level of complexity which has not been achieved by previous implementations of logic regression. We apply GMJMCMC to reanalyze QTL (quantitative trait locus) mapping data for Recombinant Inbred Lines in Arabidopsis thaliana and from a backcross population in Drosophila where we identify several interesting epistatic effects. The method is implemented in an R package which is available on github.

Storvik, Geir Olve (2020). Neural networks vs generalized linear models.

Storvik, Geir Olve (2020). "Preliminaries for Deep Neural Networks: Recapture of Linear Algebra, Gradient Descents and Generalized Linear Models".

Hubin, Aliaksandr & Storvik, Geir Olve (2019). Combining Model and Parameter Uncertainty in Bayesian Neural Networks.

Hubin, Aliaksandr & Storvik, Geir Olve (2019). Combining Model and Parameter Uncertainty in Bayesian Neural Networks.
Vis sammendrag
Bayesian neural networks (BNNs) have recently regained a significant amount of attention in the deep learning community due to the development of scalable approximate Bayesian inference techniques. There are several advantages of using Bayesian approach: Parameter and prediction uncertainty become easily available, facilitating rigid statistical analysis. Furthermore, prior knowledge can be incorporated. However so far there have been no scalable techniques capable of combining both model (structural) and parameter uncertainty. In this paper we introduce the concept of model uncertainty in BNNs and hence make inference in the joint space of models and parameters. Moreover, we suggest an adaptation of a scalable variational inference approach with reparametrization of marginal inclusion probabilities to incorporate the model space constraints. Finally, we show that incorporating model uncertainty via Bayesian model averaging and Bayesian model selection allows to drastically sparsify the structure of BNNs without significant loss of predictive power.

Hubin, Aliaksandr & Storvik, Geir Olve (2019). Combining Model and Parameter Uncertainty in Bayesian Neural Networks.

Hubin, Aliaksandr & Storvik, Geir Olve (2019). Combining Model and Parameter Uncertainty in Bayesian Neural Networks.
Vis sammendrag
Bayesian neural networks (BNNs) have recently regained a significant amount of attention in the deep learning community due to the development of scalable approximate Bayesian inference techniques. There are several advantages of using Bayesian approach: Parameter and prediction uncertainty become easily available, facilitating rigid statistical analysis. Furthermore, prior knowledge can be incorporated. However so far there have been no scalable techniques capable of combining both model (structural) and parameter uncertainty. In this paper we introduce the concept of model uncertainty in BNNs and hence make inference in the joint space of models and parameters. Moreover, we suggest an adaptation of a scalable variational inference approach with reparametrization of marginal inclusion probabilities to incorporate the model space constraints. Finally, we show that incorporating model uncertainty via Bayesian model averaging and Bayesian model selection allows to drastically sparsify the structure of BNNs without significant loss of predictive power.

Hubin, Aliaksandr & Storvik, Geir Olve (2019). Combining Model and Parameter Uncertainty in Bayesian Neural Networks.

Hubin, Aliaksandr; Storvik, Geir Olve; Grini, Paul Eivind & Butenko, Melinka Alonso (2019). Bayesian binomial regression model with a latent Gaussian field for analysis of epigenetic data.

Storvik, Geir Olve (2019). Bayesian approaches to neural networks.

Storvik, Geir Olve (2019). Bayesian approaches to neural networks.

Storvik, Geir Olve (2019). Bayesian approaches to neural networks.

Storvik, Geir Olve (2019). Education in Statistics and Data Science at the University of Oslo.

Storvik, Geir Olve (2019). Flexible Bayesian Nonlinear Model Configuration.
Se alle arbeider i Cristin
Publisert 13. nov. 2010 14:06
 Sist endret 6. aug. 2020 10:16