
List of Publications of EMCL

JabRef references
Author Title Year Journal/Proceedings Reftype DOI/URL
Huang, J., Brenna, C., Khan, A.u.M., Daniele, C., Rudolf, R., Heuveline, V. and Gretz, N. A cationic near infrared fluorescent agent and ethyl-cinnamate tissue clearing protocol for vascular staining and imaging 2019 Nature Scientific Reports
Vol. 9(1), pp. 521
article URL
Abstract: Understanding vascular structures and dysfunction is a fundamental challenge. This task has been approached by using traditional methodologies such as microscopic computed tomography and magnetic resonance imaging. Both techniques are not only expensive but also time-consuming. Here, we present a new method for visualizing vascular structures in different organs in an efficient manner. A cationic near infrared (NIR) fluorescent dye was developed with attractive features to specifically stain blood vessels. Furthermore, we refined the process of organ staining and harvesting by retrograde perfusion and optimized the subsequent dehydration and clearing process by the use of an automatic tissue processor and a non-toxic substance, ethyl-cinnamate. Using this approach, the time interval between organ harvesting and microscopic analysis can be reduced from day(s) or weeks to 4 hours. Finally, we have demonstrated that the new NIR fluorescent agent in combination with confocal or light-sheet microscopy can be efficiently used for visualization of vascular structures, such as the blood vessels in different organs e.g. glomeruli in kidneys, with an extremely high resolution. Our approach facilitates the development of automatic image processing and the quantitative analysis to study vascular and kidney diseases.
BibTeX:
@article{Huang2019,
  author = {Huang, Jiaguo and Brenna, Cinzia and Khan, Arif ul Maula and Daniele, Cristina and Rudolf, Rüdiger and Heuveline, Vincent and Gretz, Norbert},
  title = {A cationic near infrared fluorescent agent and ethyl-cinnamate tissue clearing protocol for vascular staining and imaging},
  journal = {Nature Scientific Reports},
  year = {2019},
  volume = {9},
  number = {1},
  pages = {521},
  url = {https://doi.org/10.1038/s41598-018-36741-1}
}
van de Kamp, T., Schwermann, A.H., dos Santos Rolo, T., Lösel, P.D., Engler, T., Etter, W., Faragó, T., Göttlicher, J., Heuveline, V., Kopmann, A., Mähler, B., Mörs, T., Odar, J., Rust, J., Tan Jerome, N., Vogelgesang, M., Baumbach, T. and Krogmann, L. Parasitoid biology preserved in mineralized fossils 2018 Nature Communications
Vol. 9(1), pp. 3325
article DOI URL
Abstract: About 50% of all animal species are considered parasites. The linkage of species diversity to a parasitic lifestyle is especially evident in the insect order Hymenoptera. However, fossil evidence for host-parasitoid interactions is extremely rare, rendering hypotheses on the evolution of parasitism assumptive. Here, using high-throughput synchrotron X-ray microtomography, we examine 1510 phosphatized fly pupae from the Paleogene of France and identify 55 parasitation events by four wasp species, providing morphological and ecological data. All species developed as solitary endoparasitoids inside their hosts and exhibit different morphological adaptations for exploiting the same hosts in one habitat. Our results allow systematic and ecological placement of four distinct endoparasitoids in the Paleogene and highlight the need to investigate ecological data preserved in the fossil record.
BibTeX:
@article{vandeKamp2018,
  author = {van de Kamp, Thomas and Schwermann, Achim H. and dos Santos Rolo, Tomy and Lösel, Philipp D. and Engler, Thomas and Etter, Walter and Faragó, Tomás and Göttlicher, Jörg and Heuveline, Vincent and Kopmann, Andreas and Mähler, Bastian and Mörs, Thomas and Odar, Janes and Rust, Jes and Tan Jerome, Nicholas and Vogelgesang, Matthias and Baumbach, Tilo and Krogmann, Lars},
  title = {Parasitoid biology preserved in mineralized fossils},
  journal = {Nature Communications},
  year = {2018},
  volume = {9},
  number = {1},
  pages = {3325},
  url = {https://doi.org/10.1038/s41467-018-05654-y},
  doi = {http://doi.org/10.1038/s41467-018-05654-y}
}
Seelig, T., Meyer, A., Gerstner, P., Meier, M., Jongmanns, M., Baumann, M., Heuveline, V. and Egbers, C. Dielectrophoretic force-driven convection in annular geometry under Earth's gravity 2018 arXiv preprint
arXiv:1812.05460
preprint URL
Abstract: A radial temperature gradient together with an inhomogeneous radial electric field gradient is applied to a dielectric fluid confined in a vertical cylindrical annulus inducing thermal electro-hydrodynamic convection. Aims: Identification of the stability of the flow and hence of the line of marginal stability separating stable laminar free (natural) convection from thermal electro-hydrodynamic convection, its flow structures, pattern formation and critical parameters. Methods: Combination of different measurement techniques, namely the shadowgraph method and particle image velocimetry, as well as numerical simulation are used to qualify/quantify the flow. Results: We identify the transition from stable laminar free convection to thermal electro-hydrodynamic convective flow in a wide range of Rayleigh number and electric potential. The line of marginal stability found confirms results from linear stability analysis. The flow after first transition forms a structure of vertically aligned stationary columnar modes. Additionally, we experimentally confirm critical parameters resulting from linear stability analysis.
BibTeX:
@article{seelig2018dielectrophoretic,
  title={Dielectrophoretic force-driven convection in annular geometry under Earth's gravity},
  author={Seelig, Torsten and Meyer, Antoine and Gerstner, Philipp and Meier, Martin and Jongmanns, Marcel and Baumann, Martin and Heuveline, Vincent and Egbers, Christoph},
  journal={arXiv preprint arXiv:1812.05460},
  year={2018}
}
Mexner, W., Bonn, M., Kopmann, A., Mauch, V., Ressmann, D., Chilingaryan, S.A., Jerome, N.T., van de Kamp, T., Heuveline, V. and Lösel, P. OpenGL® API-Based Analysis of Large Datasets in a Cloud Environment 2018 Design and Use of Virtualization Technology in Cloud Computing, Publisher IGI Global
pp. 161-181
incollection
BibTeX:
@incollection{mexner2018opengl,
  title={OpenGL{\textregistered} API-Based Analysis of Large Datasets in a Cloud Environment},
  author={Mexner, Wolfgang and Bonn, Matthias and Kopmann, Andreas and Mauch, Viktor and Ressmann, Doris and Chilingaryan, Suren A and Jerome, Nicholas Tan and van de Kamp, Thomas and Heuveline, Vincent and L{\"o}sel, Philipp},
  booktitle={Design and Use of Virtualization Technology in Cloud Computing},
  pages={161--181},
  year={2018},
  publisher={IGI Global}
}
John, D., Schick, M. and Heuveline, V. Bayesian inference for estimating model discrepancy of an electric motor 2018 PAMM, Wiley Online Library
Vol. 18(1)
article
BibTeX:
@article{john2018bayesian,
  title={Bayesian inference for estimating model discrepancy of an electric motor},
  author={John, David and Schick, Michael and Heuveline, Vincent},
  journal={PAMM},
  volume={18},
  number={1},
  pages={1--2},
  year={2018},
  publisher={Wiley Online Library}
}
Gawlok, S. and Heuveline, V. Nested Schur-Complement Solver for a Low-Mach Number Model: Application to a Cyclone-Cyclone Interaction 2018 Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(02)
article DOI URL
Abstract: Forecasting the dynamical evolution of interacting tropical cyclones poses a computationally challenging problem. An attractive approach for modeling and simulating such type of dynamics is given by Low-mach models. A formulation of a Low-mach model is presented along with the description of a benchmark scenario of two interacting tropical cyclones. For the discrete nonlinear equations of this model, an improved variant of the nested Schur-complement preconditioner is proposed. A detailed discussion of the numerical results is conducted with a special emphasis on computational costs and scalability. It will be demonstrated that the new preconditioner clearly outperforms a prior version of this kind of preconditioner with respect to computing time by a factor of two, but at the cost of a slightly reduced scalability.
BibTeX:
@article{emclpp56653,
  author = {Simon Gawlok and Vincent Heuveline},
  title = {Nested Schur-Complement Solver for a Low-Mach Number Model: Application to a Cyclone-Cyclone Interaction},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2018},
  volume = {0},
  number = {02},
  url = {https://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/56653},
  doi = {https://doi.org/10.11588/emclpp.2018.02.56653}
}
John, D., Schick, M. and Heuveline, V. Learning model discrepancy of an electric motor with Bayesian inference 2018 Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(01)
article DOI URL
Abstract: Uncertainty Quantification (UQ) is highly requested in computational modeling and simulation, especially in an industrial context. With the continuous evolution of modern complex systems demands on quality and reliability of simulation models increase. A main challenge is related to the fact that the considered computational models are rarely able to represent the true physics perfectly and demonstrate a discrepancy compared to measurement data. Further, an accurate knowledge of considered model parameters is usually not available. E.g. fluctuations in manufacturing processes of hardware components or noise in sensors introduce uncertainties which must be quantified in an appropriate way. Mathematically, such UQ tasks are posed as inverse problems, requiring efficient methods to solve. A popular approach for UQ in inverse problems is Bayesian inference. This work investigates the influence of model discrepancies onto the calibration of physical model parameters and further considers a Bayesian inference framework including an attempt to correct for model discrepancy by an additional term. A Markov Chain Monte Carlo (MCMC) method is utilized to approximate the posterior distribution. A polynomial expansion with unknown coefficients is used to approximate and learn model discrepancy and system parameters simultaneously. This work extends by discussion and specification of a guideline on how to define the model discrepancy term complexity, i.e. here the maximum polynomial degree, based on the available measurement data. Furthermore, the suggested method is applied to an electric motor model with synthetic measurement data and evaluated by comparing the results to the reference. The example illustrates the importance and promising perspective of the method by good approximation of discrepancy and parameters.
BibTeX:
@article{emclpp51320,
  author = {David John and Michael Schick and Vincent Heuveline},
  title = {Learning model discrepancy of an electric motor with Bayesian inference},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2018},
  volume = {0},
  number = {01},
  url = {https://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/51320},
  doi = {http://doi.org/10.11588/emclpp.2018.1.51320}
}
Baumann, M., Effinger, M., Eller, D., Heuveline, V., Kempf, C., Loos, L., Maylein, L., Peltzer, J., Pfeiffer, M., Probst, V. and others heiMAP: Virtueller Kartentisch 2017 E-Science-Tage 2017: Forschungsdaten managen misc DOI URL
BibTeX:
@misc{Baumann2017a,
  author = {Baumann, Martin and Effinger, Maria and Eller, Dirk and Heuveline, Vincent and Kempf, Christian and Loos, Lukas and Maylein, Leonhard and Peltzer, Jörg and Pfeiffer, Michelle and Probst, Veit and others},
  title = {heiMAP: Virtueller Kartentisch},
  year = {2017},
  url = {http://www.ub.uni-heidelberg.de/archiv/22880},
  doi = {http://dx.doi.org/10.11588/heidok.00022880}
}
Baumann, M., Gehbart, F., Mattes, O., Nikas, S. and Heuveline, V. Development and implementation of a temperature monitoring system for HPC systems 2017 Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS) misc URL
BibTeX:
@misc{Baumann2017b,
  author = {Baumann, Martin and Gehbart, Fabian and Mattes, Oliver and Nikas, Sotirios and Heuveline, Vincent},
  title = {Development and implementation of a temperature monitoring system for HPC systems},
  booktitle = {Parallel-Algorithmen, -Rechnerstrukturen und -Systemsoftware (PARS)},
  publisher = {Gesellschaft für Informatik e.V.},
  year = {2017},
  url = {http://fg-pars.gi.de/pars-mitteilungen/}
}
Bertsch, V., Fichtner, W., Heuveline, V. and Leibfried, T. Advances in Energy System Optimization: Proceedings of the first International Symposium on Energy System Optimization 2017 Proceedings of the first International Symposium on Energy System Optimization misc DOI
BibTeX:
@misc{Bertsch2017a,
  author = {Bertsch, Valentin and Fichtner, Wolf and Heuveline, Vincent and Leibfried, Thomas},
  title = {Advances in Energy System Optimization: Proceedings of the first International Symposium on Energy System Optimization},
  publisher = {Springer},
  year = {2017},
  doi = {https://doi.org/10.1007/978-3-319-51795-7}
}
Loderer, T. and Heuveline, V. New sparsing approach for real-time simulations of stiff models on electronic control units 2017 Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(02)
article DOI URL
Abstract: For real-time simulations of stiff models on electronic control units, one important ingredient is to reduce computation time within differentiation of the right-hand side of the underlying differential equation. A sparsing method represents a powerful tool in this context. For a reliable use of a real-time simulation of a stiff model in safety-critical conditions, it is important to be able to detect, when sparsing leads to an improper influence on the considered system. In this work we propose an extension which aims at addressing this numerically challenging problem in the context of real-time simulation.
BibTeX:
@article{emclpp38711,
  author = {Thomas Loderer and Vincent Heuveline},
  title = {New sparsing approach for real-time simulations of stiff models on electronic control units},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2017},
  volume = {0},
  number = {02},
  url = {https://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/38711},
  doi = {http://dx.doi.org/10.11588/emclpp.2017.02.38711}
}
Schoch, N. and Heuveline, V. Towards an Intelligent Framework for Personalized Simulation-enhanced Surgery Assistance: Linking a Simulation Ontology to a Reinforcement Learning Algorithm for Calibration of Numerical Simulations 2017 Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(05)
article DOI URL
Abstract: Evolving our previous research results in the context of cognition-guidance and patient-specifity for simulation-enhanced cardiac surgery assistance, in this work we further investigate on (1) a machine learning framework which allows to patient-individually calibrate soft tissue material parameters for subsequent simulation, and (2) a profound knowledge management framework which may enhance the ontology-driven overall setup of the cognition-guided surgery simulation in a clinic environment. Rather than being a closed research work with an in-depth theory backup and a complete evaluation, we here present a technical report and some interesting experimental works that are to serve for further research and development.
BibTeX:
@article{emclpp42079,
  author = {Nicolai Schoch and Vincent Heuveline},
  title = {Towards an Intelligent Framework for Personalized Simulation-enhanced Surgery Assistance: Linking a Simulation Ontology to a Reinforcement Learning Algorithm for Calibration of Numerical Simulations},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2017},
  volume = {0},
  number = {05},
  url = {https://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/42079},
  doi = {http://dx.doi.org/10.11588/emclpp.2017.05.42079}
}
Gawlok, S., Gerstner, P., Haupt, S., Heuveline, V., Kratzke, J., Lösel, P., Mang, K., Schmidtobreick, M., Schoch, N., Schween, N., Schwegler, J., Song, C. and Wlotzka, M. HiFlow3 – Technical Report on Release 2.0 2017 Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(06)
article DOI URL
BibTeX:
@article{emclpp42879,
  author = {Simon Gawlok and Philipp Gerstner and Saskia Haupt and Vincent Heuveline and Jonas Kratzke and Philipp Lösel and Katrin Mang and Mareike Schmidtobreick and Nicolai Schoch and Nils Schween and Jonathan Schwegler and Chen Song and Martin Wlotzka},
  title = {HiFlow3 – Technical Report on Release 2.0},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2017},
  volume = {0},
  number = {06},
  url = {https://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/42879},
  doi = {http://dx.doi.org/10.11588/emclpp.2017.06.42879}
}
Heuveline, V., Schick, M., Webster, C. and Zaspel, P. Uncertainty Quantification and High Performance Computing (Dagstuhl Seminar 16372) 2017 Dagstuhl Reports
Vol. 6(9)
inproceedings
BibTeX:
@inproceedings{Heuveline2017a,
  author = {Heuveline, Vincent and Schick, Michael and Webster, Clayton and Zaspel, Peter},
  title = {Uncertainty Quantification and High Performance Computing (Dagstuhl Seminar 16372)},
  booktitle = {Dagstuhl Reports},
  year = {2017},
  volume = {6},
  number = {9}
}
Klatt, S., Kraus, D., Kraft, P., Breuer, L., Wlotzka, M., Heuveline, V., Haas, E., Kiese, R. and Butterbach-Bahl, K. Exploring impacts of vegetated buffer strips on nitrogen cycling using a spatially explicit hydro-biogeochemical modeling approach 2017 Environmental Modelling & Software
Vol. 90, pp. 55-67 
article 
BibTeX:
@article{Klatt2017a,
  author = {Klatt, S and Kraus, D and Kraft, P and Breuer, L and Wlotzka, M and Heuveline, V and Haas, E and Kiese, R and Butterbach-Bahl, Klaus},
  title = {Exploring impacts of vegetated buffer strips on nitrogen cycling using a spatially explicit hydro-biogeochemical modeling approach},
  journal = {Environmental Modelling \& Software},
  publisher = {Elsevier},
  year = {2017},
  volume = {90},
  pages = {55--67}
}
Schmelzle, S., Heethoff, M., Heuveline, V., Lösel, P., Becker, J., Beckmann, F., Schluenzen, F., Hammel, J.U., Kopmann, A., Mexner, W., Vogelgesang, M., Jerome, N.T., Betz, O., Beutel, R., Wipfler, B., Blanke, A., Harzsch, S., Hörnig, M., Baumbach, T. and van de Kamp, T. The NOVA project: maximizing beam time efficiency through synergistic analyses of SRmCT data 2017 Proc. SPIE
Vol. 10391, pp. 10391 - 10391 - 17
inproceedings DOI URL
BibTeX:
@inproceedings{Losel2017a,
  author = { Sebastian Schmelzle and Michael Heethoff and Vincent Heuveline and Philipp Lösel and Jurgen Becker and Felix Beckmann and Frank Schluenzen and Jörg U. Hammel and Andreas Kopmann and Wolfgang Mexner and Matthias Vogelgesang and Nicholas Tan Jerome and Oliver Betz and Rolf Beutel and Benjamin Wipfler and Alexander Blanke and Steffen Harzsch and Marie Hörnig and Tilo Baumbach and Thomas van de Kamp},
  title = {The NOVA project: maximizing beam time efficiency through synergistic analyses of SRmCT data},
  booktitle = {Proc. SPIE},
  year = {2017},
  volume = {10391},
  pages = {10391 - 10391 - 17},
  url = {http://dx.doi.org/10.1117/12.2275959},
  doi = {http://dx.doi.org/10.1117/12.2275959}
}
Lösel, P. and Heuveline, V. A GPU Based Diffusion Method for Whole-Heart and Great Vessel Segmentation 2017 Reconstruction, Segmentation, and Analysis of Medical Images, RAMBO 2016 and HVSMR 2016, Athens, Greece, October 17, 2016, pp. 121-128 inbook DOI URL
Abstract: Segmenting the blood pool and myocardium from a 3D cardiovascular magnetic resonance (CMR) image allows to create a patient-specific heart model for surgical planning in children with complex congenital heart disease (CHD). Implementation of semi-automatic or automatic segmentation algorithms is challenging because of a high anatomical variability of the heart defects, low contrast, and intensity variations in the images. Therefore, manual segmentation is the gold standard but it is labor-intensive. In this paper we report the set-up and results of a highly scalable semi-automatic diffusion algorithm for image segmentation. The method extrapolates the information from a small number of expert manually labeled reference slices to the remaining volume. While results of most semi-automatic algorithms strongly depend on well-chosen but usually unknown parameters this approach is parameter-free. Validation is performed on twenty 3D CMR images.
BibTeX:
@inbook{Losel2017b,
  author = {Lösel, Philipp and Heuveline, Vincent},
  title = {A GPU Based Diffusion Method for Whole-Heart and Great Vessel Segmentation},
  booktitle = {Reconstruction, Segmentation, and Analysis of Medical Images, RAMBO 2016 and HVSMR 2016, Athens, Greece, October 17, 2016},
  publisher = {Lecture Notes in Computer Science Springer International Publishing},
  year = {2017},
  pages = {121--128},
  url = {https://doi.org/10.1007/978-3-319-52280-7_12},
  doi = {http://dx.doi.org/10.1007/978-3-319-52280-7_12}
}
Meyer-Hübner, N., Suriyah, M., Leibfried, T., Slednev, V., Bertsch, V., Fichtner, W., Gerstner, P., Schick, M. and Heuveline, V. Optimal Storage Operation with Model Predictive Control in the German Transmission Grid 2017 Advances in Energy System Optimization, pp. 31-45 incollection DOI
BibTeX:
@incollection{Meyer2017a,
  author = {Meyer-Hübner, Nico and Suriyah, Michael and Leibfried, Thomas and Slednev, Viktor and Bertsch, Valentin and Fichtner, Wolf and Gerstner, Philipp and Schick, Michael and Heuveline, Vincent},
  title = {Optimal Storage Operation with Model Predictive Control in the German Transmission Grid},
  booktitle = {Advances in Energy System Optimization},
  publisher = {Birkhäuser, Cham},
  year = {2017},
  pages = {31--45},
  doi = {https://doi.org/10.1007/978-3-319-51795-7}
}
Mexner, W., Bonn, M., Kopmann, A., Mauch, V., Ressmann, D., Chilingaryan, S.A., Jerome, N.T., van de Kamp, T., Heuveline, V., Lösel, P. et al. OpenGL® API-Based Analysis of Large Datasets in a Cloud Environment 2017 Advances in Computer and Electrical Engineering Design and Use of Virtualization Technology in Cloud Computing, pp. 161–181 article DOI
BibTeX:
@article{OpenGL2017a,
  author = {Mexner, Wolfgang and Bonn, Matthias and Kopmann, Andreas and Mauch, Viktor and Ressmann, Doris and Chilingaryan, Suren A. and Jerome, Nicholas Tan and van de Kamp, Thomas and Heuveline, Vincent and Lösel, Philipp and others},
  title = {OpenGL® API-Based Analysis of Large Datasets in a Cloud Environment},
  journal = {Advances in Computer and Electrical Engineering Design and Use of Virtualization Technology in Cloud Computing},
  year = {2017},
  pages = {161--181},
  doi = {http://dx.doi.org/10.4018/978-1-5225-2785-5.ch006}
}
Schoch, N., Engelhardt, S., De Simone, R., Wolf, I. and Heuveline, V. High Performance Computing for Cognition-Guided Cardiac Surgery: Soft Tissue Simulation for Mitral Valve Reconstruction in Knowledge-based Surgery Assistance 2017 Proc. 6th International Conference on High-Performance Scientific Computing 2015 (LNCS Springer), pp. [in press] article
BibTeX:
@article{Schoch2015a,
  author = {Schoch, N. and Engelhardt, S. and De Simone, R. and Wolf, I. and Heuveline, V.},
  title = {High Performance Computing for Cognition-Guided Cardiac Surgery: Soft Tissue Simulation for Mitral Valve Reconstruction in Knowledge-based Surgery Assistance},
  journal = {Proc. 6th International Conference on High-Performance Scientific Computing 2015 (LNCS Springer)},
  publisher = {Springer Heidelberg},
  year = {2017},
  pages = {[in press]}
}
Schoch, N. and Heuveline, V. Towards Cognition-Guided Patient-Specific FEM-based Cardiac Surgery Simulation 2017 Lecture Notes in Computer Science, Proceedings of the "Functional Imaging and Modeling of the Heart (FIMH) 2017" article DOI
BibTeX:
@article{Schoch2017-CSA-FIMH,
  author = {Schoch, N. and Heuveline, V.},
  title = {Towards Cognition-Guided Patient-Specific FEM-based Cardiac Surgery Simulation},
  journal = {Lecture Notes in Computer Science, Proceedings of the "Functional Imaging and Modeling of the Heart (FIMH) 2017"},
  year = {2017},
  doi = {http://dx.doi.org/10.1007/978-3-319-59448-4_12}
}
Slednev, V., Ruppert, M., Bertsch, V., Fichtner, W., Meyer-Hübner, N., Suriyah, M., Leibfried, T., Gerstner, P., Schick, M. and Heuveline, V. Regionalizing Input Data for Generation and Transmission Expansion Planning Models 2017 Advances in Energy System Optimization, pp. 205-217 incollection DOI
BibTeX:
@incollection{Slednev2017a,
  author = {Slednev, Viktor and Ruppert, Manuel and Bertsch, Valentin and Fichtner, Wolf and Meyer-Hübner, Nico and Suriyah, Michael and Leibfried, Thomas and Gerstner, Philipp and Schick, Michael and Heuveline, Vincent},
  title = {Regionalizing Input Data for Generation and Transmission Expansion Planning Models},
  booktitle = {Advances in Energy System Optimization},
  publisher = {Birkhäuser, Cham},
  year = {2017},
  pages = {205--217},
  doi = {https://doi.org/10.1007/978-3-319-51795-7}
}
Song, C., Stoll, M., Giske, K., Bendl, R. and Heuveline, V. Sparse Grids for quantifying motion uncertainties in biomechanical models of radiotherapy patients 2017 Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(01)
article DOI URL
BibTeX:
@article{Song2017a,
  author = {Chen Song and Markus Stoll and Kristina Giske and Rolf Bendl and Vincent Heuveline},
  title = {Sparse Grids for quantifying motion uncertainties in biomechanical models of radiotherapy patients},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2017},
  volume = {0},
  number = {01},
  url = {http://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/35059},
  doi = {http://dx.doi.org/10.11588/emclpp.2017.01.35059}
}
Song, C. and Heuveline, V. Multilevel preconditioner of Polynomial Chaos Method for quantifying uncertainties in a blood pump 2017 International Conference on Uncertainty Quantification in Computational Sciences and Engineering (UNCECOMP), Greece, 2017 inproceedings URL
BibTeX:
@inproceedings{Song2017b,
  author = {Song, Chen and Heuveline, Vincent},
  title = {Multilevel preconditioner of Polynomial Chaos Method for quantifying uncertainties in a blood pump},
  booktitle = {International Conference on Uncertainty Quantification in Computational Sciences and Engineering (UNCECOMP), Greece, 2017},
  publisher = {Scopus, Elsevier},
  year = {2017},
  url = {https://2017.uncecomp.org/proceedings/pdf/17204.pdf}
}
Wlotzka, M. and Heuveline, V. Energy-efficient multigrid smoothers and grid transfer operators on multi-core and GPU clusters 2017 Journal of Parallel and Distributed Computing
Vol. 100, pp. 181-192 
article 
BibTeX:
@article{Wlotzka2017a,
  author = {Wlotzka, Martin and Heuveline, Vincent},
  title = {Energy-efficient multigrid smoothers and grid transfer operators on multi-core and GPU clusters},
  journal = {Journal of Parallel and Distributed Computing},
  publisher = {Elsevier},
  year = {2017},
  volume = {100},
  pages = {181--192}
}
Wlotzka, M., Heuveline, V., Dolz, M.F., Heidari, M.R., Ludwig, T., Malossi, A.C.I. and Quintana-Orti, E.S. Energy-Aware High Performance Computing 2017 ICT-Energy Concepts for Energy Efficiency and Sustainability incollection
BibTeX:
@incollection{Wlotzka2017b,
  author = {Wlotzka, Martin and Heuveline, Vincent and Dolz, Manuel F and Heidari, M Reza and Ludwig, Thomas and Malossi, A Cristiano I and Quintana-Orti, Enrique S},
  title = {Energy-Aware High Performance Computing},
  booktitle = {ICT-Energy Concepts for Energy Efficiency and Sustainability},
  publisher = {InTech},
  year = {2017}
}
Wlotzka, M., Heuveline, V., Klatt, S., Kraus, D., Haas, E., Kiese, R., Butterbach-Bahl, K., Kraft, P. and Breuer, L. Parallel multiphysics simulations using OpenPALM with application to hydrology-biogeochemistry coupling 2017 Modeling, Simulation and Optimization of Complex Processes - HPSC 2015 inbook
BibTeX:
@inbook{Wlotzka2017c,
  author = {Wlotzka, M. and Heuveline, V. and Klatt, S. and Kraus, D. and Haas, E. and Kiese, R. and Butterbach-Bahl, K. and Kraft, P. and Breuer, L.},
  title = {Parallel multiphysics simulations using OpenPALM with application to hydrology-biogeochemistry coupling},
  booktitle = {Modeling, Simulation and Optimization of Complex Processes - HPSC 2015},
  publisher = {Springer},
  year = {2017}
}
Wlotzka, M. and Heuveline, V. An energy-efficient parallel multigrid methods for multi-core CPU platforms and HPC clusters 2017 EMCL Preprint Series
Vol. 03 
article 
BibTeX:
@article{Wlotzka2017d,
  author = {Wlotzka, M. and Heuveline, V.},
  title = {An energy-efficient parallel multigrid methods for multi-core CPU platforms and HPC clusters},
  journal = {EMCL Preprint Series},
  year = {2017},
  volume = {03}
}
Wlotzka, M., Morel, T., Piacentini, A. and Heuveline, V. New features for advanced dynamic parallel communication routines in OpenPALM: Algorithms and documentation 2017 EMCL Preprint Series
Vol. 04 
article 
BibTeX:
@article{Wlotzka2017e,
  author = {Wlotzka, M. and Morel, T. and Piacentini, A. and Heuveline, V.},
  title = {New features for advanced dynamic parallel communication routines in OpenPALM: Algorithms and documentation},
  journal = {EMCL Preprint Series},
  year = {2017},
  volume = {04}
}
Bromberger, M., Bastian, P., Bergeest, J.-P., Conrad, C., Heuveline, V., Rohr, K. and Karl, W. FPGA-accelerated Richardson-Lucy deconvolution for 3D image data 2016 Biomedical Imaging (ISBI), 2016 IEEE 13th International Symposium on, pp. 132-135 inproceedings
BibTeX:
@inproceedings{Bromberger2016a,
  author = {Bromberger, Michael and Bastian, Pascal and Bergeest, Jan-Philip and Conrad, Christian and Heuveline, Vincent and Rohr, Karl and Karl, Wolfgang},
  title = {FPGA-accelerated Richardson-Lucy deconvolution for 3D image data},
  booktitle = {Biomedical Imaging (ISBI), 2016 IEEE 13th International Symposium on},
  year = {2016},
  pages = {132--135}
}
Bromberger, M., Heuveline, V. and Karl, W. Reducing Energy Consumption of Data Transfers Using Runtime Data Type Conversion 2016 International Conference on Architecture of Computing Systems, pp. 239-250 inproceedings
BibTeX:
@inproceedings{Bromberger2016b,
  author = {Bromberger, Michael and Heuveline, Vincent and Karl, Wolfgang},
  title = {Reducing Energy Consumption of Data Transfers Using Runtime Data Type Conversion},
  booktitle = {International Conference on Architecture of Computing Systems},
  year = {2016},
  pages = {239--250}
}
Fetzer, A., Metzger, J., Katic, D., März, K., Wagner, M., Philipp, P., Engelhardt, S., Weller, T., Zelzer, S., Franz, A., Schoch, N., Heuveline, V., Maleshkova, M., Rettinger, A., Speidel, S., Wolf, I., Kenngott, H., Mehrabi, A., Müller, B., Maier-Hein, L., Meinzer, H.-P. and Nolden, M. Towards an open-source semantic data infrastructure for integrating clinical and scientific data in cognition-guided surgery 2016 Proceedings of SPIE Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations inproceedings
BibTeX:
@inproceedings{FetzerSPIE2016,
  author = {Andreas Fetzer and Jasmin Metzger and Darko Katic and Keno März and Martin Wagner and Patrick Philipp and Sandy Engelhardt and Tobias Weller and Sascha Zelzer and Alfred Franz and Nicolai Schoch and Vincent Heuveline and Maria Maleshkova and Achim Rettinger and Stefanie Speidel and Ivo Wolf and Hannes Kenngott and Arianeb Mehrabi and Beat Müller and Lena Maier-Hein and Hans-Peter Meinzer and Marco Nolden},
  title = {Towards an open-source semantic data infrastructure for integrating clinical and scientific data in cognition-guided surgery},
  booktitle = {Proceedings of SPIE Medical Imaging 2016: PACS and Imaging Informatics: Next Generation and Innovations},
  year = {2016}
}
Gerstner, P., Schick, M., Heuveline, V., Meyer-Hübner, N., Suriyah, M., Leibfried, T., Slednev, V., Fichtner, W. and Bertsch, V.A Domain Decomposition Approach for Solving Dynamic Optimal Power Flow Problems in Parallel with Application to the German Transmission Grid 2016Preprint Series of the Engineering Mathematics and Computing Lab(1) article 
BibTeX:
@article{Gerstner2016a,
  author = {Gerstner, Philipp and Schick, Michael and Heuveline, Vincent and Meyer-Hübner, Nico and Suriyah, Michael and Leibfried, Thomas and Slednev, Viktor and Fichtner, Wolf and Bertsch, Valentin},
  title = {A Domain Decomposition Approach for Solving Dynamic Optimal Power Flow Problems in Parallel with Application to the German Transmission Grid},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2016},
  number = {1}
}
Glaser, P., Schick, M., Petridis, K. and Heuveline, V.Comparison between a Polynomial Chaos surrogate model and Markov Chain Monte-Carlo for inverse Uncertainty Quantification based on an electric drive test bench 2016, pp. 8809-8826 inproceedings 
BibTeX:
@inproceedings{Glaser2016a,
  author = {Glaser, Philipp and Schick, Michael and Petridis, Kosmas and Heuveline, Vincent},
  title = {Comparison between a Polynomial Chaos surrogate model and Markov Chain Monte-Carlo for inverse Uncertainty Quantification based on an electric drive test bench},
  publisher = {Scopus, Elsevier},
  year = {2016},
  pages = {8809-8826}
}
Kratzke, J. and Heuveline, V.An analytically solvable benchmark problem for fluid-structure interaction with uncertain parameters 2016Preprint Series of the Engineering Mathematics and Computing Lab(2) article 
BibTeX:
@article{Kratzke2016a,
  author = {Kratzke, Jonas and Heuveline, Vincent},
  title = {An analytically solvable benchmark problem for fluid-structure interaction with uncertain parameters},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2016},
  number = {2}
}
Kratzke, J., Rengier, F., Weis, C., Beller, C.J. and Heuveline, V.In vitro flow assessment: From PC-MRI to computational fluid dynamics including fluid-structure interaction 2016Proceedings of SPIE Medical Imaging 2016: Physics of Medical Imaging inproceedings 
BibTeX:
@inproceedings{KratzkeSPIE2016,
  author = {Jonas Kratzke and Fabian Rengier and Christian Weis and Carsten J. Beller and Vincent Heuveline},
  title = {In vitro flow assessment: From PC-MRI to computational fluid dynamics including fluid-structure interaction},
  booktitle = {Proceedings of SPIE Medical Imaging 2016: Physics of Medical Imaging},
  year = {2016}
}
Lösel, P. and Heuveline, V.Enhancing a Diffusion Algorithm for 4D Image Segmentation Using Local Information 2016Proceedings of SPIE Medical Imaging 2016: Image Processing inproceedings 
BibTeX:
@inproceedings{LöselSPIE2016,
  author = {Philipp Lösel and Vincent Heuveline},
  title = {Enhancing a Diffusion Algorithm for 4D Image Segmentation Using Local Information},
  booktitle = {Proceedings of SPIE Medical Imaging 2016: Image Processing},
  year = {2016}
}
Reble, P., Ludwig, T., Müller, M.S., Nagel, W.E. and Heuveline, V.Editorial for the special issue on energy-aware high performance computing 2016 misc 
BibTeX:
@misc{Reble2016a,
  author = {Reble, Pablo and Ludwig, Thomas and Müller, Matthias S and Nagel, Wolfgang E and Heuveline, Vincent},
  title = {Editorial for the special issue on energy-aware high performance computing},
  publisher = {Springer Berlin Heidelberg},
  year = {2016}
}
Schick, M., Heuveline, V. and Le Maître, O.P.A Newton--Galerkin Method for Fluid Flow Exhibiting Uncertain Periodic Dynamics 2016SIAM Review
Vol. 58(1), pp. 119-140 
article 
BibTeX:
@article{Schick2016a,
  author = {Schick, Michael and Heuveline, Vincent and Le Maître, O. P.},
  title = {A Newton--Galerkin Method for Fluid Flow Exhibiting Uncertain Periodic Dynamics},
  journal = {SIAM Review},
  publisher = {Society for Industrial and Applied Mathematics},
  year = {2016},
  volume = {58},
  number = {1},
  pages = {119--140}
}
Schoch, N., Kißler, F., Stoll, M., Engelhardt, S., Simone, R., Wolf, I., Bendl, R. and Heuveline, V.Comprehensive patient-specific information preprocessing for cardiac surgery simulations 2016International journal of computer assisted radiology and surgery
Vol. 11(6), pp. 1051-1059 
article 
BibTeX:
@article{Schoch2016a,
  author = {Schoch, N and Kißler, F and Stoll, M and Engelhardt, S and Simone, R and Wolf, I and Bendl, R and Heuveline, V},
  title = {Comprehensive patient-specific information preprocessing for cardiac surgery simulations},
  journal = {International journal of computer assisted radiology and surgery},
  publisher = {Springer Berlin Heidelberg},
  year = {2016},
  volume = {11},
  number = {6},
  pages = {1051--1059}
}
Schoch, N., Speidel, S., Sure-Vetter, Y. and Heuveline, V.Towards Semantic Simulation for Patient-Specific Surgery Assistance 2016Proc. First International Workshop on Surgical Data Science
Vol. 1, pp. 58-63 
article 
BibTeX:
@article{Schoch2016b,
  author = {Schoch, N. and Speidel, S. and Sure-Vetter, Y. and Heuveline, V.},
  title = {Towards Semantic Simulation for Patient-Specific Surgery Assistance},
  journal = {Proc. First International Workshop on Surgical Data Science},
  year = {2016},
  volume = {1},
  pages = {58-63}
}
Schoch, N., Speidel, S., Sure-Vetter, Y. and Heuveline, V.Towards Semantic Simulation for Patient-Specific Surgery Assistance 2016Online Proceedings of the International Workshop on Surgical Data Science 2016 articleURL 
BibTeX:
@article{Schoch2017-SemanticSim-SDS,
  author = {Schoch, N. and Speidel, S. and Sure-Vetter, Y. and Heuveline, V.},
  title = {Towards Semantic Simulation for Patient-Specific Surgery Assistance},
  journal = {Online Proceedings of the International Workshop on Surgical Data Science 2016},
  year = {2016},
  url = {http://www.surgical-data-science.org/wp-content/uploads/2016/06/14-Towards-Semantic-Simulation-for-Patient-Specific-Surgery-Assistance.pdf}
}
Schoch, N., Kissler, F., Stoll, M., Engelhardt, S., Simone, R.D., Wolf, I., Bendl, R. and Heuveline, V.Comprehensive patient-specific information preprocessing for cardiac surgery simulations 2016International Journal of Computer Assisted Radiology and Surgery, Special Issue IPCAI 2016 inbookDOI  
BibTeX:
@inbook{SchochIJCARS2016,
  author = {Nicolai Schoch and Fabian Kissler and Markus Stoll and Sandy Engelhardt and Raffaele De Simone and Ivo Wolf and Rolf Bendl and Vincent Heuveline},
  title = {Comprehensive patient-specific information preprocessing for cardiac surgery simulations},
  booktitle = {International Journal of Computer Assisted Radiology and Surgery, Special Issue IPCAI 2016},
  publisher = {Springer New York, LLC},
  year = {2016},
  doi = {http://dx.doi.org/10.1007/s11548-016-1397-0}
}
Schoch, N., Philipp, P., Weller, T., Engelhardt, S., Volovyk, M., Fetzer, A., Nolden, M., Simone, R.D., Wolf, I., Maleshkova, M., Rettinger, A., Studer, R. and Heuveline, V.Cognitive tools pipeline for assistance of mitral valve surgery 2016Proceedings of SPIE Medical Imaging 2016: Image-Guided Procedures, Robotic Interventions, and Modeling inproceedings 
BibTeX:
@inproceedings{SchochSPIE2016,
  author = {Nicolai Schoch and Patrick Philipp and Tobias Weller and Sandy Engelhardt and Mykola Volovyk and Andreas Fetzer and Marco Nolden and Raffaele De Simone and Ivo Wolf and Maria Maleshkova and Achim Rettinger and Rudi Studer and Vincent Heuveline},
  title = {Cognitive tools pipeline for assistance of mitral valve surgery},
  booktitle = {Proceedings of SPIE Medical Imaging 2016: Image-Guided Procedures, Robotic Interventions, and Modeling},
  year = {2016}
}
Baumann, M., Heuveline, V., Scheck, L. and Jones, S.C.Goal-oriented adaptivity for idealised tropical cyclones: A binary interaction scenario 2015Meteorologische Zeitschrift, pp. - articleURL 
BibTeX:
@article{baumann2015,
  author = {Baumann, Martin and Heuveline, Vincent and Scheck, Leonhard and Jones, Sarah C.},
  title = {Goal-oriented adaptivity for idealised tropical cyclones: A binary interaction scenario},
  journal = {Meteorologische Zeitschrift},
  publisher = {Schweizerbart Science Publishers},
  year = {2015},
  pages = {-},
  url = {http://dx.doi.org/10.1127/metz/2015/0591}
}
Baumann, M., Förstner, J., Heuveline, V., Kratzke, J., Ritterbusch, S., Vogel, B. and Vogel, H.Model-Based Visualization of Instationary Geo-Data with Application to Volcano Ash Data 2015Handbook of Geomathematics, pp. 1417-1441 incollectionDOIURL 
BibTeX:
@incollection{Baumann2015,
  author = {Baumann, Martin and Förstner, Jochen and Heuveline, Vincent and Kratzke, Jonas and Ritterbusch, Sebastian and Vogel, Bernhard and Vogel, Heike},
  title = {Model-Based Visualization of Instationary Geo-Data with Application to Volcano Ash Data},
  booktitle = {Handbook of Geomathematics},
  publisher = {Springer Berlin Heidelberg},
  year = {2015},
  pages = {1417-1441},
  url = {http://dx.doi.org/10.1007/978-3-642-54551-1_87},
  doi = {http://dx.doi.org/10.1007/978-3-642-54551-1_87}
}
Hahn, T., Huuk, T., Osberghaus, A., Doninger, K., Nath, S., Hepbildikler, S., Heuveline, V. and Hubbuch, J.Calibration-free inverse modeling of ion-exchange chromatography in industrial antibody purification 2015Engineering in Life Sciences
Vol. 5 
article 
BibTeX:
@article{hahnELScalibfreeinverse,
  author = {Tobias Hahn and Thiemo Huuk and Anna Osberghaus and Katharina Doninger and Susanne Nath and Stefan Hepbildikler and Vincent Heuveline and Jürgen Hubbuch},
  title = {Calibration-free inverse modeling of ion-exchange chromatography in industrial antibody purification},
  journal = {Engineering in Life Sciences},
  year = {2015},
  volume = {5}
}
Hahn, T., Baumann, P., Huuk, T., Heuveline, V. and Hubbuch, J.UV absorption-based inverse modelling of protein chromatography 2015Engineering in Life Sciences
Vol. 4 
article 
BibTeX:
@article{hahnELSuvinverse,
  author = {Tobias Hahn and Pascal Baumann and Thiemo Huuk and Vincent Heuveline and Jürgen Hubbuch},
  title = {UV absorption-based inverse modelling of protein chromatography},
  journal = {Engineering in Life Sciences},
  year = {2015},
  volume = {4}
}
Hahn, T., Huuk, T., Heuveline, V. and Hubbuch, J.Simulating and Optimizing Preparative Protein Chromatography with ChromX 2015Journal of Chemical Education
Vol. 6 
article 
BibTeX:
@article{hahnJCEchromx,
  author = {T. Hahn and T. Huuk and V. Heuveline and J. Hubbuch},
  title = {Simulating and Optimizing Preparative Protein Chromatography with ChromX},
  journal = {Journal of Chemical Education},
  year = {2015},
  volume = {6}
}
Hoecker, M., Polsterer, K.L., Kugler, S.D. and Heuveline, V.Clustering of Complex Data-Sets Using Fractal Similarity Measures and Uncertainties 2015Computational Science and Engineering (CSE), 2015 IEEE 18th International Conference on, pp. 82-91 inproceedingsDOIURL 
BibTeX:
@inproceedings{Hoecker2015,
  author = {Hoecker, Maximilian and Polsterer, Kai Lars and Kugler, Sven Dennis and Heuveline, Vincent},
  title = {Clustering of Complex Data-Sets Using Fractal Similarity Measures and Uncertainties},
  booktitle = {Computational Science and Engineering (CSE), 2015 IEEE 18th International Conference on},
  year = {2015},
  pages = {82-91},
  url = {fileadmin/images/Publications/Paper_Hoecker_Heuveline.pdf},
  doi = {http://dx.doi.org/10.1109/CSE.2015.35}
}
Hoffmann, M., Gawlok, S., Treiber, E., Karl, W. and Heuveline, V.Interdisciplinary Practical Course on Parallel Finite Element Method Using HiFlow^3 2015
Vol. 9523Euro-Par 2015: Parallel Processing Workshops, pp. 160-171 
incollectionDOIURL 
BibTeX:
@incollection{Hoffmann2015,
  author = {Hoffmann, Markus and Gawlok, Simon and Treiber, Eva and Karl, Wolfgang and Heuveline, Vincent},
  title = {Interdisciplinary Practical Course on Parallel Finite Element Method Using HiFlow$^3$},
  booktitle = {Euro-Par 2015: Parallel Processing Workshops},
  publisher = {Springer International Publishing},
  year = {2015},
  volume = {9523},
  pages = {160-171},
  url = {http://dx.doi.org/10.1007/978-3-319-27308-2_14},
  doi = {http://dx.doi.org/10.1007/978-3-319-27308-2_14}
}
Kratzke, J., Schick, M. and Heuveline, V.Fluid-Structure Interaction Simulation of an Aortic Phantom with Uncertain Young's Modulus Using the Polynomial Chaos Expansion 2015Applied Mechanics and Materials
Vol. 807, pp. 34-44 
article 
BibTeX:
@article{Kratzke2015,
  author = {Kratzke, Jonas and Schick, Michael and Heuveline, Vincent},
  title = {Fluid-Structure Interaction Simulation of an Aortic Phantom with Uncertain Young's Modulus Using the Polynomial Chaos Expansion},
  journal = {Applied Mechanics and Materials},
  year = {2015},
  volume = {807},
  pages = {34-44}
}
Kratzke, J., Schoch, N., Weis, C., Mueller-Eschner, M., Speidel, S., Farag, M., Beller, C. and Heuveline, V.Enhancing 4D PC-MRI in an aortic phantom considering numerical simulations 2015Proceedings of SPIE Medical Imaging 2015: Physics of Medical Imaging inproceedings 
BibTeX:
@inproceedings{KratzkeSPIE2015,
  author = {Jonas Kratzke and Nicolai Schoch and Christian Weis and Matthias Mueller-Eschner and Stefanie Speidel and Mina Farag and Carsten Beller and Vincent Heuveline},
  title = {Enhancing 4D PC-MRI in an aortic phantom considering numerical simulations},
  booktitle = {Proceedings of SPIE Medical Imaging 2015: Physics of Medical Imaging},
  year = {2015}
}
Schick, M., Song, C. and Heuveline, V.A Polynomial Chaos Method for Uncertainty Quantification in Blood Pump Simulation 2015International Conference on Uncertainty Quantification in Computational Sciences and Engineering (UNCECOMP), Greece, 2015 inproceedingsURL 
BibTeX:
@inproceedings{Schick2015,
  author = {Schick, Michael and Song, Chen and Heuveline, Vincent},
  title = {A Polynomial Chaos Method for Uncertainty Quantification in Blood Pump Simulation},
  booktitle = {International Conference on Uncertainty Quantification in Computational Sciences and Engineering (UNCECOMP), Greece, 2015},
  publisher = {Scopus, Elsevier},
  year = {2015},
  url = {http://2015.uncecomp.org/proceedings/papers/uncecomp/U506.pdf}
}
Schoch, N., Kißler, F., Stoll, M., Engelhardt, S., de Simone, R., Wolf, I., Bendl, R. and Heuveline, V.Comprehensive Pre- & Post-Processing for Numerical Simulations in Cardiac Surgery Assistance 2015EMCL Preprint Series misc 
BibTeX:
@misc{Schoch2015,
  author = {Schoch, Nicolai and Kißler, Fabian and Stoll, Markus and Engelhardt, Sandy and de Simone, Raffaele and Wolf, Ivo and Bendl, Rolf and Heuveline, Vincent},
  title = {Comprehensive Pre- \& Post-Processing for Numerical Simulations in Cardiac Surgery Assistance},
  year = {2015}
}
Schoch, N., Engelhardt, S., Zimmermann, N., Speidel, S., Simone, R.D., Wolf, I. and Heuveline, V.Integration of a biomechanical simulation for mitral valve reconstruction into a knowledge-based surgery assistance system 2015Proceedings of SPIE Medical Imaging 2015: Image-Guided Procedures, Robotic Interventions, and Modeling inproceedings 
BibTeX:
@inproceedings{SchochSPIE2015,
  author = {Nicolai Schoch and Sandy Engelhardt and Norbert Zimmermann and Stefanie Speidel and Raffaele De Simone and Ivo Wolf and Vincent Heuveline},
  title = {Integration of a biomechanical simulation for mitral valve reconstruction into a knowledge-based surgery assistance system},
  booktitle = {Proceedings of SPIE Medical Imaging 2015: Image-Guided Procedures, Robotic Interventions, and Modeling},
  year = {2015}
}
Vincze, M., Borchert, S., Achatz, U., von Larcher, T., Baumann, M., Liersch, C., Remmler, S., Beck, T., Alexandrov, K.D., Egbers, C., Fröhlich, J., Heuveline, V., Hickel, S. and Harlander, U.Benchmarking in a rotating annulus: a comparative experimental and numerical study of baroclinic wave dynamics 2015Meteorologische Zeitschrift
Vol. 23(6), pp. 611-635 
articleURL 
BibTeX:
@article{vincze2015,
  author = {Vincze, Miklos and Borchert, Sebastian and Achatz, Ulrich and von Larcher, Thomas and Baumann, Martin and Liersch, Claudia and Remmler, Sebastian and Beck, Teresa and Alexandrov, Kiril D. and Egbers, Christoph and Fröhlich, Jochen and Heuveline, Vincent and Hickel, Stefan and Harlander, Uwe},
  title = {Benchmarking in a rotating annulus: a comparative experimental and numerical study of baroclinic wave dynamics},
  journal = {Meteorologische Zeitschrift},
  publisher = {Schweizerbart Science Publishers},
  year = {2015},
  volume = {23},
  number = {6},
  pages = {611-635},
  url = {http://dx.doi.org/10.1127/metz/2014/0600}
}
Wlotzka, M. and Heuveline, V.Energy-aware mixed precision iterative refinement for linear systems on GPU-accelerated multi-node clusters 2015Proceedings of the 26th GI/ITG Workshop on Parallel Algorithms, Computer Architectures, and System Software inproceedings 
Abstract: Modern high-performance computing systems are often built as a cluster of interconnected compute nodes, where each node is built upon a hybrid hardware stack of multi-core processors and many-core accelerators. To efficiently use such systems, numerical methods must embrace the different levels of parallelism from the coarse-grained distributed memory cluster level to the fine-grained shared memory node level parallelism. Synchronization requirements of numerical methods may diminish parallel performance and result in increased energy consumption. We investigate block-asynchronous iteration methods in combination with mixed precision iterative refinement to address this issue. We depict our implementation for multi-node distributed systems using MPI with a hybrid node level parallelization for multi-core CPUs using OpenMP and multiple CUDA-capable accelerators. Our numerical experiments are based on a linear system arising from the finite element discretization of the Poisson equation. We present energy and runtime measurements for a quad-CPU and dual-GPU test system. We achieve tremendous runtime and energy savings for block-asynchronous GPU-accelerated iteration using mixed precision compared to CPU-only computation. We also encounter configurations where the CPU-only computation is advantageous over the GPU-accelerated method.
BibTeX:
@inproceedings{Wlotzka2015,
  author = {M. Wlotzka and V. Heuveline},
  title = {Energy-aware mixed precision iterative refinement for linear systems on GPU-accelerated multi-node clusters},
  booktitle = {Proceedings of the 26th GI/ITG Workshop on Parallel Algorithms, Computer Architectures, and System Software},
  year = {2015}
}
Wlotzka, M. and Heuveline, V.Block-asynchronous and Jacobi smoothers for a multigrid solver on GPU-accelerated HPC clusters 2015Preprint Series of the Engineering Mathematics and Computing Lab(03) article 
BibTeX:
@article{Wlotzka2015a,
  author = {Martin Wlotzka and Vincent Heuveline},
  title = {Block-asynchronous and Jacobi smoothers for a multigrid solver on GPU-accelerated HPC clusters},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2015},
  number = {03}
}
Wlotzka, M., Malossi, A., Heuveline, V. and Bekas, C.Power and Energy Characterization of Finite Element Matrices on IBM Power 2015ICT-Energy Letters
Vol. 10 
article 
Abstract: In this work we investigate the performance and energy characteristics of different finite element discretizations arising from the numerical solution of PDEs. We measure and compare performance of sparse matrix-vector multiplications and we analyze the behavior obtained for different mesh properties as the degree of the FE basis polynomial and the level of h-refinement. This analysis is carried out on the new IBM Power 8 platform.
BibTeX:
@article{Wlotzka2015b,
  author = {M. Wlotzka and A.C.I. Malossi and V. Heuveline and C. Bekas},
  title = {Power and Energy Characterization of Finite Element Matrices on IBM Power},
  journal = {ICT-Energy Letters},
  year = {2015},
  volume = {10}
}
Bauer, W., Baumann, M., Scheck, L., Gassmann, A., Heuveline, V. and Jones, S.Simulation of tropical-cyclone-like vortices in shallow-water ICON-hex using goal-oriented r-adaptivity 2014Theoretical and Computational Fluid Dynamics
Vol. 28(1), pp. 107-128 
articleDOIURL 
Abstract: We demonstrate how efficient r-adapted grids for the prediction of tropical cyclone (TC) tracks can be constructed with the help of goal-oriented error estimates. The binary interaction of TCs in a barotropic model is used as a test case. We perform a linear sensitivity analysis for this problem to evaluate the contribution of each grid cell to an error measure correlated with the cyclone positions. This information allows us to estimate the local grid resolution required to minimize the TC position error. An algorithm involving the solution of a Poisson problem is employed to compute how grid points should be moved such that the desired local resolution is achieved. A hexagonal shallow-water version of the next-generation numerical weather prediction and climate model ICON is used to perform model runs on these adapted grids. The results show that for adequately chosen grid adaptation parameters, the accuracy of the track prediction can be maintained even when a coarser grid is used in regions for which the estimated error contribution is low. Accurate track predictions are obtained only when a grid with high resolution consisting of cells with nearly constant size and regular shape covers the part of the domain where the estimated error contribution is large. The number of grid points required to achieve a certain accuracy in the track prediction can be decreased substantially with our approach.
BibTeX:
@article{Bauer2014,
  author = {Bauer, Werner and Baumann, Martin and Scheck, Leonhard and Gassmann, Almut and Heuveline, Vincent and Jones, Sarah C.},
  title = {Simulation of tropical-cyclone-like vortices in shallow-water ICON-hex using goal-oriented r-adaptivity},
  journal = {Theoretical and Computational Fluid Dynamics},
  publisher = {Springer Berlin Heidelberg},
  year = {2014},
  volume = {28},
  number = {1},
  pages = {107-128},
  url = {http://dx.doi.org/10.1007/s00162-013-0303-4},
  doi = {http://dx.doi.org/10.1007/s00162-013-0303-4}
}
Breitbart, J., Schmidtobreick, M. and Heuveline, V.Evaluation of the Global Address Space Programming Interface (GASPI) 2014Proceedings of "28th IEEE International Parallel and Distributed Processing Symposium" article 
Abstract: The first exascale supercomputers are expected by the end of this decade and will presumably feature an increase in core count, but a decrease in the amount of memory available per core. As of now, it is still unclear if the current programming models will provide high performance on exascale systems. One programming model considered to be an alternative to MPI is the so-called partitioned global address space (PGAS) model. Within this paper we evaluate a relatively new PGAS API: the Global Address Space Programming Interface (GASPI) and compare it to MPI on the basis of microbenchmarks. These benchmarks show that GASPI provides about the same level of performance for single-threaded communication, but is up to an order of magnitude faster than both Intel and IBM MPI for multithreaded communication. Hereafter, we discuss the different features of GASPI in comparison to two main PGAS languages, namely UPC and CAF. In addition, we present a basic numerical algorithm, a dense matrix-matrix multiplication, as an example on how an implementation can make efficient use of GASPI’s features, especially the asynchronous and one-sided communication mechanisms.
BibTeX:
@article{Breitbart2014,
  author = {Breitbart, Jens and Schmidtobreick, Mareike and Heuveline, Vincent},
  title = {Evaluation of the Global Address Space Programming Interface (GASPI)},
  journal = {Proceedings of "28th IEEE International Parallel and Distributed Processing Symposium"},
  publisher = {IEEE Computer Society},
  year = {2014}
}
Paulus, C., Suwelack, S., Schoch, N., Speidel, S., Dillmann, R. and Heuveline, V.Simulation of Complex Cuts in Soft Tissue with the Extended Finite Element Method (X-FEM) 2014Preprint Series of the Engineering Mathematics and Computing Lab
Vol. 0(02) 
miscURL 
BibTeX:
@misc{emclpp17635,
  author = {Christoph Paulus and Stefan Suwelack and Nicolai Schoch and Stefanie Speidel and Rüdiger Dillmann and Vincent Heuveline},
  title = {Simulation of Complex Cuts in Soft Tissue with the Extended Finite Element Method (X-FEM)},
  journal = {Preprint Series of the Engineering Mathematics and Computing Lab},
  year = {2014},
  volume = {0},
  number = {02},
  url = {http://journals.ub.uni-heidelberg.de/index.php/emcl-pp/article/view/17635}
}
Hahn, T., Sommer, A., Osberghaus, A., Heuveline, V. and Hubbuch, J.Adjoint-based Estimation and Optimization for Column Liquid Chromatography Models 2014Computers & Chemical Engineering
Vol. 4 
article 
BibTeX:
@article{hahnCCEcolumnliquidchrom,
  author = {Tobias Hahn and Anja Sommer and Anna Osberghaus and Vincent Heuveline and Jürgen Hubbuch},
  title = {Adjoint-based Estimation and Optimization for Column Liquid Chromatography Models},
  journal = {Computers \& Chemical Engineering},
  year = {2014},
  volume = {4}
}
Heuveline, V. and Schick, M.A hybrid generalized Polynomial Chaos method for stochastic dynamical systems 2014International Journal for Uncertainty Quantification article 
BibTeX:
@article{Schick2014,
  author = {Heuveline, V. and Schick, M.},
  title = {A hybrid generalized Polynomial Chaos method for stochastic dynamical systems},
  journal = {International Journal for Uncertainty Quantification},
  year = {2014}
}
Schick, M., Heuveline, V. and Le Maître, O.P.A Newton-Galerkin method for fluid flow exhibiting uncertain periodic dynamics 2014SIAM/ASA Journal on Uncertainty Quantification article 
BibTeX:
@article{Schick2014a,
  author = {Schick, M. and Heuveline, V. and Le Maître, O. P.},
  title = {A Newton-Galerkin method for fluid flow exhibiting uncertain periodic dynamics},
  journal = {SIAM/ASA Journal on Uncertainty Quantification},
  year = {2014}
}
Schick, M.A Parallel Multilevel Spectral Galerkin Solver for Linear Systems with Uncertain Parameters 20142014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing, pp. 352-359 inproceedingsDOI  
BibTeX:
@inproceedings{Schick2014b,
  author = {M. Schick},
  title = {A Parallel Multilevel Spectral Galerkin Solver for Linear Systems with Uncertain Parameters},
  booktitle = {2014 22nd Euromicro International Conference on Parallel, Distributed, and Network-Based Processing},
  year = {2014},
  pages = {352-359},
  doi = {http://dx.doi.org/10.1109/PDP.2014.82}
}
Song, C., Stavaker, K., Wlotzka, M., Fritzson, P. and Heuveline, V.PDE Modeling with Modelica via FMI Import of HiFlow3 C++ Components with Parallel Multi-Core Simulations 2014Proceedings of the 55th SIMS Conference on Modeling and Simulation inproceedings 
BibTeX:
@inproceedings{Song2014,
  author = {Song, Chen and Stavaker, Kristian and Wlotzka, Martin and Fritzson, Peter and Heuveline, Vincent},
  title = {PDE Modeling with Modelica via FMI Import of HiFlow3 C++ Components with Parallel Multi-Core Simulations},
  booktitle = {Proceedings of the 55th SIMS Conference on Modeling and Simulation},
  year = {2014}
}
Suwelack, S., Stoll, M., Schalck, S., Schoch, N., Dillmann, R., Bendl, R., Heuveline, V. and Speidel, S.The Medical Simulation Markup Language (MSML) – Simplifying the Biomechanical Modeling Workflow 2014MMVR 2014 (Medicine Meets Virtual Reality) article 
Abstract: Modeling and simulation of the human body by means of continuum mechanics has become an important tool in diagnostics, computer-assisted interventions and training. This modeling approach seeks to construct patient-specific biomechanical models from tomographic data. Usually many different tools such as segmentation and meshing algorithms are involved in this workflow. In this paper we present a generalized and flexible description for biomechanical models. The unique feature of the new modeling language is that it not only describes the final biomechanical simulation, but also the workflow how the biomechanical model is constructed from tomographic data. In this way, the MSML can act as a middleware between all tools used in the modeling pipeline. The MSML thus greatly facilitates the prototyping of medical simulation workflows for clinical and research purposes. In this paper, we not only detail the XML-based modeling scheme, but also present a concrete implementation. Different examples highlight the flexibility, robustness and ease-of-use of the approach.
BibTeX:
@article{Suwelack2014,
  author = {Stefan Suwelack and Markus Stoll and Sebastian Schalck and Nicolai Schoch and Ruediger Dillmann and Rolf Bendl and Vincent Heuveline and Stefanie Speidel},
  title = {The Medical Simulation Markup Language (MSML) – Simplifying the Biomechanical Modeling Workflow},
  journal = {MMVR 2014 (Medicine Meets Virtual Reality)},
  year = {2014}
}
Wlotzka, M. and Heuveline, V.A parallel solution scheme for multiphysics evolution problems using OpenPALM 2014EMCL Preprint Series misc 
BibTeX:
@misc{Wlotzka2014,
  author = {Wlotzka, Martin and Heuveline, Vincent},
  title = {A parallel solution scheme for multiphysics evolution problems using OpenPALM},
  year = {2014}
}
Baumann, M., Förstner, J., Heuveline, V., Kratzke, J., Ritterbusch, S., Vogel, B. and Vogel, H.Model-based Visualization of Instationary Geo-Data with Application to Volcano Ash Data 2013EMCL Preprint Series misc 
BibTeX:
@misc{Baumann2013,
  author = {Baumann, Martin and Förstner, Jochen and Heuveline, Vincent and Kratzke, Jonas and Ritterbusch, Sebastian and Vogel, Bernhard and Vogel, Heike},
  title = {Model-based Visualization of Instationary Geo-Data with Application to Volcano Ash Data},
  year = {2013}
}
Schindewolf, M., Rocker, B., Karl, W. and Heuveline, V.Evaluation of two Formulations of the Conjugate Gradients Method with Transactional Memory 2013EMCL Preprint Series(01) miscURL 
Abstract: Transactional Memory (TM) offers new possibilities for algorithmic design. This paper evaluates TM implementations of two algorithmic variations of the wide-spread conjugate gradients method (CG) regarding their performance on multi-core CPUs employing TM. Through applying tools for TM that visualize the TM application behavior, we show that the main bottleneck for both is the waiting times at barriers and illustrate the implementation of reduction operations with TM in a beneficial way. Performance monitoring through using the PAPI interface uncovers the quantity and type of instructions that each algorithm requires. This basic work is the key for environment-aware numerics as well as a hint for software developers who plan to use TM.
BibTeX:
@misc{emcl-preprint-2013-01,
  author = {Schindewolf, Martin and Rocker, Björn and Karl, Wolfgang and Heuveline, Vincent},
  title = {Evaluation of two Formulations of the Conjugate Gradients Method with Transactional Memory},
  journal = {EMCL Preprint Series},
  year = {2013},
  number = {01},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2013-01.pdf}
}
Hoecker, M. and Kunze, M.An on-demand scaling stereoscopic 3D video streaming service in the cloud 2013Journal of Cloud Computing: Advances, Systems and Applications
Vol. 2:14 
articleDOIURL 
Abstract: We describe a web service providing a complete stereoscopic 3D video multi-stream cloud application to serve a potentially very large number of clients over the Internet. The system architecture consists of a stream provider that leverages highly scalable and reliable cloud computing and storage services, with automatic load balancing capability for live and content streaming. By use of a suiting flash media plugin the content is displayed on a wide variety of 3D capable devices like for example 3D workstations or smart TV sets. Videos are made available by an on-line stream provider for live broadcasting or by cloud storage services. Compared to conventional 3D video streaming over satellite channels there are considerable savings in cost as well as a wider range of applicability and functional improvements. Possible areas of application are medical surgery, live concerts, and sports events.
BibTeX:
@article{Hoecker2013,
  author = {Hoecker, M. and Kunze, M.},
  title = {An on-demand scaling stereoscopic 3D video streaming service in the cloud},
  journal = {Journal of Cloud Computing: Advances, Systems and Applications},
  year = {2013},
  volume = {2:14},
  url = {http://www.journalofcloudcomputing.com/content/2/1/14},
  doi = {http://dx.doi.org/10.1186/2192-113X-2-14}
}
Nowak, F., Besenfelder, I., Karl, W., Schmidtobreick, M. and Heuveline, V.A Data-Driven Approach for Executing the CG Method on Reconfigurable High-Performance Systems 2013Proceedings of the 26th International Conference on Architecture of Computing Systems, pp. 171-182 inproceedingsDOIURL 
Abstract: Employing reconfigurable computing systems for numerical applications poses an interesting and promising approach toward increased performance. We study the applicability of the Convey HC-1 for numerical applications by decomposing a preconditioned conjugate gradient (CG) method into several independent kernels that can operate concurrently. To allow overlapped execution and to minimize data transfers, we stream the data between the kernel units using a central buffer set. A microprogrammable control unit orchestrates memory accesses, buffer writes/reads and kernel execution, and allows for further algorithms to be executed on the available kernel units. Solving the Poisson problem can thereby be accelerated up to 10 times compared to a single-threaded software version on the HC-1 and up to 1.2 times compared to a 2-socket hex-core Intel Xeon Westmere system with 24 hardware threads for large problem sizes with only a single application engine.
BibTeX:
@inproceedings{Nowak2013,
  author = {Nowak, Fabian and Besenfelder, Ingo and Karl, Wolfgang and Schmidtobreick, Mareike and Heuveline, Vincent},
  title = {A Data-Driven Approach for Executing the CG Method on Reconfigurable High-Performance Systems},
  booktitle = {Proceedings of the 26th International Conference on Architecture of Computing Systems},
  publisher = {Springer-Verlag},
  year = {2013},
  pages = {171--182},
  url = {http://dx.doi.org/10.1007/978-3-642-36424-2_15},
  doi = {http://dx.doi.org/10.1007/978-3-642-36424-2_15}
}
Ritterbusch, S., Ronnas, S., Waltschlaeger, I., Gerstner, P. and Heuveline, V.Augmented Reality Visualization of Numerical Simulations in Urban Environments 2013International Journal on Advances in Systems and Measurements(06) article 
BibTeX:
@article{Ronnas2013,
  author = {Sebastian Ritterbusch and Staffan Ronnas and Irina Waltschlaeger and Philipp Gerstner and Vincent Heuveline},
  title = {Augmented Reality Visualization of Numerical Simulations in Urban Environments},
  journal = {International Journal on Advances in Systems and Measurements},
  year = {2013},
  number = {06}
}
Schoch, N., Suwelack, S., Dillmann, R. and Heuveline, V.Simulation of Surgical Cutting in Soft Tissue using the Extended Finite Element Method (X-FEM) 2013EMCL Preprint Series misc 
BibTeX:
@misc{Schoch2013,
  author = {Schoch, Nicolai and Suwelack, Stefan and Dillmann, Rüdiger and Heuveline, Vincent},
  title = {Simulation of Surgical Cutting in Soft Tissue using the Extended Finite Element Method (X-FEM)},
  year = {2013}
}
Stavaker, K., Ronnas, S., Wlotzka, M., Heuveline, V. and Fritzson, P.PDE Modeling with Modelica via FMI Import of HiFlow3 C++ Components 2013Proceedings of the 54th SIMS Conference on Modeling and Simulation inproceedings 
BibTeX:
@inproceedings{Stavaker2013,
  author = {Stavaker, Kristian and Ronnas, Staffan and Wlotzka, Martin and Heuveline, Vincent and Fritzson, Peter},
  title = {PDE Modeling with Modelica via FMI Import of HiFlow3 C++ Components},
  booktitle = {Proceedings of the 54th SIMS Conference on Modeling and Simulation},
  year = {2013}
}
Wlotzka, M., Heuveline, V., Haas, E., Klatt, S., Kraus, D., Butterbach-Bahl, K., Kraft, P. and Breuer, L.Dynamic Simulation of Land Management Effects on Soil N2O Emissions using a coupled Hydrology-Ecosystem Model on the Landscape Scale 2013EMCL Preprint Series misc 
BibTeX:
@misc{Wlotzka2013,
  author = {Wlotzka, Martin and Heuveline, Vincent and Haas, Edwin and Klatt, Steffen and Kraus, David and Butterbach-Bahl, Klaus and Kraft, Philipp and Breuer, Lutz},
  title = {Dynamic Simulation of Land Management Effects on Soil N2O Emissions using a coupled Hydrology-Ecosystem Model on the Landscape Scale},
  year = {2013}
}
Barz, D., Bockelmann, H. and Heuveline, V.Optimization of an electrokinetic mixer for microfluidic applications 2012Biomicrofluidics
Vol. 6, pp. 024123 
article 
BibTeX:
@article{barzbiomicrofluidics,
  author = {Barz, Dominik and Bockelmann, Hendryk and Heuveline, Vincent},
  title = {Optimization of an electrokinetic mixer for microfluidic applications},
  journal = {Biomicrofluidics},
  year = {2012},
  volume = {6},
  pages = {024123}
}
Baumann, M., Heuveline, V., Scheck, L. and Jones, S.C.Construction of Economical Meshes for Tropical Cyclone Forecasting 201230th Conference on Hurricanes and Tropical Meteorology inproceedings 
BibTeX:
@inproceedings{Baumann_AMS_TC_Conference2012,
  author = {Baumann, M. and Heuveline, V. and Scheck, L. and Jones, S. C.},
  title = {Construction of Economical Meshes for Tropical Cyclone Forecasting},
  booktitle = {30th Conference on Hurricanes and Tropical Meteorology},
  year = {2012}
}
Rosen, R., Ritterbusch, S., Moeller, K. and Heuveline, V.Risk Analysis For Catheter Guided Aortic Valve Implantation 2012Biomedical Engineering / Biomedizinische Technik articleURL 
BibTeX:
@article{BioMed2012,
  author = {Rosen, Reem and Ritterbusch, Sebastian and Moeller, Knut and Heuveline, Vincent},
  title = {Risk Analysis For Catheter Guided Aortic Valve Implantation},
  journal = {Biomedical Engineering / Biomedizinische Technik},
  year = {2012},
  url = {http://www.degruyter.com/view/j/bmte.2012.57.issue-s1-I/bmt-2012-4494/bmt-2012-4494.xml}
}
Buchty, R., Heuveline, V., Karl, W. and Weiss, J.-P.A survey on hardware-aware and heterogeneous computing on multicore processors and accelerators 2012Concurrency and Computation: Practice and Experience
Vol. 24(7), pp. 663-675 
articleDOIURL 
BibTeX:
@article{Buchty2012,
  author = {Buchty, Rainer and Heuveline, Vincent and Karl, Wolfgang and Weiss, Jan-Philipp},
  title = {A survey on hardware-aware and heterogeneous computing on multicore processors and accelerators},
  journal = {Concurrency and Computation: Practice and Experience},
  publisher = {John Wiley \& Sons, Ltd},
  year = {2012},
  volume = {24},
  number = {7},
  pages = {663--675},
  url = {http://dx.doi.org/10.1002/cpe.1904},
  doi = {http://dx.doi.org/10.1002/cpe.1904}
}
Anzt, H., Tomov, S., Gates, M., Dongarra, J. and Heuveline, V.Block-asynchronous Multigrid Smoothers for GPU-accelerated Systems 2012Procedia CS
Vol. 9, pp. 7-16 
article 
BibTeX:
@article{DBLP:journals-procedia-AnztTGDH12,
  author = {Hartwig Anzt and Stanimire Tomov and Mark Gates and Jack Dongarra and Vincent Heuveline},
  title = {Block-asynchronous Multigrid Smoothers for GPU-accelerated Systems},
  journal = {Procedia CS},
  year = {2012},
  volume = {9},
  pages = {7-16}
}
Janko, S., Rocker, B., Schindewolf, M., Heuveline, V. and Karl, W.Software Transactional Memory, OpenMP and Pthread implementations of the Conjugate Gradients Method - a Preliminary Evaluation 2012EMCL Preprint Series(01)EMCL Preprint Series miscURL 
Abstract: This paper shows the runtime and cache-efficiency of parallel implementations of the Conjugate Gradients Method based on the three paradigms Software Transactional Memory (STM), OpenMP and Pthreads. While OpenMP and Pthreads manage both parallelization and synchronization, STM was designed to handle only synchronization. We show that an improved cache efficiency does not necessarily lead to a better execution time, because the execution time is dominated by the thread wait time at the barriers.
BibTeX:
@misc{emcl-preprint-2012-01,
  author = {Janko, Sven and Rocker, Bjoern and Schindewolf, Martin and Heuveline, Vincent and Karl, Wolfgang},
  title = {Software Transactional Memory, OpenMP and Pthread implementations of the Conjugate Gradients Method - a Preliminary Evaluation},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {01},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-01.pdf}
}
Barz, D.P.J., Bockelmann, H. and Heuveline, V.Electrokinetic optimization of a micromixer for lab-on-chip applications 2012EMCL Preprint Series(02)EMCL Preprint Series miscURL 
Abstract: This paper is concerned with the optimization of an electrokinetic micromixer suitable for Lab-on-Chip and other microfluidic applications. The mixing concept is based on the combination of an alternating electrical excitation applied to a pressure-driven base flow in a meandering microchannel geometry. The electrical excitation induces a secondary electrokinetic velocity component which results in a complex flow field within the meander bends. A mathematical model describing the physicochemical phenomena present within the micromixer is implemented in an in-house Finite-Element-Method code. We first perform simulations comparable to experiments concerned with the investigation of the flow field in the bends. The comparison of simulation and experiment reveals excellent agreement. Hence, the validated model and numerical schemes are employed for a numerical optimization of the micromixer performance. In detail, we optimize the secondary electrokinetic flow by finding the best electrical excitation parameters, i.e. frequency and amplitude, for a given waveform. The simulation results of two optimized electrical excitations featuring a discrete and a continuous waveform are compared and discussed. The results demonstrate that the micromixer is able to achieve high mixing degrees very rapidly.
BibTeX:
@misc{emcl-preprint-2012-02,
  author = {Barz, Dominik~P.~J. and Bockelmann, Hendryk and Heuveline, Vincent},
  title = {Electrokinetic optimization of a micromixer for lab-on-chip applications},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {02},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-02.pdf}
}
Heuveline, V. and Chandramowli, S.The Coffee-table Book of Pseudospectra 2012EMCL Preprint Series(03)EMCL Preprint Series miscURL 
BibTeX:
@misc{emcl-preprint-2012-03,
  author = {Heuveline, Vincent and Chandramowli, Subramanian},
  title = {The Coffee-table Book of Pseudospectra},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {03},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-03.pdf}
}
Anzt, H., Beglarian, A., Chilingaryan, S., Ferrone, A., Heuveline, V. and Kopmann, A.A unified Energy Footprint for Simulation Software 2012EMCL Preprint Series(04)EMCL Preprint Series miscURL 
Abstract: The focus in High-Performance Computing increasingly turns to energy efficiency. Therefore the pure concentration on floating point operations and runtime performance is no longer sufficient. In terms of hardware, this change of paradigm has already taken place: The GREEN500 list as counterpart to the runtime performance oriented TOP500 list has been established. The new metrics take runtime and energy consumption into account. Nevertheless, all these developments consider hardware only - still an inadequate situation to face the challenges of Energy-Efficient Exascale Computing. The necessity of optimizing simulation software with respect to power and energy draft demands detailed profiling of the power consumption during the calculations and a norm quantifying the respective efficiency. In this paper we propose a unified energy footprint for simulation software that enables a fast comparison between different models, implementations and hardware configurations, respectively. By way of example we provide the footprints for the tomographic reconstruction code PyHST optimized for CPU and GPU operation as well as the operational numerical weather prediction model COSMO. We then discuss the power and energy profiles and investigate the effects of scaling with respect to hardware resources and simulation parameters.
BibTeX:
@misc{emcl-preprint-2012-04,
  author = {Anzt, Hartwig and Beglarian, Armen and Chilingaryan, Suren and Ferrone, Andrew and Heuveline, Vincent and Kopmann, Andreas},
  title = {A unified Energy Footprint for Simulation Software},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {04},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-04.pdf}
}
Heuveline, V., Ketelaer, E., Ronnas, S., Schmidtobreick, M. and Wlotzka, M.Scalability Study of HiFlow3 based on a Fluid Flow Channel Benchmark 2012EMCL Preprint Series(05)EMCL Preprint Series miscURL 
Abstract: Exploiting the compute power of high performance computing clusters efficiently is a key ingredient in order to solve large, fully coupled systems modeled by partial differential equations with high accuracy. We study strong and weak scalability properties of the parallel Finite Element software package HiFlow3 for a challenging instationary 3D fluid flow problem. For this benchmark study, we ran several simulations with up to 10 million unknowns using up to 512 cores on the bwGRiD cluster in Karlsruhe. For large problem sizes, the software package showed good characteristics regarding efficiency and speedup.
BibTeX:
@misc{emcl-preprint-2012-05,
  author = {Heuveline, Vincent and Ketelaer, Eva and Ronnas, Staffan and Schmidtobreick, Mareike and Wlotzka, Martin},
  title = {Scalability Study of HiFlow3 based on a Fluid Flow Channel Benchmark},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {05},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-05.pdf}
}
Henn, T., Heuveline, V., Krause, M.J. and Ritterbusch, S.Aortic Coarctation simulation based on the Lattice Boltzmann method: benchmark results 2012EMCL Preprint Series(06)EMCL Preprint Series miscURL 
Abstract: We investigate a patient-specific blood flow simulation through a transverse aortic arch with a moderate thoracic aortic coarctation, where particular attention is paid to the blood pressure gradient through the coarctation. The challenge in this context is the complex geometry containing a stenosis, which results in complex flow patterns. The fluid is assumed to be incompressible and Newtonian. Its dynamics are usually described by a Navier-Stokes equation with appropriate boundary conditions. Instead, we modeled the problem mesoscopically by a family of BGK-Boltzmann equations whose solutions reach those of a corresponding Navier-Stokes system in a certain limit. For discretization we take advantage of lattice Boltzmann methods, which are realized within the open-source library OpenLB. A realistic transient flow profile of the cardiac output for a human at rest was used to specify the inflow boundary condition at the aortic root, whereas the outflow at the descending aorta was modeled by a pressure boundary condition. A short introduction to lattice Boltzmann methods is provided, and the boundary conditions used are introduced in detail. The exact simulation setup is stated and the obtained results are discussed.
BibTeX:
@misc{emcl-preprint-2012-06,
  author = {Henn, Thomas and Heuveline, Vincent and Krause, Mathias J. and Ritterbusch, Sebastian},
  title = {Aortic Coarctation simulation based on the Lattice Boltzmann method: benchmark results},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {06},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-06.pdf}
}
Helfrich-Schkarbanenko, A., Heuveline, V., Reiner, R. and Ritterbusch, S.Bandwidth-Efficient Parallel Visualization for Mobile Devices 2012EMCL Preprint Series(07)EMCL Preprint Series miscURL 
Abstract: For visual analysis of large numerical simulations on mobile devices, we introduce a remote parallelizable visualization method for low-bandwidth and high-latency networks. Based on a mathematical model for multi-layered planar impostor representation of arbitrarily complex and unbounded scenes, we derive optimal impostor placement from a derived metric. Using stochastic usage models, we prove the optimal bandwidth consumption order for choosing corresponding viewport impostor sets, leading to bandwidth-efficient remote visualization concepts for high performance computing simulation results.
BibTeX:
@misc{emcl-preprint-2012-07,
  author = {Helfrich-Schkarbanenko, Andreas and Heuveline, Vincent and Reiner, Roman and Ritterbusch, Sebastian},
  title = {Bandwidth-Efficient Parallel Visualization for Mobile Devices},
  journal = {EMCL Preprint Series},
  year = {2012},
  number = {07},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2012-07.pdf}
}
Anzt, H., Beglarian, A., Chilingaryan, S., Ferrone, A., Heuveline, V. and Kopmann, A.A unified Energy Footprint for Simulation Software 2012Computer Science - Research and Development articleDOI  
BibTeX:
@article{ENAHPC2012,
  author = {Anzt, Hartwig and Beglarian, Armen and Chilingaryan, Suren and Ferrone, Andrew and Heuveline, Vincent and Kopmann, Andreas},
  title = {A unified Energy Footprint for Simulation Software},
  journal = {Computer Science - Research and Development},
  publisher = {Springer Berlin / Heidelberg},
  year = {2012},
  doi = {http://dx.doi.org/10.1007/s00450-012-0225-1}
}
Anzt, H., Luszczek, P., Dongarra, J. and Heuveline, V.GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement 2012Proceedings of Euro-Par 2012
Vol. LNCS 7484, pp. 908-920 
article 
BibTeX:
@article{europar,
  author = {Hartwig Anzt and Piotr Luszczek and Jack Dongarra and Vincent Heuveline},
  title = {GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement},
  journal = {Proceedings of Euro-Par 2012},
  year = {2012},
  volume = {LNCS 7484},
  pages = {908--920}
}
Fietz, J., Krause, M.J., Schulz, C., Sanders, P. and Heuveline, V.Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries 2012Euro-Par 2012 Parallel Processing incollection 
BibTeX:
@incollection{fietz:12,
  author = {Fietz, J. and Krause, M. J. and Schulz, C. and Sanders, P. and Heuveline, V.},
  title = {Optimized Hybrid Parallel Lattice Boltzmann Fluid Flow Simulations on Complex Geometries},
  booktitle = {Euro-Par 2012 Parallel Processing},
  publisher = {Springer Berlin / Heidelberg},
  year = {2012},
  note = {submitted}
}
Hahn, T., Heuveline, V. and Hubbuch, J.ChromX – A Powerful and User-Friendly Software Package for Modeling Liquid Chromatography Processes 2012Chemie Ingenieur Technik
Vol. 84(8) 
article 
BibTeX:
@article{hahnELSinverse,
  author = {Tobias Hahn and Vincent Heuveline and Jürgen Hubbuch},
  title = {ChromX – A Powerful and User-Friendly Software Package for Modeling Liquid Chromatography Processes},
  journal = {Chemie Ingenieur Technik},
  year = {2012},
  volume = {84(8)}
}
Ritterbusch, S., Constantinescu, A. and Koch, V.Hapto-acoustic Scene Representation 2012
Vol. 7383Computers Helping People with Special Needs, pp. 644-650 
incollectionDOIURL 
BibTeX:
@incollection{haptoritterbusch,
  author = {Ritterbusch, Sebastian and Constantinescu, Angela and Koch, Volker},
  title = {Hapto-acoustic Scene Representation},
  booktitle = {Computers Helping People with Special Needs},
  publisher = {Springer Berlin Heidelberg},
  year = {2012},
  volume = {7383},
  pages = {644-650},
  url = {http://dx.doi.org/10.1007/978-3-642-31534-3_94},
  doi = {http://dx.doi.org/10.1007/978-3-642-31534-3_94}
}
Anzt, H., Tomov, S., Dongarra, J. and Heuveline, V.A Block-Asynchronous Relaxation Method for Graphics Processing Units 2012IPDPS Workshops inproceedings 
BibTeX:
@inproceedings{hcw,
  author = {Hartwig Anzt and Stanimire Tomov and Jack Dongarra and Vincent Heuveline},
  title = {A Block-Asynchronous Relaxation Method for Graphics Processing Units},
  booktitle = {IPDPS Workshops},
  year = {2012}
}
Anzt, H., Tomov, S., Dongarra, J. and Heuveline, V.Weighted Block-Asynchronous Iteration on GPU-Accelerated Systems 2012Euro-Par Parallel Processing Workshops - HeteroPar inproceedings 
BibTeX:
@inproceedings{heteropar,
  author = {Anzt, Hartwig and Tomov, Stanimire and Dongarra, Jack and Heuveline, Vincent},
  title = {Weighted Block-Asynchronous Iteration on GPU-Accelerated Systems},
  booktitle = {Euro-Par Parallel Processing Workshops - HeteroPar},
  year = {2012}
}
Anzt, H., Augustin, W., Baumann, M., Gengenbach, T., Hahn, T., Helfrich-Schkarbanenko, A., Heuveline, V., Ketelaer, E., Lukarski, D., Nestler, A., Ritterbusch, S., Ronnas, S., Schick, M., Schmidtobreick, M., Subramanian, C., Weiss, J.-P., Wilhelm, F. and Wlotzka, M.HiFlow3: A Hardware-Aware Parallel Finite Element Package 2012Tools for High Performance Computing 2011, pp. 139-151 incollectionDOIURL 
BibTeX:
@incollection{hiflowtools,
  author = {Anzt, H. and Augustin, W. and Baumann, M. and Gengenbach, T. and Hahn, T. and Helfrich-Schkarbanenko, A. and Heuveline, V. and Ketelaer, E. and Lukarski, D. and Nestler, A. and Ritterbusch, S. and Ronnas, S. and Schick, M. and Schmidtobreick, M. and Subramanian, C. and Weiss, J.-P. and Wilhelm, F. and Wlotzka, M.},
  title = {HiFlow3: A Hardware-Aware Parallel Finite Element Package},
  booktitle = {Tools for High Performance Computing 2011},
  publisher = {Springer Berlin Heidelberg},
  year = {2012},
  pages = {139-151},
  url = {http://dx.doi.org/10.1007/978-3-642-31476-6_12},
  doi = {http://dx.doi.org/10.1007/978-3-642-31476-6_12}
}
Helfrich-Schkarbanenko, A., Heuveline, V., Reiner, R. and Ritterbusch, S.Bandwidth-Efficient Parallel Visualization for Mobile Devices 2012The Second International Conference on Advanced Communications and Computation, pp. 106-112 inproceedings 
BibTeX:
@inproceedings{INFOCOMP2012,
  author = {Helfrich-Schkarbanenko, Andreas and Heuveline, Vincent and Reiner, Roman and Ritterbusch, Sebastian},
  title = {Bandwidth-Efficient Parallel Visualization for Mobile Devices},
  booktitle = {The Second International Conference on Advanced Communications and Computation},
  publisher = {IARIA},
  year = {2012},
  pages = {106-112}
}
Krause, M., Thäter, G. and Heuveline, V.Adjoint-based Fluid Flow Control and Optimisation with Lattice Boltzmann Methods 2012Computers & Mathematics with Applications article 
BibTeX:
@article{krause:12a,
  author = {Krause, M.J. and Gudrun Thäter and Heuveline, V.},
  title = {Adjoint-based Fluid Flow Control and Optimisation with Lattice Boltzmann Methods},
  journal = {Computers \& Mathematics with Applications},
  year = {2012},
  note = {submitted}
}
Krause, M. and Heuveline, V.Parallel Fluid Flow Control and Optimisation with Lattice Boltzmann Methods and Automatic Differentiation 2012Computers and Fluids article 
BibTeX:
@article{krause:12b,
  author = {Krause, M.J. and Heuveline, V.},
  title = {Parallel Fluid Flow Control and Optimisation with Lattice Boltzmann Methods and Automatic Differentiation},
  journal = {Computers and Fluids},
  year = {2012},
  note = {submitted}
}
Lukarski, D.Parallel Sparse Linear Algebra for Multi-core and Many-core Platforms -- Parallel Solvers and Preconditioners 2012School: Karlsruhe Institute of Technology phdthesisURL 
BibTeX:
@phdthesis{Lukarski2012,
  author = {Lukarski, Dimitar},
  title = {Parallel Sparse Linear Algebra for Multi-core and Many-core Platforms -- Parallel Solvers and Preconditioners},
  school = {Karlsruhe Institute of Technology},
  year = {2012},
  url = {http://digbib.ubka.uni-karlsruhe.de/volltexte/1000026568}
}
Ronnas, S., Gengenbach, T., Ketelaer, E. and Heuveline, V.Design and Implementation of Distributed Meshes in HiFlow3 2012Competence in High Performance Computing 2010, pp. 61-71 incollectionURL 
BibTeX:
@incollection{Ronnas2011,
  author = {Ronnas, Staffan and Gengenbach, Thomas and Ketelaer, Eva and Heuveline, Vincent},
  title = {Design and Implementation of Distributed Meshes in HiFlow3},
  booktitle = {Competence in High Performance Computing 2010},
  publisher = {Springer Berlin Heidelberg},
  year = {2012},
  pages = {61-71},
  note = {10.1007/978-3-642-24025-6_6},
  url = {http://dx.doi.org/10.1007/978-3-642-24025-6_6}
}
Wilhelm, F.Parallel Preconditioners for an Ocean Model in Climate Simulations 2012School: Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT) phdthesisURL 
BibTeX:
@phdthesis{SWB-362199361,
  author = {Wilhelm, Florian},
  title = {Parallel Preconditioners for an Ocean Model in Climate Simulations},
  school = {Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT)},
  year = {2012},
  url = {http://digbib.ubka.uni-karlsruhe.de/volltexte/1000027114}
}
Heuveline, V., Lukarski, D., Trost, N. and Weiss, J.-P.Parallel Smoothers for Matrix-based Multigrid Methods on Locally Refined Meshes Using Multicore CPUs and GPUs 2012
Vol. 7174Facing the Multicore-Challenge II, pp. 158-171 
incollection 
BibTeX:
@incollection{Trost2011,
  author = {Heuveline, Vincent and Lukarski, Dimitar and Trost, Nico and Weiss, Jan-Philipp},
  title = {Parallel Smoothers for Matrix-based Multigrid Methods on Locally Refined Meshes Using Multicore CPUs and GPUs},
  booktitle = {Facing the Multicore-Challenge II},
  publisher = {Springer},
  year = {2012},
  volume = {7174},
  pages = {158--171}
}
Anzt, H., Aliaga, J., Castillo, M., Fernández, J.C., Heuveline, V., Mayo, R. and Quintana-Ortí, E.S.Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms 2011Proceedings of The First International Workshop on Power Measurement and Profiling, PMP 2011Proceedings of The First International Workshop on Power Measurement and Profiling, PMP 2011 misc 
Abstract: Energy efficiency is a major concern in modern high-performance computing. Still, few studies provide a deep insight into the power consumption of scientific applications. Especially for algorithms running on hybrid platforms equipped with hardware accelerators, like graphics processors, a detailed energy analysis is essential to identify the most costly parts, and to evaluate possible improvement strategies. In this paper we analyze the computational and power performance of iterative linear solvers applied to sparse systems arising in several scientific applications. We also study the gains yielded by dynamic voltage/frequency scaling (DVFS), and illustrate that this technique alone cannot reduce the energy cost by a considerable amount for iterative linear solvers. We then apply techniques that set the (multi-core processor in the) host system to a low-consuming state for the time that the GPU is executing. Our experiments conclusively reveal how the combination of these two techniques delivers a notable reduction of energy consumption without a noticeable impact on computational performance.
BibTeX:
@misc{AnQuHe_PMP11,
  author = {Anzt, Hartwig and Aliaga, José and Castillo, Maribel and Fernández, Juan C. and Heuveline, Vincent and Mayo, Rafael and Quintana-Ortí, Enrique S.},
  title = {Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms},
  journal = {Proceedings of The First International Workshop on Power Measurement and Profiling, PMP 2011},
  year = {2011}
}
Augustin, W., Heuveline, V. and Weiss, J.-P.Convey HC-1 Hybrid Core Computer – The Potential of FPGAs in Numerical Simulation 20112nd Int. Workshop on New Frontiers in High-performance and Hardware-aware Computing (HipHaC’11), pp. 1-8 inproceedings 
BibTeX:
@inproceedings{Augustin2011,
  author = {Augustin, W. and Heuveline, V. and Weiss, J.-P.},
  title = {Convey HC-1 Hybrid Core Computer – The Potential of FPGAs in Numerical Simulation},
  booktitle = {2nd Int. Workshop on New Frontiers in High-performance and Hardware-aware Computing (HipHaC’11)},
  publisher = {KIT Scientific Publishing},
  year = {2011},
  pages = {1--8}
}
Baumann, M.Numerical Simulation of Tropical Cyclones using Goal-Oriented Adaptivity 2011School: Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT) phdthesisURL 
BibTeX:
@phdthesis{baumann2011,
  author = {Baumann, Martin},
  title = {Numerical Simulation of Tropical Cyclones using Goal-Oriented Adaptivity},
  school = {Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT)},
  year = {2011},
  url = {http://digbib.ubka.uni-karlsruhe.de/volltexte/1000025974}
}
Rocker, B.Hardware-aware Solvers for Large, Sparse Linear Systems - Parallel and Multi-precision Approaches 2011School: Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT) phdthesis 
BibTeX:
@phdthesis{bjoern-rocker,
  author = {Rocker, B.},
  title = {Hardware-aware Solvers for Large, Sparse Linear Systems - Parallel and Multi-precision Approaches},
  school = {Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT)},
  year = {2011}
}
Heuveline, V., Gierth, W.-D., Hauenstein, T., Helfrich-Schkarbanenko, A., Ritterbusch, S., Schilling, V., Watzke, M. and Wlotzka, M.Virtueller Flug durch Karlsruhe 2011Car Culture: Medien der Mobilität incollection 
BibTeX:
@incollection{CarCulture2011,
  author = {Heuveline, Vincent and Gierth, Wolf-Dietrich and Hauenstein, Thomas and Helfrich-Schkarbanenko, Andreas and Ritterbusch, Sebastian and Schilling, Veronica and Watzke, Michael and Wlotzka, Martin},
  title = {Virtueller Flug durch Karlsruhe},
  booktitle = {Car Culture: Medien der Mobilität},
  publisher = {ZKM Zentrum für Kunst und Medientechnologie Karlsruhe},
  year = {2011}
}
Helfrich-Schkarbanenko, A., Mitschele, M., Ritterbusch, S. and Heuveline, V.Linking the Dimensions 2011Proceedings of the 2011 COMSOL Conference in Stuttgart inproceedingsURL 
BibTeX:
@inproceedings{COMSOL2011,
  author = {Helfrich-Schkarbanenko, Andreas and Mitschele, Marc and Ritterbusch, Sebastian and Heuveline, Vincent},
  title = {Linking the Dimensions},
  booktitle = {Proceedings of the 2011 COMSOL Conference in Stuttgart},
  publisher = {COMSOL},
  year = {2011},
  url = {http://www.comsol.de/cd/direct/conf/mechspotlight/papers/11824/index.html}
}
Anzt, H., Heuveline, V., Rocker, B., Castillo, M., Fernández, J.C., Mayo, R. and Quintana-Ortí, E.S.Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems 2011IPDPS Workshops, pp. 829-836 inproceedings 
BibTeX:
@inproceedings{DBLP:conf-ipps-AnztHRCFMQ11,
  author = {Hartwig Anzt and Vincent Heuveline and Björn Rocker and Maribel Castillo and Juan C. Fernández and Rafael Mayo and Enrique S. Quintana-Ortí},
  title = {Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems},
  booktitle = {IPDPS Workshops},
  year = {2011},
  pages = {829-836}
}
Anzt, H., Castillo, M., Fernández, J.C., Heuveline, V., Mayo, R., Quintana-Ortí, E.S. and Rocker, B.Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems 2011EMCL Preprint Series(01)EMCL Preprint Series miscURL 
Abstract: This paper presents a detailed analysis of a mixed precision iterative refinement solver applied to a linear system obtained from the 2D discretization of a fluid flow problem. The total execution time and energy need of different soft- and hardware-implementations are measured and compared with those of a plain GMRES-based solver in double precision. The time and energy consumption of individual parts of the algorithm are monitored as well, enabling a deeper insight and the possibility of optimizing the energy consumption of the code on a general-purpose multi-core architecture and systems accelerated by a graphics processor.
BibTeX:
@misc{emcl-preprint-2011-01,
  author = {Anzt, H. and Castillo, M. and Fernández, J. C. and Heuveline, V. and Mayo, R. and Quintana-Ortí, E. S. and Rocker, B.},
  title = {Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {01},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-01.pdf}
}
Adamidis, P., Heuveline, V. and Wilhelm, F.A High-Efficient Scalable Solver for the Global Ocean/Sea-Ice Model MPIOM 2011EMCL Preprint Series(02)EMCL Preprint Series miscURL 
Abstract: This paper presents the work of the "Scalable-Earth-System-Models for high productivity climate simulations" project in improving the linear solver of the barotropic sub-system in the global ocean/sea-ice model MPIOM developed by the Max-Planck-Institute in Hamburg, Germany. We demonstrate the implementation of the conjugate gradient method and an incomplete Cholesky preconditioner with fill-in p in order to achieve high scalability and performance on an IBM POWER6 based supercomputer. Benchmarks of the new solver compared to the traditionally used Successive-Over-Relaxation-Method are given and analyzed with respect to the number of iterations and runtime.
BibTeX:
@misc{emcl-preprint-2011-02,
  author = {Adamidis, Panagiotis and Heuveline, Vincent and Wilhelm, Florian},
  title = {A High-Efficient Scalable Solver for the Global Ocean/Sea-Ice Model MPIOM},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {02},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-02.pdf}
}
Heuveline, V. and Schick, M.Towards a hybrid numerical method using Generalized Polynomial Chaos for Stochastic Differential Equations 2011EMCL Preprint Series(03)EMCL Preprint Series miscURL 
Abstract: Generalized polynomial chaos (gPC) is known to fail for problems involving strong nonlinear dependencies on stochastic inputs, especially arising in the context of long term integration. The reason for this is that gPC is a time-independent projection method, not able to capture a dynamic behavior of probability distributions. Recent developments in addressing this problem are represented by decomposing the random space or employing discrete time-dependent basis functionals, both exhibiting promising results but also introducing increasing computational costs. This work focuses on a numerical analysis of these two approaches as well as their hybrid combination with regard to a simple ODE decay problem subject to a uniformly as well as a Gaussian distributed random input. It is observed that depending on the initial probability distribution strong differences occur with respect to the error developments, which efficiently can be reduced when employing local discrete time-dependent basis functionals.
BibTeX:
@misc{emcl-preprint-2011-03,
  author = {Heuveline, Vincent and Schick, Michael},
  title = {Towards a hybrid numerical method using Generalized Polynomial Chaos for Stochastic Differential Equations},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {03},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-03.pdf}
}
Heuveline, V. and Schick, M.A local time-dependent Generalized Polynomial Chaos method for stochastic dynamical systems 2011EMCL Preprint Series(04)EMCL Preprint Series miscURL 
Abstract: Generalized Polynomial Chaos (gPC) is known to fail for problems involving strong nonlinear dependencies on stochastic inputs, which especially arise in the context of long term integration or stochastic discontinuities. There are various attempts in the literature which address these difficulties, such as the time-dependent generalized Polynomial Chaos (TD-gPC) and the multi-element generalized Polynomial Chaos (ME-gPC) both leading to higher accuracies but higher numerical costs in comparison to the classical gPC approach. A combination of these methods is introduced, which leads to a powerful solution method since high accuracies can be maintained and computational cost can be distributed by utilizing parallel computation. However, to be able to apply the hybrid method to all types of ordinary differential equations subject to random inputs, new modifications with respect to TD-gPC are carried out by creating an orthogonal tensor basis consisting of the random input variable as well as the solution itself. Such modifications allow TD-gPC to capture the dynamics of the solution by increasing the approximation quality of its time derivatives.
BibTeX:
@misc{emcl-preprint-2011-04,
  author = {Heuveline, Vincent and Schick, Michael},
  title = {A local time--dependent Generalized Polynomial Chaos method for stochastic dynamical systems},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {04},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-04.pdf}
}
Anzt, H., Aliaga, J., Castillo, M., Fernández, J.C., Heuveline, V., Mayo, R. and Quintana-Ortí, E.S.Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms 2011EMCL Preprint Series(05)EMCL Preprint Series miscURL 
Abstract: Energy efficiency is a major concern in modern high-performance computing. Still, few studies provide a deep insight into the power consumption of scientific applications. Especially for algorithms running on hybrid platforms equipped with hardware accelerators, like graphics processors, a detailed energy analysis is essential to identify the most costly parts, and to evaluate possible improvement strategies. In this paper we analyze the computational and power performance of iterative linear solvers applied to sparse systems arising in several scientific applications. We also study the gains yielded by dynamic voltage/frequency scaling (DVFS), and illustrate that this technique alone cannot reduce the energy cost by a considerable amount for iterative linear solvers. We then apply techniques that set the (multi-core processor in the) host system to a low-consuming state for the time that the GPU is executing. Our experiments conclusively reveal how the combination of these two techniques delivers a notable reduction of energy consumption without a noticeable impact on computational performance.
BibTeX:
@misc{emcl-preprint-2011-05,
  author = {Anzt, Hartwig and Aliaga, José and Castillo, Maribel and Fernández, Juan C. and Heuveline, Vincent and Mayo, Rafael and Quintana-Ortí, Enrique S.},
  title = {Analysis and Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Multi-core and Many-core Platforms},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {05},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-05.pdf}
}
Anzt, H., Castillo, M., Fernández, J.C., Heuveline, V., Igual, F.D., Mayo, R. and Quintana-Ortí, E.S.Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Graphics Processors 2011EMCL Preprint Series(06)EMCL Preprint Series miscURL 
Abstract: In this paper, we analyze the power consumption of different GPU-accelerated iterative solver implementations enhanced with energy-saving techniques. Specifically, while conducting kernel calls on the graphics accelerator, we manually set the host system to a power-efficient idle-wait status so as to leverage dynamic voltage and frequency control. While the usage of iterative refinement combined with mixed precision arithmetic often improves the execution time of an iterative solver on a graphics processor, this may not necessarily be true for the power consumption as well. To analyze the trade-off between computation time and power consumption we compare a plain GMRES solver and its preconditioned variant to the mixed-precision iterative refinement implementations based on the respective solvers. Benchmark experiments conclusively reveal how the usage of idle-wait during GPU-kernel calls effectively leverages the power-tools provided by hardware, and improves the energy performance of the algorithm.
BibTeX:
@misc{emcl-preprint-2011-06,
  author = {Anzt, Hartwig and Castillo, Maribel and Fernández, Juan C. and Heuveline, Vincent and Igual, Francisco D. and Mayo, Rafael and Quintana-Ortí, Enrique S.},
  title = {Optimization of Power Consumption in the Iterative Solution of Sparse Linear Systems on Graphics Processors},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {06},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-06.pdf}
}
Gengenbach, T., Heuveline, V., Krause, M.J., Mayer, R. and Zimny, S.A Preprocessing Approach for Innovative Patient-specific Intranasal Flow Simulations 2011EMCL Preprint Series(07)EMCL Preprint Series miscURL 
Abstract: A preprocessing approach which enables numerical simulations of patient-individual respiration flows based on computer tomography (CT) scans is presented. The challenge of an adequate 3D reconstruction and preparation of geometries as highly complex as the human inner nose or lung, which cannot be resolved in all their details by currently available CT scanners, is met by a holistic concept. In it, the preprocessing is understood as one part of the full numerical simulation where preprocessing, numerical simulation and optimization as well as postprocessing techniques are applied in an iteratively coupled fashion. In the proposed approach, lattice Boltzmann methods are chosen as the discretization strategy in order to simulate the airflows. Since they impose the conditions for the preparation process of the geometries, this choice is of crucial importance for the preprocessing techniques which are to be applied. For the actual preprocessing steps, it is proposed to take advantage of techniques which are implemented in the framework of Materialise's software packages Mimics and 3-matics. In this paper, the main emphasis is placed on illustrating the preprocessing approach as part of the proposed full numerical simulation concept. Therefore, the whole chain of necessary processes, from the segmentation over the surface generation, the actual volume mesh generation, the numerical simulation and the validation up to the visualization of the results, is demonstrated by means of an example, namely the full numerical simulation of an expiration in an inner human nose. The researched geometry belongs to a patient with a severe peripheral obstructive ventilation disorder. Based on the obtained numerical results, possible causes and consequences are discussed; in particular, a stenosis is located.
BibTeX:
@misc{emcl-preprint-2011-07,
  author = {Gengenbach, Thomas and Heuveline, Vincent and Krause, Mathias J. and Mayer, Rolf and Zimny, Simon},
  title = {A Preprocessing Approach for Innovative Patient-specific Intranasal Flow Simulations},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {07},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-07.pdf}
}
Heuveline, V., Lukarski, D. and Weiss, J.-P.Enhanced Parallel ILU(p)-based Preconditioners for Multi-core CPUs and GPUs -- The Power(q)-pattern Method 2011EMCL Preprint Series(08)EMCL Preprint Series miscURL 
Abstract:

Application demands and grand challenges in numerical simulation require both highly capable computing platforms and efficient numerical solution schemes. Power constraints and further miniaturization of modern and future hardware give way to multi- and manycore processors with increasing fine-grained parallelism and deeply nested hierarchical memory systems -- as already exemplified by recent graphics processing units. Accordingly, numerical schemes need to be adapted and re-engineered in order to deliver scalable solutions across diverse processor configurations. Portability of parallel software solutions across emerging hardware platforms is another challenge.

This work investigates multi-coloring and re-ordering schemes for block Gauß-Seidel methods and, in particular, for incomplete LU factorizations with and without fill-ins. We consider two matrix re-ordering schemes that deliver flexible and efficient parallel preconditioners. The general idea is to generate block decompositions of the system matrix such that the diagonal blocks are themselves diagonal. In such a way, parallelism can be exploited on the block-level in a scalable manner. Our goal is to provide widely applicable, out-of-the-box preconditioners that can be used in the context of finite element solvers.

We propose a new method for anticipating the fill-in pattern of ILU($p$) schemes which we call the power($q$)-pattern method. This method is based on an incomplete factorization of the system matrix $A$ subject to a predetermined pattern given by the matrix power $|A|^{p+1}$ and its associated multi-coloring permutation $\pi$. We prove that the obtained sparsity pattern is a superset of our modified ILU($p$) factorization applied to $\pi A \pi^{-1}$. As a result, this modified ILU($p$) applied to multi-colored system matrix has no fill-ins in its diagonal blocks. This leads to an inherently parallel execution of triangular ILU($p$) sweeps.

In addition, we describe the integration of the preconditioners into the HiFlow$^3$ open-source finite element package that provides a portable software solution across diverse hardware platforms. On this basis, we conduct performance analysis across a variety of test problems on multi-core CPUs and GPUs that proves efficiency, scalability and flexibility of our approach. Our preconditioners achieve a solver acceleration by a factor of up to 1.5, 8 and 85 for three different test problems. The GPU versions of the preconditioned solver are up to a factor of 4 faster than an OpenMP parallel version on eight cores.

BibTeX:
@misc{emcl-preprint-2011-08,
  author = {Heuveline, Vincent and Lukarski, Dimitar and Weiss, Jan-Philipp},
  title = {Enhanced Parallel ILU(p)-based Preconditioners for Multi-core CPUs and GPUs -- The Power(q)-pattern Method},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {08},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-08.pdf}
}
Heuveline, V., Lukarski, D., Trost, N. and Weiss, J.-P.Parallel Smoothers for Matrix-based Multigrid Methods on Unstructured Meshes Using Multicore CPUs and GPUs 2011EMCL Preprint Series(09)EMCL Preprint Series miscURL 
Abstract:

Multigrid methods are efficient and fast solvers for problems typically modeled by partial differential equations of elliptic type. For problems with complex geometries and local singularities, stencil-type discrete operators on equidistant Cartesian grids need to be replaced by more flexible concepts for unstructured meshes in order to properly resolve all problem-inherent specifics and to maintain a moderate number of unknowns. However, flexibility in the meshes goes along with severe drawbacks with respect to parallel execution - especially with respect to the definition of adequate smoothers. This point becomes particularly pronounced in the framework of fine-grained parallelism on GPUs with hundreds of execution units. We use the approach of matrix-based multigrid that has high flexibility and adapts well to the exigencies of modern computing platforms.

In this work we investigate multi-colored Gauss-Seidel type smoothers, the power(q)-pattern enhanced multi-colored ILU(p) smoothers with fill-ins, and factorized sparse approximate inverse (FSAI) smoothers. These approaches provide efficient smoothers with a high degree of parallelism. In combination with matrix-based multigrid methods on unstructured meshes our smoothers provide powerful solvers that are applicable across a wide range of parallel computing platforms and almost arbitrary geometries. We describe the configuration of our smoothers in the context of the portable lmpLAtoolbox and the HiFlow3 parallel finite element package. In our approach, a single source code can be used across diverse platforms including multicore CPUs and GPUs. Highly optimized implementations are hidden behind a unified user interface. Efficiency and scalability of our multigrid solvers are demonstrated by means of a comprehensive performance analysis on multicore CPUs and GPUs.

BibTeX:
@misc{emcl-preprint-2011-09,
  author = {Heuveline, Vincent and Lukarski, Dimitar and Trost, Nico and Weiss, Jan-Philipp},
  title = {Parallel Smoothers for Matrix-based Multigrid Methods on Unstructured Meshes Using Multicore CPUs and GPUs},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {09},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-09.pdf}
}
Heuveline, V., Lukarski, D., Oboril, F., Tahoori, M.B. and Weiss, J.-P.Numerical Defect Correction as an Algorithm-Based Fault Tolerance Technique for Iterative Solvers 2011EMCL Preprint Series(10)EMCL Preprint Series miscURL 
Abstract: As hardware devices like processor cores and memory sub-systems based on nano-scale technology nodes become more unreliable, the need for fault tolerant numerical computing engines, as used in many critical applications with long computation/mission times, is becoming pronounced. In this paper, we present an algorithm-based fault tolerant (ABFT) scheme for an iterative linear solver engine based on the Conjugate Gradient method (CG) by taking advantage of numerical defect correction. This method is "pay as you go", meaning that there is only a runtime overhead if errors occur and a correction is performed. Our experimental comparison with software-based triple modular redundancy (TMR) clearly shows the runtime benefit of the proposed approach, better fault tolerance and no occurrence of silent data corruption.
BibTeX:
@misc{emcl-preprint-2011-10,
  author = {Heuveline, Vincent and Lukarski, Dimitar and Oboril, F. and Tahoori, Mehdi B. and Weiss, Jan-Philipp},
  title = {Numerical Defect Correction as an Algorithm-Based Fault Tolerance Technique for Iterative Solvers},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {10},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-10.pdf}
}
Gengenbach, T., Heuveline, V. and Krause, M.J.Numerical Simulation of the Human Lung: A Two-scale Approach 2011EMCL Preprint Series(11)EMCL Preprint Series miscURL 
Abstract: In this paper we introduce a two-scale model for the numerical simulation of the human lung. The airflow in the upper, resolvable part of the human lungs is modelled by the Navier-Stokes equations, which couple to a dyadic bronchiole tree model through boundary conditions. This model depends mainly on the generation, where the inlet is located, and the radius of the bronchiole at this generation. In the bronchioles a linear flow is assumed, hence the Hagen-Poiseuille formula can be applied. The pressure at the alveoli and the starting resistance for the dyadic tree are used as input parameters. To illustrate the approach, we simulate a patient-specific human lung geometry with a Lattice-Boltzmann method and show qualitative convergence results for the pressure drop as well as a comparison with Neumann boundary conditions that are commonly used in one-scale models. Results for an overall flux of Q=150 ml/s, which corresponds to a Reynolds number of R=1650, are presented.
BibTeX:
@misc{emcl-preprint-2011-11,
  author = {Gengenbach, Thomas and Heuveline, Vincent and Krause, Mathias J.},
  title = {Numerical Simulation of the Human Lung: A Two--scale Approach},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {11},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-11.pdf}
}
Heuveline, V. and Thaeter, G.Proceedings of the 4th EMCL-Workshop "Numerical Simulation, Optimization and High Performance Computing" 2011EMCL Preprint Series(12)EMCL Preprint Series miscURL 
BibTeX:
@misc{emcl-preprint-2011-12,
  author = {Heuveline, Vincent and Thaeter, Gudrun},
  title = {Proceedings of the 4th EMCL-Workshop ''Numerical Simulation, Optimization and High Performance Computing''},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {12},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-12.pdf}
}
Heuveline, V., Karl, W., Nowak, F., Schmidtobreick, M. and Wilhelm, F.Employing a High-Level Language for Porting Numerical Applications to Reconfigurable Hardware 2011EMCL Preprint Series(13)EMCL Preprint Series miscURL 
Abstract: The deployment of FPGAs has become more and more common over the last years. Many applications have since then been accelerated by porting advantageous parts onto FPGA hardware. High-level, C-like programming languages and advanced tools such as Impulse CoDeveloper that produce hardware descriptions can potentially help with this task. We showcase the applicability of this new approach to FPGA acceleration in terms of solving the Poisson equation with the conjugate gradient (CG) method and a red-black symmetric successive over-relaxation (SSOR) preconditioner as a model problem. In this case, the CPU executes the CG method while an FPGA takes over the red-black SSOR preconditioning part. We compare a purely CPU-based algorithm to our FPGA-extended approach in order to evaluate the maturity and applicability of high-level language translators with regard to accelerating numerical applications.
BibTeX:
@misc{emcl-preprint-2011-13,
  author = {Heuveline, Vincent and Karl, Wolfgang and Nowak, Fabian and Schmidtobreick, Mareike and Wilhelm, Florian},
  title = {Employing a High-Level Language for Porting Numerical Applications to Reconfigurable Hardware},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {13},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-13.pdf}
}
Anzt, H., Dongarra, J., Heuveline, V. and Tomov, S.A Block-Asynchronous Relaxation Method for Graphics Processing Units 2011EMCL Preprint Series(14)EMCL Preprint Series miscURL 
Abstract: In this paper, we analyze the potential of asynchronous relaxation methods on Graphics Processing Units (GPUs). For this purpose, we developed a set of asynchronous iteration algorithms in CUDA and compared them with a parallel implementation of synchronous relaxation methods on CPU-based systems. For a set of test matrices taken from the University of Florida Matrix Collection we monitor the convergence behavior, the average iteration time and the total time-to-solution. Analyzing the results, we observe that even for our most basic asynchronous relaxation scheme, despite its lower convergence rate compared to the Gauss-Seidel relaxation (that we expected), the asynchronous iteration running on GPUs is still able to provide solution approximations of certain accuracy in considerably shorter time than Gauss-Seidel running on CPUs. Hence, it overcompensates for the slower convergence by exploiting the scalability and the good fit of the asynchronous schemes for the highly parallel GPU architectures. Further, enhancing the most basic asynchronous approach with hybrid schemes -- using multiple iterations within the "subdomain" handled by a GPU thread block and Jacobi-like asynchronous updates across the "boundaries", subject to tuning various parameters -- we manage to not only recover the loss of global convergence but often accelerate convergence by up to a factor of two (compared to the standard but difficult to parallelize Gauss-Seidel type of schemes), while keeping the execution time of a global iteration practically the same. This shows the high potential of the asynchronous methods not only as a stand-alone numerical solver for linear systems of equations fulfilling certain convergence conditions but more importantly as a smoother in multigrid methods. Due to the explosion of parallelism in today's architecture designs, the significance of and the need for asynchronous methods, such as the ones described in this work, are expected to grow.
BibTeX:
@misc{emcl-preprint-2011-14,
  author = {Anzt, Hartwig and Dongarra, Jack and Heuveline, Vincent and Tomov, Stanimire},
  title = {A Block-Asynchronous Relaxation Method for Graphics Processing Units},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {14},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-14.pdf}
}
Anzt, H., Dongarra, J., Gates, M. and Tomov, S.Block-asynchronous multigrid smoothers for GPU-accelerated systems 2011EMCL Preprint Series(15)EMCL Preprint Series miscURL 
Abstract: This paper explores the need for asynchronous iteration algorithms as smoothers in multigrid methods. The hardware target for the new algorithms is top-of-the-line, highly parallel hybrid architectures -- multicore-based systems enhanced with GPGPUs. These architectures are the most likely candidates for future high-end supercomputers. To pave the road for their efficient use, challenges related to the established notion that "data movement, not FLOPS, is the bottleneck to performance" must be resolved. Our work is in this direction -- we designed block-asynchronous multigrid smoothers that perform more flops in order to reduce synchronization, and hence data movement. We show that the extra flops are done for "free", while synchronization is reduced and the convergence properties of multigrid with classical smoothers like Gauss-Seidel are preserved.
BibTeX:
@misc{emcl-preprint-2011-15,
  author = {Anzt, Hartwig and Dongarra, Jack and Gates, Mark and Tomov, Stanimire},
  title = {Block-asynchronous multigrid smoothers for GPU-accelerated systems},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {15},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-15.pdf}
}
Heuveline, V., Ritterbusch, S. and Ronnas, S.Augmented Reality for Urban Simulation Visualization 2011EMCL Preprint Series(16)EMCL Preprint Series miscURL 
Abstract: Visualizations of large simulations are not only computationally intensive but also difficult for the viewer to interpret, due to the huge amount of data to be processed. The case of urban wind flow simulations proves the benefits of mobile Augmented Reality visualizations, both in terms of selection of data relevant to the user and facilitated and comprehensible access to simulation results.
BibTeX:
@misc{emcl-preprint-2011-16,
  author = {Heuveline, Vincent and Ritterbusch, Sebastian and Ronnas, Staffan},
  title = {Augmented Reality for Urban Simulation Visualization},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {16},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-16.pdf}
}
Anzt, H., Dongarra, J., Heuveline, V. and Luszczek, P.GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement 2011EMCL Preprint Series(17)EMCL Preprint Series miscURL 
Abstract: In hardware-aware high performance computing, block-asynchronous iteration and mixed precision iterative refinement are two techniques that may be used to leverage the computing power of SIMD accelerators like GPUs in the iterative solution of linear equation systems. Although they use very different approaches for this purpose, they share the basic idea of compensating for the convergence properties of an inferior numerical algorithm by a more efficient usage of the provided computing power. In this paper, we analyze the potential of combining both techniques. To this end, we derive a mixed precision iterative refinement algorithm using a block-asynchronous iteration as an error correction solver, and compare its performance with a pure implementation of a block-asynchronous iteration and an iterative refinement method using double precision for the error correction solver. For matrices from the University of Florida Matrix Collection, we report the convergence behaviour and provide the total solver runtime using different GPU architectures.
BibTeX:
@misc{emcl-preprint-2011-17,
  author = {Anzt, Hartwig and Dongarra, Jack and Heuveline, Vincent and Luszczek, Piotr},
  title = {GPU-Accelerated Asynchronous Error Correction for Mixed Precision Iterative Refinement},
  journal = {EMCL Preprint Series},
  year = {2011},
  number = {17},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2011-17.pdf}
}
Anzt, H., Castillo, M., Fernández, J., Heuveline, V., Igual, F., Mayo, R. and Quintana-Ortí, E.S.Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors 2011Computer Science - Research and Development, pp. 1-9 articleURL 
Abstract: In this paper, we analyze the power consumption of different GPU-accelerated iterative solver implementations enhanced with energy-saving techniques. Specifically, while conducting kernel calls on the graphics accelerator, we manually set the host system to a power-efficient idle-wait status so as to leverage dynamic voltage and frequency control. While the usage of iterative refinement combined with mixed precision arithmetic often improves the execution time of an iterative solver on a graphics processor, this may not necessarily be true for the power consumption as well. To analyze the trade-off between computation time and power consumption we compare a plain GMRES solver and its preconditioned variant to the mixed-precision iterative refinement implementations based on the respective solvers. Benchmark experiments conclusively reveal how the usage of idle-wait during GPU-kernel calls effectively leverages the power-tools provided by hardware, and improves the energy performance of the algorithm.
BibTeX:
@article{ENAHPC2011,
  author = {Anzt, Hartwig and Castillo, Maribel and Fernández, Juan and Heuveline, Vincent and Igual, Francisco and Mayo, Rafael and Quintana-Ortí, Enrique S.},
  title = {Optimization of power consumption in the iterative solution of sparse linear systems on graphics processors},
  journal = {Computer Science - Research and Development},
  publisher = {Springer Berlin / Heidelberg},
  year = {2011},
  pages = {1-9},
  doi = {10.1007/s00450-011-0195-8},
  url = {http://dx.doi.org/10.1007/s00450-011-0195-8}
}
Anzt, H., Augustin, W., Baumann, M., Gengenbach, T., Hahn, T., Helfrich-Schkarbanenko, A., Heuveline, V., Ketelaer, E., Lukarski, D., Nestler, A., Ritterbusch, S., Rocker, B., Ronnås, S., Schick, M., Schmidtobreick, M., Subramanian, C., Weiss, J.-P., Wilhelm, F. and Wlotzka, M.HiFlow3 – A Multi-Purpose and Flexible Parallel Finite Element Package 2011Open Source CFD International Conference, pp. 1-15 inproceedings 
BibTeX:
@inproceedings{HiFlow3OSCIC2011,
  author = {Anzt, Hartwig and Augustin, Werner and Baumann, Martin and Gengenbach, Thomas and Hahn, Tobias and Helfrich-Schkarbanenko, Andreas and Heuveline, Vincent and Ketelaer, Eva and Lukarski, Dimitar and Nestler, Andrea and Ritterbusch, Sebastian and Rocker, Bjoern and Ronnås, Staffan and Schick, Michael and Schmidtobreick, Mareike and Subramanian, Chandramowli and Weiss, Jan-Philipp and Wilhelm, Florian and Wlotzka, Martin},
  title = {HiFlow3 – A Multi-Purpose and Flexible Parallel Finite Element Package},
  booktitle = {Open Source CFD International Conference},
  year = {2011},
  pages = {1-15},
  note = {OSCIC 2011 Open Source CFD International Conference in Paris Chantilly}
}
Anzt, H., Hahn, T., Heuveline, V. and Rocker, B.GPU Accelerated Scientific Computing: Evaluation of the NVIDIA Fermi Architecture; Elementary Kernels and Linear Solvers 2011Proceedings of HipHaC 2011 inproceedings 
BibTeX:
@inproceedings{hiphac11_anzt_hahn_heuveline_rocker,
  author = {Anzt, Hartwig and Hahn, Tobias and Heuveline, Vincent and Rocker, Björn},
  title = {GPU Accelerated Scientific Computing: Evaluation of the NVIDIA Fermi Architecture; Elementary Kernels and Linear Solvers},
  booktitle = {Proceedings of HipHaC 2011},
  year = {2011}
}
Anzt, H., Heuveline, V., Rocker, B., Castillo, M., Fernández, J.C., Mayo, R. and Quintana-Ortí, E.S.Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems 2011IPDPS Workshops, pp. 829-836 inproceedingsDOI  
BibTeX:
@inproceedings{hppac,
  author = {Hartwig Anzt and Vincent Heuveline and Björn Rocker and Maribel Castillo and Juan C. Fernández and Rafael Mayo and Enrique S. Quintana-Ortí},
  title = {Power Consumption of Mixed Precision in the Iterative Solution of Sparse Linear Systems},
  booktitle = {IPDPS Workshops},
  year = {2011},
  pages = {829-836},
  doi = {http://dx.doi.org/10.1109/IPDPS.2011.226}
}
Heuveline, V., Ritterbusch, S. and Ronnas, S.Augmented Reality for Urban Simulation Visualization 2011The First International Conference on Advanced Communications and Computation, pp. 115-119 inproceedings 
BibTeX:
@inproceedings{INFOCOMP2011,
  author = {Heuveline, Vincent and Ritterbusch, Sebastian and Ronnas, Staffan},
  title = {Augmented Reality for Urban Simulation Visualization},
  booktitle = {The First International Conference on Advanced Communications and Computation},
  publisher = {IARIA},
  year = {2011},
  pages = {115-119}
}
Koch, V., Ritterbusch, S., Kopmann, A., Mueller, M., Habel, T. and von Both, P.Flying Augmented Reality: Supporting planning and simulation analysis by combining mixed reality methods using multicopter and pattern recognition 201129th eCAADe Conference Proceedings, pp. 843-849 conference 
BibTeX:
@conference{kochritterbuschaugmented,
  author = {Koch, Volker and Ritterbusch, Sebastian and Kopmann, Andreas and Mueller, Marius and Habel, Thorsten and von Both, Petra},
  title = {Flying Augmented Reality: Supporting planning and simulation analysis by combining mixed reality methods using multicopter and pattern recognition},
  booktitle = {29th eCAADe Conference Proceedings},
  year = {2011},
  pages = {843--849},
  note = {ISBN 978-9-4912070-1-3}
}
Heuveline, V. and Krause, M.OpenLB: Towards an Efficient Parallel Open Source Library for Lattice Boltzmann Fluid Flow Simulations 2011(6126, 6127)PARA'08 Workshop on State-of-the-Art in Scientific and Parallel Computing, May 13-16, 2008 inproceedings 
BibTeX:
@inproceedings{krause:08,
  author = {Heuveline, V. and Krause, M.J.},
  title = {OpenLB: Towards an Efficient Parallel Open Source Library for Lattice Boltzmann Fluid Flow Simulations},
  booktitle = {PARA'08 Workshop on State-of-the-Art in Scientific and Parallel Computing, May 13-16, 2008},
  year = {2011},
  number = {6126, 6127},
  note = {2008}
}
Krause, M.J., Gengenbach, T. and Heuveline, V.Hybrid Parallel Simulations of Fluid Flows in Complex Geometries: Application to the Human Lungs 2011
Vol. 6586Euro-Par 2010 Parallel Processing Workshops, pp. 209-216 
incollectionURL 
BibTeX:
@incollection{krause:10a,
  author = {Krause, M. J. and Gengenbach, T. and Heuveline, V.},
  title = {Hybrid Parallel Simulations of Fluid Flows in Complex Geometries: Application to the Human Lungs},
  booktitle = {Euro-Par 2010 Parallel Processing Workshops},
  publisher = {Springer Berlin / Heidelberg},
  year = {2011},
  volume = {6586},
  pages = {209-216},
  url = {http://dx.doi.org/10.1007/978-3-642-21878-1_26}
}
Krause, M.J., Gengenbach, T., Zimny, S., Mayer, R. and Heuveline, V.How to Breathe Life into CT-Data 2011Computer Aided Medical Engineering(4), pp. 29-33 article 
BibTeX:
@article{krause:11a,
  author = {Mathias J. Krause and Thomas Gengenbach and Simon Zimny and Rolf Mayer and Vincent Heuveline},
  title = {How to Breathe Life into CT-Data},
  journal = {Computer Aided Medical Engineering},
  year = {2011},
  number = {4},
  pages = {29--33}
}
Heuveline, V., Lukarski, D., Subramanian, C. and Weiss, J.-P.Parallel Preconditioning and Modular Finite Element Solvers on Hybrid CPU-GPU Systems 2011(36)Proceedings of the Second International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering inproceedings 
BibTeX:
@inproceedings{Lukarski2011b,
  author = {Heuveline, V. and Lukarski, D. and Subramanian, C. and Weiss, J.-P.},
  title = {Parallel Preconditioning and Modular Finite Element Solvers on Hybrid CPU-GPU Systems},
  booktitle = {Proceedings of the Second International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering},
  publisher = {Civil-Comp Press},
  year = {2011},
  number = {36}
}
Heuveline, V., Lukarski, D. and Weiss, J.-P.Scalable Multi-Coloring Preconditioning for Multicore CPUs and GPUs 2011
Vol. 6586Euro-Par 2010 Parallel Processing Workshops, pp. 389-397 
incollection 
BibTeX:
@incollection{Lukarski2011c,
  author = {Heuveline, V. and Lukarski, D. and Weiss, J.-P.},
  title = {Scalable Multi-Coloring Preconditioning for Multicore CPUs and GPUs},
  booktitle = {Euro-Par 2010 Parallel Processing Workshops},
  publisher = {Springer Berlin / Heidelberg},
  year = {2011},
  volume = {6586},
  pages = {389--397}
}
Adamidis, P., Heuveline, V. and Wilhelm, F.An Efficient Scalable Solver for the Global Ocean Sea-Ice Model MPIOM 2011Proceedings of the Second International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering inproceedingsDOI  
BibTeX:
@inproceedings{MPIOMwilhelm,
  author = {Adamidis, Panagiotis and Heuveline, Vincent and Wilhelm, Florian},
  title = {An Efficient Scalable Solver for the Global Ocean Sea-Ice Model MPIOM},
  booktitle = {Proceedings of the Second International Conference on Parallel, Distributed, Grid and Cloud Computing for Engineering},
  publisher = {Civil-Comp Press},
  year = {2011},
  doi = {http://dx.doi.org/10.4203/ccp.95.58}
}
Oboril, F., Tahoori, M.B., Heuveline, V., Lukarski, D. and Weiss, J.-P.Numerical Defect Correction as an Algorithm-Based Fault Tolerance Technique for Iterative Solvers 2011Pacific Rim International Symposium on Dependable Computing, IEEE
Vol. 0, pp. 144-153 
articleDOI  
BibTeX:
@article{Oboril2011,
  author = {Fabian Oboril and Mehdi B. Tahoori and Vincent Heuveline and Dimitar Lukarski and Jan-Philipp Weiss},
  title = {Numerical Defect Correction as an Algorithm-Based Fault Tolerance Technique for Iterative Solvers},
  journal = {Pacific Rim International Symposium on Dependable Computing, IEEE},
  publisher = {IEEE Computer Society},
  year = {2011},
  volume = {0},
  pages = {144--153},
  doi = {http://doi.ieeecomputersociety.org/10.1109/PRDC.2011.26}
}
Schick, M.Uncertainty Quantification for Stochastic Dynamical Systems: Spectral Methods using Generalized Polynomial Chaos 2011School: Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT) phdthesis 
BibTeX:
@phdthesis{phd-schick,
  author = {Schick, M.},
  title = {Uncertainty Quantification for Stochastic Dynamical Systems: Spectral Methods using Generalized Polynomial Chaos},
  school = {Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT)},
  year = {2011}
}
Anzt, H., Heuveline, V., Aliaga, J.I., Castillo, M., Fernandez, J.C., Mayo, R. and Quintana-Orti, E.S.Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms 2011Green Computing Conference and Workshops (IGCC), 2011 International, pp. 1-6 inproceedingsDOI  
BibTeX:
@inproceedings{pmp,
  author = {Anzt, Hartwig and Heuveline, Vincent and Aliaga, José I. and Castillo, Maribel and Fernandez, Juan C. and Mayo, Rafael and Quintana-Orti, Enrique S.},
  title = {Analysis and optimization of power consumption in the iterative solution of sparse linear systems on multi-core and many-core platforms},
  booktitle = {Green Computing Conference and Workshops (IGCC), 2011 International},
  year = {2011},
  pages = {1--6},
  doi = {http://dx.doi.org/10.1109/IGCC.2011.6008594}
}
Rocker, B., Kolberg, M. and Heuveline, V.The Impact of Data Distribution in Accuracy and Performance of Parallel Linear Algebra Subroutines 2011
Vol. 6449High Performance Computing for Computational Science – VECPAR 2010, pp. 394-407 
inproceedingsURL 
BibTeX:
@inproceedings{springerlink:10.1007--978-3-642-19328-6_36,
  author = {Rocker, Björn and Kolberg, Mariana and Heuveline, Vincent},
  title = {The Impact of Data Distribution in Accuracy and Performance of Parallel Linear Algebra Subroutines},
  booktitle = {High Performance Computing for Computational Science – VECPAR 2010},
  publisher = {Springer Berlin / Heidelberg},
  year = {2011},
  volume = {6449},
  pages = {394-407},
  note = {10.1007/978-3-642-19328-6_36},
  url = {http://dx.doi.org/10.1007/978-3-642-19328-6_36}
}
Schönfelder, M., Eßer-Frey, A., Schick, M., Fichtner, W., Heuveline, V. and Leibfried, T.New Developments in Modeling Network Constraints in Techno-economic Energy System Expansion Planning Models 2011Zeitschrift für Energiewirtschaft, pp. 1-9 articleURL 
BibTeX:
@article{springerlink:10.1007--s12398-011-0067-8,
  author = {Schönfelder, Martin and Eßer-Frey, Anke and Schick, Michael and Fichtner, Wolf and Heuveline, Vincent and Leibfried, Thomas},
  title = {New Developments in Modeling Network Constraints in Techno-economic Energy System Expansion Planning Models},
  journal = {Zeitschrift für Energiewirtschaft},
  publisher = {Vieweg Verlag},
  year = {2011},
  pages = {1-9},
  note = {10.1007/s12398-011-0067-8},
  url = {http://dx.doi.org/10.1007/s12398-011-0067-8}
}
Subramanian, C., Criekingen, S.V., Heuveline, V., Nataf, F. and Havé, P.The Davidson method as an alternative to power iterations for criticality calculations 2011Annals of Nuclear Energy
Vol. 38(12), pp. 2818 - 2823 
articleDOIURL 
BibTeX:
@article{Subramanian20112818,
  author = {C. Subramanian and S. Van Criekingen and V. Heuveline and F. Nataf and P. Havé},
  title = {The Davidson method as an alternative to power iterations for criticality calculations},
  journal = {Annals of Nuclear Energy},
  year = {2011},
  volume = {38},
  number = {12},
  pages = {2818 - 2823},
  url = {http://www.sciencedirect.com/science/article/pii/S0306454911003471},
  doi = {http://dx.doi.org/10.1016/j.anucene.2011.08.018}
}
Subramanian, C.High Performance Computing for Stability Problems -- Applications to Hydrodynamic Stability and Neutron Transport Criticality 2011School: Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT) phdthesis 
BibTeX:
@phdthesis{subramanian-PHD,
  author = {Subramanian, Chandramowli},
  title = {High Performance Computing for Stability Problems -- Applications to Hydrodynamic Stability and Neutron Transport Criticality},
  school = {Engineering Mathematics and Computing Lab (EMCL) -- Karlsruhe Institute of Technology (KIT)},
  year = {2011}
}
Anzt, H., Heuveline, V. and Rocker, B.An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations 2011
Vol. 6449High Performance Computing for Computational Science -- VECPAR 2010, pp. 58-70 
inproceedingsDOI  
BibTeX:
@inproceedings{vecpar,
  author = {Anzt, Hartwig and Heuveline, Vincent and Rocker, Björn},
  title = {An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations},
  booktitle = {High Performance Computing for Computational Science -- VECPAR 2010},
  publisher = {Springer Berlin / Heidelberg},
  year = {2011},
  volume = {6449},
  pages = {58-70},
  doi = {http://dx.doi.org/10.1007/978-3-642-19328-6_8}
}
Anzt, H., Heuveline, V. and Rocker, B.Mixed precision error correction methods for linear systems Convergence analysis based on Krylov subspace methods 2010PARA 2010, Part II, LNCS 7134, pp. 237-248 inproceedings 
BibTeX:
@inproceedings{Anzt2010,
  author = {Anzt, Hartwig and Heuveline, Vincent and Rocker, Björn},
  title = {Mixed precision error correction methods for linear systems Convergence analysis based on Krylov subspace methods},
  booktitle = {PARA 2010, Part II, LNCS 7134},
  publisher = {Springer, Heidelberg},
  year = {2010},
  pages = {237--248}
}
Anzt, H., Rocker, B. and Heuveline, V.An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations 2010EMCL Preprint Series(01)EMCL Preprint Series miscURL 
Abstract: This paper proposes an error correction method for solving linear systems of equations and the evaluation of an implementation using mixed precision techniques. While different technologies are available, graphic processing units (GPUs) have been established as particularly powerful coprocessors in recent years. For this reason, our error correction approach is focused on a CUDA implementation executing the error correction solver on the GPU. Benchmarks are performed both for artificially created matrices with preset characteristics as well as matrices obtained from finite element discretizations of fluid flow problems.
BibTeX:
@misc{emcl-preprint-2010-01,
  author = {Anzt, Hartwig and Rocker, Björn and Heuveline, Vincent},
  title = {An Error Correction Solver for Linear Systems: Evaluation of Mixed Precision Implementations},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {01},
  url = {http://www.emcl.kit.edu/preprints/emcl-preprint-2010-01.pdf}
}
Anzt, H., Rocker, B. and Heuveline, V.Mixed Precision Error Correction Methods for Linear Systems: Convergence Analysis based on Krylov Subspace Methods 2010EMCL Preprint Series(02)EMCL Preprint Series miscURL 
Abstract: The convergence analysis of Krylov subspace solvers usually provides an estimation for the computational cost. Exact knowledge about the convergence theory of error correction methods using different floating point precision formats would make it possible to determine a priori whether the implementation of a mixed precision error correction solver using a certain Krylov subspace method as error correction solver outperforms the plain solver in high precision. This paper reveals characteristics of mixed precision error correction methods using Krylov subspace methods as inner solver.
BibTeX:
@misc{emcl-preprint-2010-02,
  author = {Anzt, Hartwig and Rocker, Björn and Heuveline, Vincent},
  title = {Mixed Precision Error Correction Methods for Linear Systems: Convergence Analysis based on Krylov Subspace Methods},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {02},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2010-02.pdf}
}
Anzt, H., Rocker, B. and Heuveline, V.Energy Efficiency of Mixed Precision Iterative Refinement Methods using Hybrid Hardware Platforms: An Evaluation of different Solver and Hardware Configurations 2010EMCL Preprint Series(03)EMCL Preprint Series miscURL 
Abstract: In this paper we evaluate the possibility of using mixed precision algorithms on different hardware platforms to obtain energy-efficient solvers for linear systems of equations. Our test cases arise in the context of computational fluid dynamics. Therefore, we analyze the energy efficiency of common cluster nodes and a hybrid, GPU-accelerated cluster node when applying a linear solver that can benefit from the use of different precision formats. We show the high potential of hardware-aware computing in terms of performance and energy efficiency.
BibTeX:
@misc{emcl-preprint-2010-03,
  author = {Anzt, Hartwig and Rocker, Björn and Heuveline, Vincent},
  title = {Energy Efficiency of Mixed Precision Iterative Refinement Methods using Hybrid Hardware Platforms: An Evaluation of different Solver and Hardware Configurations},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {03},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2010-03.pdf}
}
Anzt, H., Hahn, T., Heuveline, V. and Rocker, B.GPU Accelerated Scientific Computing: Evaluation of the NVIDIA Fermi Architecture; Elementary Kernels and Linear Solvers 2010EMCL Preprint Series(04)EMCL Preprint Series miscURL 
Abstract: This paper evaluates the latest GPU generation from NVIDIA, called "Fermi", with respect to the previous generation. The experiments include benchmarks of elementary kernels as well as of linear solvers applied to problems arising in the area of computational fluid dynamics. Both the consumer version of the hardware (GeForce GTX 480 and GTX 280) as well as the professional line (Tesla C2050 and C1060) are taken into account.
BibTeX:
@misc{emcl-preprint-2010-04,
  author = {Anzt, Hartwig and Hahn, Tobias and Heuveline, Vincent and Rocker, Björn},
  title = {GPU Accelerated Scientific Computing: Evaluation of the NVIDIA Fermi Architecture; Elementary Kernels and Linear Solvers},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {04},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2010-04.pdf}
}
Baumann, M. and Heuveline, V.Evaluation of Different Strategies for Goal Oriented Adaptivity in CFD -- Part I: The Stationary Case 2010EMCL Preprint Series(05)EMCL Preprint Series misc 
BibTeX:
@misc{emcl-preprint-2010-05,
  author = {Baumann, Martin and Heuveline, Vincent},
  title = {Evaluation of Different Strategies for Goal Oriented Adaptivity in CFD -- Part I: The Stationary Case},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {05}
}
Anzt, H., Augustin, W., Baumann, M., Bockelmann, H., Gengenbach, T., Hahn, T., Heuveline, V., Ketelaer, E., Lukarski, D., Otzen, A., Ritterbusch, S., Rocker, B., Ronnas, S., Schick, M., Subramanian, C., Weiss, J.-P. and Wilhelm, F.HiFlow3 -- A Flexible and Hardware-Aware Parallel Finite Element Package 2010EMCL Preprint Series(06)EMCL Preprint Series miscURL 
Abstract: This paper details the concept and implementation of the parallel finite element software package HiFlow3. HiFlow3 is driven by application requirements and aims at the solution of large-scale problems obtained by means of the finite element method for partial differential equations. By utilizing object-oriented concepts and the full capabilities of C++, the HiFlow3 project follows a modular and generic approach for building efficient parallel numerical solvers. It provides highly capable modules dealing with the mesh setup, finite element spaces, degrees of freedom, linear algebra routines, numerical solvers, and output data for visualization. Parallelism -- as the basis for high performance simulations on modern computing systems -- is introduced on two levels: coarse-grained parallelism by means of distributed grids and distributed data structures, and fine-grained parallelism by means of platform-optimized linear algebra back-ends. Modern numerical schemes in HiFlow3 are built on top of both levels of parallelism. This paper describes the project, its concept, and application scenarios in detail and outlines our hardware-aware cross-platform portable approach that benefits from various emerging technologies like GPU acceleration in a unified and user-friendly manner.
BibTeX:
@misc{emcl-preprint-2010-06,
  author = {Anzt, H. and Augustin, W. and Baumann, M. and Bockelmann, H. and Gengenbach, T. and Hahn, T. and Heuveline, V. and Ketelaer, E. and Lukarski, D. and Otzen, A. and Ritterbusch, S. and Rocker, B. and Ronnas, S. and Schick, M. and Subramanian, C. and Weiss, J.-P. and Wilhelm, F.},
  title = {HiFlow3 -- A Flexible and Hardware-Aware Parallel Finite Element Package},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {06},
  note = {06:1-36},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2010-06.pdf}
}
Augustin, W., Heuveline, V. and Weiss, J.-P.Convey HC-1 -- The Potential of FPGAs in Numerical Simulation 2010EMCL Preprint Series(07)EMCL Preprint Series miscURL 
Abstract: The Convey HC-1 Hybrid Core Computer brings FPGA technologies closer to numerical simulation. It combines two types of processor architectures in a single system. Highly capable FPGAs are closely connected to a host CPU and the accelerator-to-memory bandwidth has remarkable values. Reconfigurability by means of pre-defined application-specific instruction sets called personalities has the appeal of optimized hardware configuration with respect to application characteristics. Moreover, Convey's solution eases the programming effort considerably. In contrast to hardware-centric and time-consuming classical coding of FPGAs, a dual-target compiler interprets pragma-extended C/C++ or Fortran code and produces implementations running on both host and accelerator. In addition, a global view of host and device memory is provided by means of a cache-coherent shared virtual memory space. In this work we analyze Convey's programming paradigm and the associated programming effort, and we present practical results on the HC-1. We consider vectorization strategies for the single and double precision vector personalities and a suite of basic numerical routines. Furthermore, we assess the viability of the Convey HC-1 Hybrid Core Computer for numerical simulation.
BibTeX:
@misc{emcl-preprint-2010-07,
  author = {Augustin, W. and Heuveline, V. and Weiss, J.-P.},
  title = {Convey HC-1 -- The Potential of FPGAs in Numerical Simulation},
  journal = {EMCL Preprint Series},
  year = {2010},
  number = {07},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2010-07.pdf}
}
Grams, C., Jones, S., Marsham, J., Parker, D., Haywood, J. and Heuveline, V.The Atlantic inflow to the Saharan heat low: observations and modelling 2010Quarterly Journal of the Royal Meteorological Society(S1), pp. 125-140 articleDOIURL 
Abstract: The inflow of relatively cold and stably stratified air from the Atlantic Ocean into western Mauritania and into the southwestern part of the Saharan heat low is studied using the mesoscale COSMO model. This model was used to provide operational forecasts for the GERBILS field campaign, which was conducted by the Met Office in West Africa in June 2007. The forecasts were validated against airborne measurements as well as satellite imagery and were found to represent the main synoptic features of the region accurately. A complex mesoscale feature in western Mauritania, which we call the Atlantic Inflow, was identified in the COSMO model output. The main component of the Atlantic Inflow is the sea breeze and associated front. The sea breeze interacts with larger-scale, higher-altitude fluctuations in the thermal and humidity advection. During the day the balance between horizontal advection of cool maritime air and turbulence in the convective boundary layer over land results in a stationary sea breeze front at the coast. Once turbulence dies down in the evening, the sea breeze front penetrates inland. Above the sea breeze layer, thermal advection in the Saharan Atmospheric Boundary Layer (SABL) also controls the structure of the Atlantic Inflow. A marked baroclinic zone was observed, in which the temperature and humidity made a relatively smooth transition from values typical of the Atlantic air to values characteristic of the SABL. Budget calculations showed that, through its cooling and occasional moistening at low levels, the Atlantic Inflow has an important impact on the regional heat and moisture budgets. Copyright Royal Meteorological Society and Crown Copyright, 2009.
BibTeX:
@article{Heuveline2010,
  author = {Grams, C.M. and Jones, S.C. and Marsham, J.H. and Parker, D.J. and Haywood, J.M. and Heuveline, V.},
  title = {The Atlantic inflow to the Saharan heat low: observations and modelling},
  journal = {Quarterly Journal of the Royal Meteorological Society},
  year = {2010},
  number = {S1},
  pages = {125-140},
  url = {http://dx.doi.org/10.1002/qj.429},
  doi = {http://dx.doi.org/10.1002/qj.429}
}
Heuveline, V., Subramanian, C., Lukarski, D. and Weiss, J.-P.A multi-platform linear algebra toolbox for finite element solvers on heterogeneous clusters 2010Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS), 2010 IEEE International Conference on, pp. 1-6 inproceedingsDOI  
BibTeX:
@inproceedings{Heuveline2010a,
  author = {Heuveline, V. and Subramanian, C. and Lukarski, D. and Weiss, J.-P.},
  title = {A multi-platform linear algebra toolbox for finite element solvers on heterogeneous clusters},
  booktitle = {Cluster Computing Workshops and Posters (CLUSTER WORKSHOPS), 2010 IEEE International Conference on},
  year = {2010},
  pages = {1--6},
  doi = {http://dx.doi.org/10.1109/CLUSTERWKSP.2010.5613084}
}
Heuveline, V. and Wittwer, P.Exterior flows at low Reynolds numbers: concepts, solutions, and applications 2010Contemporary Challenges in Mathematical Fluid Mechanics and its Applications - Vol.1, pp. 77-170 inbook 
BibTeX:
@inbook{HeuvelineWittwer2010,
  author = {Heuveline, V. and Wittwer, P.},
  title = {Exterior flows at low Reynolds numbers: concepts, solutions, and applications},
  booktitle = {Contemporary Challenges in Mathematical Fluid Mechanics and its Applications - Vol.1},
  year = {2010},
  pages = {77-170}
}
Anzt, H., Augustin, W., Baumann, M., Bockelmann, H., Gengenbach, T., Hahn, T., Heuveline, V., Ketelaer, E., Lukarski, D., Otzen, A., Ritterbusch, S., Rocker, B., Ronnås, S., Schick, M., Subramanian, C., Weiss, J.-P. and Wilhelm, F.HiFlow3 - A Flexible and Hardware-Aware Parallel Finite Element Package 2010Proceedings of the 9th Workshop on Parallel/High-Performance Object-Oriented Scientific Computing, pp. 4:1-4:6 inproceedingsDOIURL 
BibTeX:
@inproceedings{HiFlow3POOSC2010,
  author = {Anzt, Hartwig and Augustin, Werner and Baumann, Martin and Bockelmann, Hendryk and Gengenbach, Thomas and Hahn, Tobias and Heuveline, Vincent and Ketelaer, Eva and Lukarski, Dimitar and Otzen, Andrea and Ritterbusch, Sebastian and Rocker, Björn and Ronnås, Staffan and Schick, Michael and Subramanian, Chandramowli and Weiss, Jan-Philipp and Wilhelm, Florian},
  title = {HiFlow3 - A Flexible and Hardware-Aware Parallel Finite Element Package},
  booktitle = {Proceedings of the 9th Workshop on Parallel/High-Performance Object-Oriented Scientific Computing},
  publisher = {ACM},
  year = {2010},
  pages = {4:1--4:6},
  url = {http://doi.acm.org/10.1145/2039312.2039316},
  doi = {http://doi.acm.org/10.1145/2039312.2039316}
}
Krause, M.J.Fluid Flow Simulation and Optimisation with Lattice Boltzmann Methods on High Performance Computers: Application to the Human Respiratory System 2010School: Karlsruhe Institute of Technology (KIT), Universität Karlsruhe (TH) phdthesisURL 
BibTeX:
@phdthesis{krause:10b,
  author = {Krause, M. J.},
  title = {Fluid Flow Simulation and Optimisation with Lattice Boltzmann Methods on High Performance Computers: Application to the Human Respiratory System},
  school = {Karlsruhe Institute of Technology (KIT), Universität Karlsruhe (TH)},
  year = {2010},
  note = {http://digbib.ubka.uni-karlsruhe.de/volltexte/1000019768},
  url = {http://digbib.ubka.uni-karlsruhe.de/volltexte/1000019768}
}
Oboril, F., Heuveline, V. and Weiss, J.-P.Parallel 3D Multigrid Methods on the STI Cell BE architecture 2010
Vol. 6310Facing the Multicore-Challenge, pp. 66-78 
incollection 
BibTeX:
@incollection{ObHeuWei2010,
  author = {Oboril, Fabian and Heuveline, Vincent and Weiss, Jan-Philipp},
  title = {Parallel 3D Multigrid Methods on the STI Cell BE architecture},
  booktitle = {Facing the Multicore-Challenge},
  publisher = {Springer, Heidelberg},
  year = {2010},
  volume = {6310},
  pages = {66--78}
}
Anzt, H., Heuveline, V. and Rocker, B.Mixed precision error correction methods for linear systems Convergence analysis based on Krylov subspace methods 2010PARA 2010, Part II, LNCS 7134, pp. 237-248 inproceedingsDOI  
BibTeX:
@inproceedings{para2010,
  author = {Anzt, Hartwig and Heuveline, Vincent and Rocker, Björn},
  title = {Mixed precision error correction methods for linear systems Convergence analysis based on Krylov subspace methods},
  booktitle = {PARA 2010, Part II, LNCS 7134},
  publisher = {Springer, Heidelberg},
  year = {2010},
  pages = {237--248},
  doi = {http://dx.doi.org/10.1007/978-3-642-28145-7_24}
}
Anzt, H., Rocker, B. and Heuveline, V.Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms 2010Computer Science - Research and Development
Vol. 25, pp. 141-148 
articleURL 
BibTeX:
@article{springerlink:10.1007--s00450-010-0124-2,
  author = {Anzt, Hartwig and Rocker, Björn and Heuveline, Vincent},
  title = {Energy efficiency of mixed precision iterative refinement methods using hybrid hardware platforms},
  journal = {Computer Science - Research and Development},
  publisher = {Springer Berlin / Heidelberg},
  year = {2010},
  volume = {25},
  pages = {141-148},
  note = {10.1007/s00450-010-0124-2},
  url = {http://dx.doi.org/10.1007/s00450-010-0124-2}
}
Augustin, W., Heuveline, V. and Weiss, J.-P.Optimized Stencil Computation Using In-Place Calculation on Modern Multicore Systems 2009Proceedings of the 15th International Euro-Par Conference on Parallel Processing, pp. 772-784 inproceedingsDOIURL 
BibTeX:
@inproceedings{AugHeuWei,
  author = {Augustin, Werner and Heuveline, Vincent and Weiss, Jan-Philipp},
  title = {Optimized Stencil Computation Using In-Place Calculation on Modern Multicore Systems},
  booktitle = {Proceedings of the 15th International Euro-Par Conference on Parallel Processing},
  publisher = {Springer},
  year = {2009},
  pages = {772--784},
  url = {http://dx.doi.org/10.1007/978-3-642-03869-3_72},
  doi = {http://dx.doi.org/10.1007/978-3-642-03869-3_72}
}
Heuveline, V., Rocker, B. and Ronnas, S.Numerical Simulation on the SiCortex Supercomputer Platform: a Preliminary Evaluation 2009EMCL Preprint Series(01)EMCL Preprint Series miscURL 
Abstract:

Most cluster systems used today for high-performance scientific computing are built from off-the-shelf standard components placed in racks. SiCortex has chosen a different strategy and offers a line of integrated cluster machines based on a customized low-frequency MIPS multicore processor and a specialized network fabric.

We investigate the potential of the SiCortex platform for numerical simulation by analyzing the performance of a set of elementary benchmarks and two fluid dynamics applications executed on the SC072 and the SC5832 systems. The elementary benchmarks quantify the performance in terms of computation rate, memory bandwidth and communication latency. The fluid dynamics applications provide insight into how well existing scientific code performs on the system. The results are compared to those obtained on a commodity cluster with Intel Xeon cores and Infiniband interconnect. The focus of the evaluation is computational performance, but we also consider the energy consumption for all three machines.

Our results indicate that while the SiCortex systems might be well suited for applications that can be parallelized to a very fine level, they are outperformed by commodity clusters when this is not the case. However, an analysis of the CFD applications shows that the SiCortex systems make it possible to significantly reduce the energy consumption compared to a commodity cluster.

BibTeX:
@misc{emcl-preprint-2009-01,
  author = {Heuveline, V. and Rocker, B. and Ronnas, S.},
  title = {Numerical Simulation on the SiCortex Supercomputer Platform: a Preliminary Evaluation},
  journal = {EMCL Preprint Series},
  year = {2009},
  number = {01},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2009-01.pdf}
}
Buchty, R., Heuveline, V., Karl, W. and Weiss, J.-P.A Survey on Hardware-aware and Heterogeneous Computing on Multicore Processors and Accelerators 2009EMCL Preprint Series(02)EMCL Preprint Series miscURL 
Abstract:

The paradigm shift towards multicore technologies is offering a great potential of computational power for scientific and industrial applications. It is, however, posing considerable challenges to software development. This problem is aggravated by the increasing heterogeneity of hardware platforms, both at the processor level and through the addition of dedicated accelerators. Performance gains for data- and compute-intensive applications can currently only be achieved by exploiting coarse- and fine-grained parallelism on all system levels, and improved scalability with respect to constantly increasing core counts. A key methodology is hardware-aware computing where in all production steps explicit knowledge of processor, memory, and network details is profitably utilized.

In this work we provide a survey on current multicore and accelerator technologies. We outline architectural features and show how these features are exposed to the programmer and how they can be beneficially utilized in the application-mapping process. In particular, we characterize the discrepancy to conventional parallel platforms with respect to hierarchical memory sub-systems, fine-grained parallelism on several system levels, and chip- and system-level heterogeneity. We motivate the necessity of hardware-aware computing and summarize the challenges arising from high-performance heterogeneous computing. Furthermore, we investigate the interaction of hardware and application characteristics for selected applications in numerical simulation.

BibTeX:
@misc{emcl-preprint-2009-02,
  author = {Buchty, Rainer and Heuveline, Vincent and Karl, Wolfgang and Weiss, Jan-Philipp},
  title = {A Survey on Hardware-aware and Heterogeneous Computing on Multicore Processors and Accelerators},
  journal = {EMCL Preprint Series},
  year = {2009},
  number = {02},
  url = {fileadmin/images/Publications/Preprints/emcl-preprint-2009-02.pdf}
}
Hahn, T., Heuveline, V. and Rocker, B.GPU-based Simulation of Particulate Flows with CUDA 2009Proceedings of the PARS Workshop 2009 inproceedings 
Abstract: Particulate flows, which involve gas and liquid with suspended solid particles like dust, are generally highly CPU-time demanding. The question arises whether such computations can be performed on the GPU assuming highly parallel programming models like CUDA. In this paper we demonstrate that numerical simulation in that context can greatly benefit from these technologies and present numerical simulation both in a 2D and 3D setup. We use the NVIDIA GT 200 GPU for these tests and evaluate this platform for basic scientific kernels encompassing operations on vectors, dense and sparse matrices.
BibTeX:
@inproceedings{HahnRockerHeuveline2009,
  author = {Hahn, T. and Heuveline, V. and Rocker, B.},
  title = {GPU-based Simulation of Particulate Flows with CUDA},
  booktitle = {Proceedings of the PARS Workshop 2009},
  publisher = {German Informatics Society},
  year = {2009}
}
Heuveline, V., Lukarski, D. and Weiss, J.-P.Performance of a Stream Processing Model on the Cell BE NUMA Architecture Applied to a 3D Conjugate Gradient Poisson Solver 2009International Journal of Computational Science
Vol. 3(5), pp. 473-490 
article 
BibTeX:
@article{HeuLukWei09,
  author = {Heuveline, V. and Lukarski, D. and Weiss, J.-P.},
  title = {Performance of a Stream Processing Model on the Cell BE NUMA Architecture Applied to a 3D Conjugate Gradient Poisson Solver},
  journal = {International Journal of Computational Science},
  year = {2009},
  volume = {3},
  number = {5},
  pages = {473--490}
}
Heuveline, V. and Strauss, F.Shape optimization towards stability in constrained hydrodynamic systems 2009Journal of Computational Physics
Vol. 228, pp. 938-951 
article 
BibTeX:
@article{Heuveline2009,
  author = {Heuveline, V. and Strauss, F.},
  title = {Shape optimization towards stability in constrained hydrodynamic systems},
  journal = {Journal of Computational Physics},
  year = {2009},
  volume = {228},
  pages = {938-951}
}
Heuveline, V. and Wittwer, P.Adaptive boundary conditions for exterior stationary flows in three dimensions 2009Journal of Mathematical Fluid Mechanics
Vol. 12(4), pp. 554-575 
articleDOI  
Abstract: Recently there has been an increasing interest for a better understanding of ultra low Reynolds number flows. In this context we present a new setup which allows to efficiently solve the stationary incompressible Navier-Stokes equations in an exterior domain in three dimensions numerically. The main point is that the necessity to truncate for numerical purposes the exterior domain to a finite sub-domain leads to the problem of finding so called "artificial boundary conditions" to replace the conditions at infinity. To solve this problem we provide a vector field that describes the leading asymptotic behavior of the solution at large distances. This vector field depends explicitly on drag and lift which are determined in a self-consistent way as part of the solution process. When compared with other numerical schemes the size of the computational domain that is needed to obtain the hydrodynamic forces with a given precision is drastically reduced, which in turn leads to an overall gain in computational efficiency of typically several orders of magnitude.
BibTeX:
@article{Heuvelineaccepted,
  author = {Heuveline, Vincent and Wittwer, Peter},
  title = {Adaptive boundary conditions for exterior stationary flows in three dimensions},
  journal = {Journal of Mathematical Fluid Mechanics},
  year = {2009},
  volume = {12},
  number = {4},
  pages = {554-575},
  doi = {10.1007/s00021-009-0302-9}
}
Heuveline, V., Krause, M. and Latt, J.Towards a hybrid parallelization of lattice Boltzmann methods 2009Computers and Mathematics with Applications
Vol. 58, pp. 1071-1080 
articleDOIURL 
Abstract: Ongoing research towards the development of a hybrid parallelization concept for lattice Boltzmann methods is presented. It allows coping with platforms sharing both the properties of shared and distributed architectures. The proposed approach relies on spatial domain decomposition where each domain represents a basic block entity which is solved on a symmetric multiprocessing (SMP) system. Emphasis is placed on the software design and the reworking needed to achieve good performance using OpenMP in that context. Those ideas are implemented in the C++ project OpenLB, which is also sketched in this article. The efficiency of the proposed approaches is tested on a 3D benchmark problem and compared with a purely MPI based approach.
BibTeX:
@article{Krause2009,
  author = {Heuveline, V. and Krause, M.J. and Latt, J.},
  title = {Towards a hybrid parallelization of lattice Boltzmann methods},
  journal = {Computers and Mathematics with Applications},
  year = {2009},
  volume = {58},
  pages = {1071-1080},
  url = {http://dx.doi.org/10.1016/j.camwa.2009.04.001},
  doi = {10.1016/j.camwa.2009.04.001}
}
Heuveline, V., Rocker, B. and Ronnas, S.Numerical Simulation on the SiCortex Supercomputer Platform: a Preliminary Evaluation 2009EMCL Preprint Series inproceedings 
Abstract:

Most cluster systems used today for high-performance scientific computing are built from off-the-shelf standard components placed in racks. SiCortex has chosen a different strategy and offers a line of integrated cluster machines based on a customized low-frequency MIPS multicore processor and a specialized network fabric.

We investigate the potential of the SiCortex platform for numerical simulation by analyzing the performance of a set of elementary benchmarks and two fluid dynamics applications executed on the SC072 and the SC5832 systems. The elementary benchmarks quantify the performance in terms of computation rate, memory bandwidth and communication latency. The fluid dynamics applications provide insight into how well existing scientific code performs on the system. The results are compared to those obtained on a commodity cluster with Intel Xeon cores and Infiniband interconnect. The focus of the evaluation is computational performance, but we also consider the energy consumption for all three machines.

Our results indicate that while the SiCortex systems might be well suited for applications that can be parallelized to a very fine level, they are outperformed by commodity clusters when this is not the case. However, an analysis of the CFD applications shows that the SiCortex systems make it possible to significantly reduce the energy consumption compared to a commodity cluster.

BibTeX:
@inproceedings{RonnasRockers2010,
  author = {Heuveline, V. and Rocker, B. and Ronnas, S.},
  title = {Numerical Simulation on the SiCortex Supercomputer Platform: a Preliminary Evaluation},
  booktitle = {EMCL Preprint Series},
  year = {2009}
}
Strauß, F., Hoferer, J., Heuveline, V. and Kasper, G.Scalable Numerical Tools for Flow and Pressure Drop Computation in Fibrous Filter Media 2009Chemical Engineering & Technology
Vol. 32(5), pp. 820-825 
article 
Abstract: Due to the high complexity of filter structures it is still a challenge to determine pressure drop and filtration behavior of these media. Numerical calculation tools can considerably contribute to pressure drop determination for inhomogeneous filter structures. The capability of a numerical solution process based on the finite element method is demonstrated using the numerical toolbox HiFlow to determine flow field and pressure drop of filter media.
BibTeX:
@article{Strauss2009,
  author = {Strauß, F. and Hoferer, J. and Heuveline, V. and Kasper, G.},
  title = {Scalable Numerical Tools for Flow and Pressure Drop Computation in Fibrous Filter Media},
  journal = {Chemical Engineering & Technology},
  year = {2009},
  volume = {32},
  number = {5},
  pages = {820-825}
}
Heuveline, V. and Weiss, J.-P.Lattice Boltzmann methods on the ClearSpeed Advance accelerator board 2009The European Physical Journal -- Special Topics
Vol. 171, pp. 31-36 
articleURL 
BibTeX:
@article{Weiss2009,
  author = {Heuveline, V. and Weiss, J.-P.},
  title = {Lattice Boltzmann methods on the ClearSpeed Advance accelerator board},
  journal = {The European Physical Journal -- Special Topics},
  publisher = {Springer Berlin / Heidelberg},
  year = {2009},
  volume = {171},
  pages = {31--36},
  doi = {10.1140/epjst/e2009-01008-8},
  url = {http://dx.doi.org/10.1140/epjst/e2009-01008-8}
}
Heuveline, V., Lukarski, D. and Weiss, J.-P.RapidMind Stream Processing on the PlayStation~3 for a 3D Chorin-based Navier-Stokes Solver 2008HipHaC'08: Proc. 1st Int. Workshop on New Frontiers in High-performance and Hardware-aware Computing, pp. 31-38 inproceedings 
BibTeX:
@inproceedings{Heuveline2008,
  author = {Heuveline, V. and Lukarski, D. and Weiss, Jan-Philipp},
  title = {RapidMind Stream Processing on the PlayStation~3 for a 3D Chorin-based Navier-Stokes Solver},
  booktitle = {HipHaC'08: Proc. 1st Int. Workshop on New Frontiers in High-performance and Hardware-aware Computing},
  publisher = {Universitätsverlag Karlsruhe},
  year = {2008},
  pages = {31-38}
}
Weygand, D., Senger, J., Motz, C., Augustin, W., Heuveline, V. and Gumbsch, P.High Performance computing and discrete dislocation dynamics: Plasticity of micrometer sized specimens 2008High Performance Computing in Science and Engineering '08, pp. 507-523 incollection 
Abstract: A parallel discrete dislocation dynamics tool is employed to study the size dependent plasticity of small metallic structures. The tool has been parallelised using OpenMP. An excellent overall scaling is observed for different loading scenarios. The size dependency of the plastic flow is confirmed by the performed simulations for uniaxial loading and micro-bending tests. The microstructural origin of the size effect is analysed. A strong influence of the initial microstructure on the statistics of the deformation behaviour is observed, for both the uniaxial and bending scenario.
BibTeX:
@incollection{Heuveline2008c,
  author = {Weygand, D. and Senger, J. and Motz, C. and Augustin, W. and Heuveline, V. and Gumbsch, P.},
  title = {High Performance computing and discrete dislocation dynamics: Plasticity of micrometer sized specimens},
  booktitle = {High Performance Computing in Science and Engineering '08},
  publisher = {Springer-Verlag, Berlin, Germany},
  year = {2008},
  pages = {507-523}
}
Deutschmann, O., Janardhanan, V., Tischer, S. and Heuveline, V.SOFC Modeling - From Micro-Kinetics to Stacks 2008Oral presentation A0201. 8th European Solid Oxide Fuel Cell Forum, Lucerne, Switzerland article 
BibTeX:
@article{Heuvelinetoappear,
  author = {Deutschmann, O. and Janardhanan, V. and Tischer, S. and Heuveline, V.},
  title = {SOFC Modeling - From Micro-Kinetics to Stacks},
  journal = {Oral presentation A0201. 8th European Solid Oxide Fuel Cell Forum, Lucerne, Switzerland},
  year = {2008}
}
Janardhanan, V., Heuveline, V. and Deutschmann, O.Three-phase Boundary Length in Solid-Oxide Fuel Cells: A mathematical Model 2008J. Power Sources
Vol. 178, pp. 368-372 
article 
Abstract: A mathematical model to calculate the volume specific three-phase boundary length in the porous composite electrodes of solid-oxide fuel cell is presented. The model is exclusively based on geometrical considerations accounting for porosity, particle diameter, particle size distribution, and solids phase distribution. Results are presented for uniform particle size distribution as well as for non-uniform particle size distribution.
BibTeX:
@article{Janardhanan2008,
  author = {Janardhanan, V.M. and Heuveline, V. and Deutschmann, O.},
  title = {Three-phase Boundary Length in Solid-Oxide Fuel Cells: A mathematical Model},
  journal = {J. Power Sources},
  year = {2008},
  volume = {178},
  pages = {368-372}
}
Karl, M., Seemann, G., Sachse, F., Dössel, O. and Heuveline, V.Time and Memory Efficient Implementation of the Cardiac Bidomain Equations 20084th European Conference of the International Federation for Medical and Biological Engineering inproceedings 
BibTeX:
@inproceedings{Karl2008,
  author = {Karl, M. and Seemann, G. and Sachse, F.B. and Dössel, O. and Heuveline, V.},
  title = {Time and Memory Efficient Implementation of the Cardiac Bidomain Equations},
  booktitle = {4th European Conference of the International Federation for Medical and Biological Engineering},
  year = {2008}
}
Rüger, B., Carraro, T., Joos, J., Heuveline, V. and Ivers-Tiffée, E.A Numerical Method for the Optimization of MIEC Cathode Microstructures 2008Proceedings of the 8th European Solid Oxide Fuel Cell Forum, pp. A0616 inproceedings 
BibTeX:
@inproceedings{Ruger2008,
  author = {Rüger, B. and Carraro, T. and Joos, J. and Heuveline, V. and Ivers-Tiffée, E.},
  title = {A Numerical Method for the Optimization of MIEC Cathode Microstructures},
  booktitle = {Proceedings of the 8th European Solid Oxide Fuel Cell Forum},
  year = {2008},
  pages = {A0616}
}
Seemann, G., Sachse, F., Karl, M., Weiss, D., Heuveline, V. and Dössel, O.Framework for modular, flexible and efficient solving the cardiac bidomain equation using PETSc 2008ECMI inproceedings 
BibTeX:
@inproceedings{Seemann2008,
  author = {Seemann, G. and Sachse, F.B. and Karl, M. and Weiss, D.L. and Heuveline, V. and Dössel, O.},
  title = {Framework for modular, flexible and efficient solving the cardiac bidomain equation using PETSc},
  booktitle = {ECMI},
  year = {2008}
}
Strauss, F., Heuveline, V. and Schweizer, B.Existence and Approximation Results for Shape Optimization Problems in Rotordynamics 2008Numerische Mathematik
Vol. 109(2), pp. 313-332 
article 
Abstract: We consider a shape optimization problem in rotordynamics where the mass of a rotor is minimized subject to constraints on the natural frequencies. Our analysis is based on a class of rotors described by a Rayleigh beam model including effects of rotary inertia and gyroscopic moments. The solution of the equation of motion leads to a generalized eigenvalue problem. The governing operators are non-symmetric due to the gyroscopic terms. We prove the existence of solutions for the optimization problem by using the theory of compact operators. For the numerical treatment of the problem a finite element discretization based on a variational formulation is considered. Applying results on spectral approximation of linear operators we prove that the solution of the discretized optimization problem converges towards the solution of the continuous problem if the discretization parameter tends to zero. Finally, a priori estimates for the convergence order of the eigenvalues are presented and illustrated by a numerical example.
BibTeX:
@article{Strauss2008,
  author = {Strauss, F. and Heuveline, V. and Schweizer, B.},
  title = {Existence and Approximation Results for Shape Optimization Problems in Rotordynamics},
  journal = {Numerische Mathematik},
  year = {2008},
  volume = {109},
  number = {2},
  pages = {313-332}
}
Strauss, F., Désidéri, J.-A., Duvigneau, R. and Heuveline, V.Multiobjective optimization in hydrodynamic stability control 2008INRIA Research Report(6608) article 
BibTeX:
@article{Strauss2008b,
  author = {Strauss, F. and Désidéri, J.-A. and Duvigneau, R. and Heuveline, V.},
  title = {Multiobjective optimization in hydrodynamic stability control},
  journal = {INRIA Research Report},
  year = {2008},
  number = {6608}
}
Bönisch, S. and Heuveline, V.On the Numerical Simulation of the Instationary Free Fall of a Solid in a Fluid. I. The Newtonian Case 2007Computers & Fluids
Vol. 36(9), pp. 1434-1445 
article 
Abstract: In the last decade, extensive experiments have been carried out in order to gain a better insight in the phenomenology related to the motion of a falling body in a quiescent Newtonian fluid. This renewal of interest in that subject is due not only to the lack of full theoretical explanations of the intricate body motion but also to the high relevance of that issue in applications ranging from meteorology, sedimentology and aerospace engineering to biology. We present a new numerical method for the simulation of the instationary free fall of a unique solid in a fluid. A key ingredient of the proposed approach is the reformulation of the conservation and kinetic equations in the solid frame as well as the explicit treatment of the fluid-body coupling. The issue of non-smooth data in time resulting from our explicit approach is addressed. The time stepping is based on the implicit fractional-step-$\theta$ scheme and the spatial discretization relies on the finite element method. Numerical experiments for the steady-falling regime, for the periodic oscillating motion as well as for the tumbling motion are presented following existing experimental set-up. The proposed method is validated by comparison with experimental data.
BibTeX:
@article{Bonisch2007,
  author = {Bönisch, S. and Heuveline, V.},
  title = {On the Numerical Simulation of the Instationary Free Fall of a Solid in a Fluid. I. The Newtonian Case},
  journal = {Computers & Fluids},
  year = {2007},
  volume = {36},
  number = {9},
  pages = {1434-1445}
}
Dörfler, W. and Heuveline, V.Convergence of an Adaptive hp Finite Element Strategy in One Space Dimension 2007Applied Numerical Mathematics
Vol. 57(10), pp. 1108-1124 
article 
Abstract: We show uniform decrease in energy error for an hp-adaptive algorithm with automatic hp selection on the elliptic model boundary value problem. The result is based on a new marking strategy for the finite element refinement. In case of a solution with algebraic singularity we demonstrate numerically that we achieve the known theoretical optimal exponential decrease of the error.
BibTeX:
@article{Dorfler2007,
  author = {Dörfler, W. and Heuveline, V.},
  title = {Convergence of an Adaptive hp Finite Element Strategy in One Space Dimension},
  journal = {Applied Numerical Mathematics},
  year = {2007},
  volume = {57},
  number = {10},
  pages = {1108-1124}
}
Galdi, G. and Heuveline, V.Lift and Sedimentation of Particles in the Flow of a Viscoelastic Liquid in a Channel 2007Free and Moving Boundaries: Analysis, Simulation and Control, pp. 65-95 incollection 
BibTeX:
@incollection{Galdi2007,
  author = {Galdi, G.P. and Heuveline, V.},
  title = {Lift and Sedimentation of Particles in the Flow of a Viscoelastic Liquid in a Channel},
  booktitle = {Free and Moving Boundaries: Analysis, Simulation and Control},
  publisher = {CRC Press},
  year = {2007},
  pages = {65-95}
}
Heuveline, V. and Schieweck, F.H1 Interpolation on Quadrilaterals and Hexahedral Meshes with hanging Nodes 2007Computing
Vol. 80, pp. 203-220 
article 
BibTeX:
@article{Heuveline2007,
  author = {Heuveline, V. and Schieweck, F.},
  title = {H1 Interpolation on Quadrilaterals and Hexahedral Meshes with hanging Nodes},
  journal = {Computing},
  year = {2007},
  volume = {80},
  pages = {203-220}
}
Heuveline, V. and Schieweck, F.On the Inf-Sup Condition for Higher Order Mixed FEM on Meshes with Hanging Nodes 2007Mathematical Modelling and Numerical Analysis M2AN
Vol. 41(1), pp. 1-20 
article 
BibTeX:
@article{Heuveline2007a,
  author = {Heuveline, V. and Schieweck, F.},
  title = {On the Inf-Sup Condition for Higher Order Mixed FEM on Meshes with Hanging Nodes},
  journal = {Mathematical Modelling and Numerical Analysis M2AN},
  year = {2007},
  volume = {41},
  number = {1},
  pages = {1-20}
}
Janardhanan, V., Heuveline, V. and Deutschmann, O.Performance analysis of a SOFC under direct internal reforming conditions 2007Journal of Power Sources
Vol. 172, pp. 296–307 
article 
Abstract: This paper presents the performance analysis of a planar solid-oxide fuel cell (SOFC) under direct internal reforming conditions. A detailed solid-oxide fuel cell model is used to study the influences of various operating parameters on cell performance. Significant differences in efficiency and power density are observed for isothermal and adiabatic operational regimes. The influence of air number, specific catalyst area, anode thickness, steam to carbon (s/c) ratio of the inlet fuel, and extent of pre-reforming on cell performance is analyzed. In all cases except for the case of pre-reformed fuel, adiabatic operation results in lower performance compared to isothermal operation. It is further discussed that, though direct internal reforming may lead to cost reduction and increased efficiency by effective utilization of waste heat, the efficiency of the fuel cell itself is higher for pre-reformed fuel compared to non-reformed fuel. Furthermore, criteria for the choice of optimal operating conditions for cell stacks operating under direct internal reforming conditions are discussed.
BibTeX:
@article{Janarhanan2007,
  author = {Janardhanan, V.M. and Heuveline, V. and Deutschmann, O.},
  title = {Performance analysis of a SOFC under direct internal reforming conditions},
  journal = {Journal of Power Sources},
  year = {2007},
  volume = {172},
  pages = {296--307}
}
Heuveline, V. and Latt, J.The OpenLB Project: An Open Source and Object Oriented Implementation of Lattice Boltzmann Methods 2007International Journal of Modern Physics C
Vol. 18(4), pp. 627-634 
article 
Abstract: The OpenLB project provides a C++ package for the implementation of lattice Boltzmann simulations that is general enough to address a vast range of problems in computational fluid dynamics. The package is mainly intended as a programming support for researchers and engineers who simulate fluid flows by means of a lattice Boltzmann method.
BibTeX:
@article{Latt2007,
  author = {Heuveline, V. and Latt, J.},
  title = {The OpenLB Project: An Open Source and Object Oriented Implementation of Lattice Boltzmann Methods},
  journal = {International Journal of Modern Physics C},
  year = {2007},
  volume = {18},
  number = {4},
  pages = {627-634}
}
Marinova, D., Lukarski, D. and Stavroulakis, G.Modeling and optimal control for plates with defects 2007J. Vib. Control
Vol. 13(9-10), pp. 1343-1353 
articleDOI  
Abstract: This paper presents modeling and vibration control design of thin lightweight plates taking into account defects. The classical plate bending theory is used for a finite element dynamical model. Defects are modeled in a smeared-type way by changing the physical parameters of the corresponding element. Optimal control is further studied for plates with defects supplied with sensors and actuators. LQR and $H_2$ control schemes are considered for vibration suppression. We investigate the influence of plate defects on the control of vibrations. The placement of control forces and their influence on the quality of the vibration-regulating process are considered. Numerical simulations are carried out in order to demonstrate the efficiency of the proposed control strategies.
BibTeX:
@article{MaLuSta4,
  author = {Marinova, Daniela and Lukarski, Dimitar and Stavroulakis, Georgios},
  title = {Modeling and optimal control for plates with defects},
  journal = {J. Vib. Control},
  year = {2007},
  volume = {13},
  number = {9-10},
  pages = {1343-1353},
  doi = {10.1177/1077546307077501}
}
Augustin, W., Heuveline, V., Meschkat, G., Schlünzen, K. and Schroeder, G.OpenMP Parallelization of the METRAS meteorology model: Application to the America's Cup 2007(6)High Performance Computing in Science and Engineering '07, pp. 547-559 incollection 
Abstract: We describe the parallelization of the meteorology model METRAS (MEsoscale TRAnsport and Stream) in the context of the America’s Cup 2007 for the South African sailing yacht Shosholoza. METRAS is a community model of the atmosphere whose development is coordinated at the Meteorological Institute, ZMAW, University of Hamburg. The parallelization, which is based on OpenMP, was done at the Steinbuch Centre for Computing (SCC) of the University of Karlsruhe and took advantage of the specific features of the Itanium-2 processors available on the local parallel computer HP XC6000. In this paper, we report on the parallelization of the meteorology model METRAS as well as describe how this parallelized version is being used in the highly challenging context of the America’s Cup.
BibTeX:
@incollection{P21_2007,
  author = {Augustin, W. and Heuveline, V. and Meschkat, G. and Schlünzen, K.H. and Schroeder, G.},
  title = {OpenMP Parallelization of the METRAS meteorology model: Application to the America's Cup},
  booktitle = {High Performance Computing in Science and Engineering '07},
  publisher = {Springer-Verlag, Berlin, Germany},
  year = {2007},
  number = {6},
  pages = {547-559}
}
Bönisch, S. and Heuveline, V.Advanced Flow Visualization with Hivision 2006Reactive Flows, Diffusion and Transport incollection 
Abstract: HiVision is a visualization platform including advanced visualization techniques for the analysis and exploration of data supplied by numerical simulation. Although primarily developed for visualization in the area of computational fluid dynamics as part of the HiFlow Project (WWW.HiFlow.De) the HiVision framework is discipline independent and may be advantageously used in various areas such as structural mechanics and reactive flow simulation. HiVision has been developed using the powerful C++ Visualization Toolkit library (VTK) and for the graphical user interface, the Qt library. The main features supported in release 2.0 are the following:
  • 2D/3D unstructured grids (quadrangle, hexahedron, triangle, tetrahedron),
  • Scalar data: texture mapping, transparency, colorbar, cutplanes, isolines, isosurfaces
  • Vector data: interactive creation, streamlines, pathlines, glyph, hedgehogs
  • Stereo rendering,
  • Animation (based on MPEG),
  • Mouse based interaction.
BibTeX:
@incollection{Bonisch2006,
  author = {Bönisch, S. and Heuveline, V.},
  title = {Advanced Flow Visualization with Hivision},
  booktitle = {Reactive Flows, Diffusion and Transport},
  publisher = {Springer, Berlin},
  year = {2006}
}
Bönisch, S., Heuveline, V. and Wittwer, P.Second Order Adaptive Boundary Conditions for Exterior Flow Problems: Non-symmetric Stationary Flows in two Dimensions 2006Journal of Mathematical Fluid Mechanics
Vol. 8, pp. 1-26 
article 
Abstract: We consider the problem of solving numerically the stationary incompressible Navier-Stokes equations in an exterior domain in two dimensions. For numerical purposes we truncate the domain to a finite sub-domain, which leads to the problem of finding so called artificial boundary conditions to replace the boundary conditions at infinity. To solve this problem we construct - by combining results from dynamical systems theory with matched asymptotic expansion techniques based on the old ideas of Goldstein and Van Dyke - a smooth divergence free vector field depending explicitly on drag and lift and describing the solution to second and dominant third order, asymptotically at large distances from the body. The resulting expression appears to be new, even on a formal level. This improves the method introduced by the authors in a previous paper and generalizes it to non-symmetric flows. The numerical scheme determines the boundary conditions and the forces on the body in a self-consistent way as an integral part of the solution process. When compared with our previous paper where first order asymptotic expressions were used on the boundary, the inclusion of second and third order asymptotic terms further reduces the computational cost for determining lift and drag to a given precision by typically another order of magnitude.
BibTeX:
@article{Bonisch2006a,
  author = {Bönisch, S. and Heuveline, V. and Wittwer, P.},
  title = {Second Order Adaptive Boundary Conditions for Exterior Flow Problems: Non-symmetric Stationary Flows in two Dimensions},
  journal = {Journal of Mathematical Fluid Mechanics},
  year = {2006},
  volume = {8},
  pages = {1-26}
}
Carraro, T., Heuveline, V. and Rannacher, R.Determination of Kinetic Parameters in Laminar Flow Reactors. I. Numerical Aspects 2006Reactive Flows, Diffusion and Transport incollection 
BibTeX:
@incollection{Carraro2006,
  author = {Carraro, T. and Heuveline, V. and Rannacher, R.},
  title = {Determination of Kinetic Parameters in Laminar Flow Reactors. I. Numerical Aspects},
  booktitle = {Reactive Flows, Diffusion and Transport},
  publisher = {Springer},
  year = {2006}
}
Heuveline, V. and Krause, M.Biotechnologie und Numerik auf Hochleistungsrechnern: ein zukünftiges Gespann in Baden-Württemberg, Marktstudie für HWW GmbH 2006 booklet 
BibTeX:
@booklet{Heuveline2006,
  author = {Heuveline, V. and Krause, M.J.},
  title = {Biotechnologie und Numerik auf Hochleistungsrechnern: ein zukünftiges Gespann in Baden-Württemberg, Marktstudie für HWW GmbH},
  year = {2006}
}
Heuveline, V. and Nam-Dung, H.On Two Numerical Approaches for the Boundary Control Stabilization of Semi-linear Parabolic Systems: A Comparison 2006International Series of Numerical Mathematics
Vol. 155, pp. 25-47 
article 
Abstract: The present article is concerned with boundary control stabilization of semi-linear parabolic systems which are unstable if uncontrolled. A particular emphasis is put on Dirichlet control in that context. We investigate two different numerical approaches to solve these problems. The first approach relies on the extension method proposed by A.V. Fursikov where the considered partial differential equations are first solved on an extended domain with suitable initial value leading to a stable solution. The needed control is then defined as an appropriate trace of this solution. The second approach relies on the formulation of the stabilization problem as an optimization problem with constraints based on partial differential equations. We address the numerical issues related to both classes of approaches toward a comparison of their specific stabilization properties. The considered methodology is applied to the solution of test parabolic problems assuming linear and nonlinear models.
BibTeX:
@article{Heuveline2006a,
  author = {Heuveline, V. and Nam-Dung, H.},
  title = {On Two Numerical Approaches for the Boundary Control Stabilization of Semi-linear Parabolic Systems: A Comparison},
  journal = {International Series of Numerical Mathematics},
  year = {2006},
  volume = {155},
  pages = {25-47}
}
Heuveline, V. and Nam-Dung, H.On Two Numerical Approaches for the Boundary Control Stabilization of Semi-linear Parabolic Systems: A Comparison 2006(06-02) techreport 
Abstract: The present article is concerned with boundary control stabilization of semi-linear parabolic systems which are unstable if uncontrolled. A particular emphasis is put on Dirichlet control in that context. We investigate two different numerical approaches to solve these problems. The first approach relies on the extension method proposed by A.V. Fursikov where the considered partial differential equations are first solved on an extended domain with suitable initial value leading to a stable solution. The needed control is then defined as an appropriate trace of this solution. The second approach relies on the formulation of the stabilization problem as an optimization problem with constraints based on partial differential equations. We address the numerical issues related to both classes of approaches toward a comparison of their specific stabilization properties. The considered methodology is applied to the solution of test parabolic problems assuming linear and nonlinear models.
BibTeX:
@techreport{Heuveline2006b,
  author = {Heuveline, V. and Nam-Dung, H.},
  title = {On Two Numerical Approaches for the Boundary Control Stabilization of Semi-linear Parabolic Systems: A Comparison},
  year = {2006},
  number = {06-02}
}
Heuveline, V. and Walther, A. Online Checkpointing for Parallel Adjoint Computation in PDEs: Application to Goal-Oriented Adaptivity and Flow Control 2006
Vol. 4128, Proceedings of Euro-Par 2006, pp. 689-699
inproceedings
BibTeX:
@inproceedings{Heuveline2006c,
  author = {Heuveline, V. and Walther, A.},
  title = {Online Checkpointing for Parallel Adjoint Computation in PDEs: Application to Goal-Oriented Adaptivity and Flow Control},
  booktitle = {Proceedings of Euro-Par 2006},
  publisher = {Springer},
  year = {2006},
  volume = {4128},
  pages = {689-699}
}
Heuveline, V. and Rannacher, R. Adaptive FEM for Eigenvalue Problems with Application in Hydrodynamic Stability Analysis 2006 Journal of Numerical Mathematics, pp. 1-32 article
BibTeX:
@article{Heuveline2006d,
  author = {Heuveline, V. and Rannacher, R.},
  title = {Adaptive FEM for Eigenvalue Problems with Application in Hydrodynamic Stability Analysis},
  journal = {Journal of Numerical Mathematics},
  year = {2006},
  pages = {1-32}
}
Stavroulakis, G., Marinova, D. and Lukarski, D. Modeling of plates in bending and influence of damage 2006 Applications of mathematics in engineering and economics. Proceedings of the 31st international conference, AMEE '05, pp. 249-261 inproceedings
Abstract: We present modeling and vibration control of thin plates taking into account defects. The classical plate theory is used for the finite element dynamical model of the elastic plate. The defects are modeled by an equivalent change of the elastic modulus of the corresponding element. Optimal control design is implemented for the plates with defects supplied with sensors and actuators. A linear quadratic performance criterion is used for this purpose. It leads to linear control with constant gain that gives good properties of the closed loop system and easy implementation. We investigate the defects' influence on the plate vibration control taking into account placement of control forces. Numerical simulations are carried out in order to demonstrate the efficiency of the proposed control strategies.
BibTeX:
@inproceedings{MaStaLu1,
  author = {Stavroulakis, G. and Marinova, D. and Lukarski, D.},
  title = {Modeling of plates in bending and influence of damage},
  booktitle = {Applications of mathematics in engineering and economics. Proceedings of the 31st international conference, AMEE '05},
  year = {2006},
  pages = {249-261}
}
Marinova, D., Stavroulakis, G., Zacharenakis, E. and Lukarski, D. Active optimal control of damaged smart plates in bending 2006 6th European Solid Mechanics Conference, ESMC 2006 inproceedings
BibTeX:
@inproceedings{MaStaZaLu3,
  author = {Marinova, D.G. and Stavroulakis, G.E. and Zacharenakis, E.C. and Lukarski, D.H.},
  title = {Active optimal control of damaged smart plates in bending},
  booktitle = {6th European Solid Mechanics Conference, ESMC 2006},
  year = {2006}
}
Kagei, Y., Ruzicka, M. and Thäter, G. A Limit Problem in Natural Convection 2006 NoDEA : Nonlinear Differential Equations and Applications
Vol. 13, pp. 447-467
article URL
BibTeX:
@article{springerlink:10.1007--s00030-006-4020-1,
  author = {Kagei, Y. and Ruzicka, M. and Thäter, G.},
  title = {A Limit Problem in Natural Convection},
  journal = {NoDEA : Nonlinear Differential Equations and Applications},
  publisher = {Birkhäuser Basel},
  year = {2006},
  volume = {13},
  pages = {447-467},
  doi = {10.1007/s00030-006-4020-1},
  url = {http://dx.doi.org/10.1007/s00030-006-4020-1}
}
Stavroulakis, G., Marinova, D., Lukarski, D. and Zacharenakis, E. Nondestructive Identification of defects for smart plates in bending using genetic algorithms 2006 III European Conference on Computational Mechanics - Solids, Structures and Coupled Problems in Engineering inproceedings
BibTeX:
@inproceedings{StaMaLuZa2,
  author = {Stavroulakis, G.E. and Marinova, D.G. and Lukarski, D.H. and Zacharenakis, E.C.},
  title = {Nondestructive Identification of defects for smart plates in bending using genetic algorithms},
  booktitle = {III European Conference on Computational Mechanics - Solids, Structures and Coupled Problems in Engineering},
  publisher = {Springer},
  year = {2006}
}
Strauss, F., Heuveline, V. and Schweizer, B. Existence and Approximation Results for Shape Optimization Problems in Rotordynamics 2006 (06-06) techreport
Abstract: We consider a shape optimization problem in rotordynamics where the mass of a rotor is minimized subject to constraints on the natural frequencies. Our analysis is based on a class of rotors described by a Rayleigh beam model including effects of rotary inertia and gyroscopic moments. The solution of the equation of motion leads to a generalized eigenvalue problem. The governing operators are non-symmetric due to the gyroscopic terms. We prove the existence of solutions for the optimization problem by using the theory of compact operators. For the numerical treatment of the problem a finite element discretization based on a variational formulation is considered. Applying results on spectral approximation of linear operators we prove that the solution of the discretized optimization problem converges towards the solution of the continuous problem if the discretization parameter tends to zero. Finally, a priori estimates for the convergence order of the eigenvalues are presented and illustrated by a numerical example.
BibTeX:
@techreport{Strauss2006,
  author = {Strauss, F. and Heuveline, V. and Schweizer, B.},
  title = {Existence and Approximation Results for Shape Optimization Problems in Rotordynamics},
  year = {2006},
  number = {06-06}
}
Augustin, W., Straub, M.-O. and Worsch, T. Benchmarking One-Sided Communication with SKaMPI 5 2005
Vol. 3666, PVM/MPI, pp. 301-308
inproceedings URL
Abstract: SKaMPI is now an established benchmark for MPI implementations. Two important goals of the development of version 5 of SKaMPI were the extension of the benchmark to cover more functionality of MPI, and a redesign of the benchmark allowing it to be extended more easily. In the present paper we give an overview of the extension of SKaMPI 5 for the evaluation of one-sided communication and present a few selected results of benchmark runs, giving an impression of the breadth and depth of SKaMPI 5. A look at the source code, which is available under the GPL, reveals that it was easy to extend SKaMPI 5 with benchmarks for one-sided communication.
BibTeX:
@inproceedings{Augustin2005,
  author = {Werner Augustin and Marc-Oliver Straub and Thomas Worsch},
  title = {Benchmarking One-Sided Communication with SKaMPI 5},
  booktitle = {PVM/MPI},
  publisher = {Springer},
  year = {2005},
  volume = {3666},
  pages = {301--308},
  url = {http://dx.doi.org/10.1007/11557265_40}
}
Bönisch, S., Heuveline, V. and Wittwer, P. Adaptive Boundary Conditions for Exterior Flow Problems 2005 Journal of Mathematical Fluid Mechanics
Vol. 7(1), pp. 85-107 
article 
Abstract: We consider the problem of solving numerically the stationary incompressible Navier-Stokes equations in an exterior domain in two dimensions. This corresponds to studying the stationary fluid flow past a body. The necessity to truncate for numerical purposes the infinite exterior domain to a finite domain leads to the problem of finding appropriate boundary conditions on the surface of the truncated domain. We solve this problem by providing a vector field describing the leading asymptotic behavior of the solution. This vector field is given in the form of an explicit expression depending on a real parameter. We show that this parameter can be determined from the total drag exerted on the body. Using this fact we set up a self-consistent numerical scheme that determines the parameter, and hence the boundary conditions and the drag, as part of the solution process. We compare the values of the drag obtained with our adaptive scheme with the results from using traditional constant boundary conditions. Computational times are typically reduced by several orders of magnitude.
BibTeX:
@article{Bonisch2005,
  author = {Bönisch, S. and Heuveline, V. and Wittwer, P.},
  title = {Adaptive Boundary Conditions for Exterior Flow Problems},
  journal = {Journal of Mathematical Fluid Mechanics},
  year = {2005},
  volume = {7},
  number = {1},
  pages = {85-107}
}
Dietz, B., Heine, A., Heuveline, V. and Richter, A. Test of New Numerical Approach to the Quantization of Billiards 2005 Physical Review E
Vol. E 71, 02673 
article 
Abstract: A newly developed method for computing large numbers of eigenvalues of self-adjoint elliptic operators [J. Comp. Phys 184, 321 (2003)] is tested in numerical studies of the spectral properties of quantum billiards. To this extent, we study a time-reversal invariant quantum billiard of threefold symmetry, that undergoes a transformation in its symmetry properties from C_3 to C_3v. Thereby a transition from Gaussian orthogonal to Gaussian unitary ensemble statistics is observed, verifying earlier experimental indications and theoretical predictions. At the same time our numerical ansatz is shown to be unique in its applicability to arbitrary billiard shapes.
BibTeX:
@article{Dietz2005,
  author = {Dietz, B. and Heine, A. and Heuveline, V. and Richter, A.},
  title = {Test of New Numerical Approach to the Quantization of Billiards},
  journal = {Physical Review E},
  year = {2005},
  volume = {E 71, 02673}
}
Dörfler, W. and Heuveline, V. Convergence of an adaptive hp finite element method 2005 (37) techreport
BibTeX:
@techreport{Dorfler2005,
  author = {Dörfler, W. and Heuveline, V.},
  title = {Convergence of an adaptive hp finite element method},
  year = {2005},
  number = {37}
}
Heuveline, V. A posteriori error estimation and adaptivity for flow problems 2005 Lecture Notes for the 9th Spring School on Fluid Mechanics of Roscoff, pp. 1-49 inbook
BibTeX:
@inbook{Heuveline2005,
  author = {Heuveline, V.},
  title = {A posteriori error estimation and adaptivity for flow problems},
  booktitle = {Lecture Notes for the 9th Spring School on Fluid Mechanics of Roscoff},
  year = {2005},
  pages = {1-49}
}
Heuveline, V. Adaptive Finite Elements for the Steady Free Fall of a Body in Newtonian Fluid 2005 Comptes Rendus de l'Académie des Sciences, Mécanique
Vol. 333(12), pp. 896-909 
article 
BibTeX:
@article{Heuveline2005a,
  author = {Heuveline, V.},
  title = {Adaptive Finite Elements for the Steady Free Fall of a Body in Newtonian Fluid},
  journal = {Comptes Rendus de l'Académie des Sciences, Mécanique},
  year = {2005},
  volume = {333},
  number = {12},
  pages = {896-909}
}
Heuveline, V. Optimal Control of Coupled System of PDE 2005 (18) techreport
BibTeX:
@techreport{Heuveline2005b,
  author = {Heuveline, V.},
  title = {Optimal Control of Coupled System of PDE},
  year = {2005},
  number = {18}
}
Heuveline, V., Bönisch, S. and Rannacher, R. On the Numerical Simulation of the Free Fall Problem 2005 Modeling, Simulation and Optimization of Complex Processes: Proceedings of the International Conference on High Performance Scientific Computing, Hanoi, pp. 47-57 inproceedings
Abstract: The numerical simulation of the free fall of a solid body in a viscous fluid is a challenging task since it requires computational domains which usually need to be several orders of magnitude larger than the solid body in order to avoid the influence of artificial boundaries. Toward an optimal mesh design in that context, we propose a method based on the weighted a posteriori error estimation of the finite element approximation of the fluid/body motion. A key ingredient for the proposed approach is the reformulation of the conservation and kinetic equations in the solid frame as well as the implicit treatment of the hydrodynamic forces and torque acting on the solid body in the weak formulation. Information given by the solution of an adequate dual problem allows control of the discretization error of given functionals. The analysis encompasses the control of the free fall velocity, the orientation of the body, the hydrodynamic force and torque on the body. Numerical experiments for the two dimensional sedimentation problem for symmetric and nonsymmetric bodies validate the method.
BibTeX:
@inproceedings{Heuveline2005c,
  author = {Heuveline, V. and Bönisch, S. and Rannacher, R.},
  title = {On the Numerical Simulation of the Free Fall Problem},
  booktitle = {Modeling, Simulation and Optimization of Complex Processes: Proceedings of the International Conference on High Performance Scientific Computing, Hanoi},
  publisher = {Springer},
  year = {2005},
  pages = {47-57}
}
Paillère, H., Le Quéré, P., Weisman, C., Vierendeels, J., Dick, E., Braack, M., Dabbene, F., Beccantini, A., Studer, E., Kloczko, T., Corre, C., Heuveline, V., Darbandi, M. and Hosseinizadeh, S.F. Modelling of Natural Convection Flows with Large Temperature Differences: A Benchmark Problem for Low Mach Number Solvers. II: Contributions to the June 2004 Conference 2005 Mathematical Modelling and Numerical Analysis
Vol. 39(3), pp. 617-621 
article 
Abstract: In the second part of the paper, we compare the solutions produced in the framework of the conference "Mathematical and numerical aspects of low Mach number flows" organized by INRIA and MAB in Porquerolles, June 2004, to the reference solutions described in Part 1. We make some recommendations on how to produce good quality solutions, and list a number of pitfalls to be avoided.
BibTeX:
@article{Paillere2005,
  author = {Paillère, H. and Le Quéré, P. and Weisman, C. and Vierendeels, J. and Dick, E. and Braack, M. and Dabbene, F. and Beccantini, A. and Studer, E. and Kloczko, T. and Corre, C. and Heuveline, V. and Darbandi, M. and Hosseinizadeh, S. F.},
  title = {Modelling of Natural Convection Flows with Large Temperature Differences: A Benchmark Problem for Low Mach Number Solvers. II: Contributions to the June 2004 Conference},
  journal = {Mathematical Modelling and Numerical Analysis},
  year = {2005},
  volume = {39},
  number = {3},
  pages = {617-621}
}
Augustin, W., Haller, M., Straub, M.-O. and Worsch, T. SKaMPI -- towards version 5 2004 High Performance Computing in Science and Engineering '04 incollection
Abstract: SKaMPI is now an established benchmark for MPI implementations. The development of SKaMPI-5 strives for improvements in several directions:
  1. extension of the benchmark to cover more functionality of MPI,
  2. construction of a collection of collective algorithm kernels which are not supported by core MPI collective operations,
  3. a redesign of the SKaMPI benchmark allowing it to be extended more easily (thus matching requests from SKaMPI users).
In the present paper we give an overview of the extension of SKaMPI for the evaluation of virtual topologies, describe the foundations of new algorithms for fast all-to-all communication specifically tailored for the case of differing message sizes, and give a first impression of what SKaMPI-5 will look like, for which we now have a prototype running.
BibTeX:
@incollection{Augustin2004,
  author = {Werner Augustin and Michael Haller and Marc-Oliver Straub and Thomas Worsch},
  title = {SKaMPI -- towards version 5},
  booktitle = {High Performance Computing in Science and Engineering '04},
  publisher = {Springer-Verlag, Berlin, Germany},
  year = {2004}
}
Bönisch, S. and Heuveline, V. On the Numerical Simulation of the Free Fall of a Solid in a Fluid. II. The Viscoelastic Case 2004 (2004-32) techreport
Abstract: We present a numerical scheme for the simulation of the steady, translational sedimentation of a single solid body in a non-Newtonian fluid of second-order type. This investigation is motivated by the problem of the orientation of falling particles in non-Newtonian liquids which finds its roots in various engineering applications such as flow-induced microstructures, manufacturing of short fiber composites and separation of macromolecules by electrophoresis. The proposed approach is based on a reformulation of the conservation and kinetic equations in the body frame as well as a solver oriented toward the determination of steady state solutions of the underlying fluid-structure coupling. Unstable solutions are filtered out by means of a numerical linearized stability analysis with respect to the body equations. Numerical experiments are presented and validate the proposed approach.
BibTeX:
@techreport{Bonisch2004,
  author = {Bönisch, S. and Heuveline, V.},
  title = {On the Numerical Simulation of the Free Fall of a Solid in a Fluid. II. The Viscoelastic Case},
  year = {2004},
  number = {2004-32}
}
Bönisch, S., Heuveline, V. and Wittwer, P. Second Order Adaptive Boundary Conditions for Exterior Flow Problems: Non-symmetric Stationary Flows in two Dimensions 2004 (2004-10) techreport
Abstract: We consider the problem of solving numerically the stationary incompressible Navier-Stokes equations in an exterior domain in two dimensions. For numerical purposes we truncate the domain to a finite sub-domain, which leads to the problem of finding so called artificial boundary conditions to replace the boundary conditions at infinity. To solve this problem we construct - by combining results from dynamical systems theory with matched asymptotic expansion techniques based on the old ideas of Goldstein and Van Dyke - a smooth divergence free vector field depending explicitly on drag and lift and describing the solution to second and dominant third order, asymptotically at large distances from the body. The resulting expression appears to be new, even on a formal level. This improves the method introduced by the authors in a previous paper and generalizes it to non-symmetric flows. The numerical scheme determines the boundary conditions and the forces on the body in a self-consistent way as an integral part of the solution process. When compared with our previous paper where first order asymptotic expressions were used on the boundary, the inclusion of second and third order asymptotic terms further reduces the computational cost for determining lift and drag to a given precision by typically another order of magnitude.
BibTeX:
@techreport{Bonisch2004a,
  author = {Bönisch, S. and Heuveline, V. and Wittwer, P.},
  title = {Second Order Adaptive Boundary Conditions for Exterior Flow Problems: Non-symmetric Stationary Flows in two Dimensions},
  year = {2004},
  number = {2004-10}
}
Dietz, B., Heine, A., Heuveline, V. and Richter, A. Test of New Numerical Approach to the Quantization of Billiards 2004 (2004-08) techreport
Abstract: A newly developed method for computing large numbers of eigenvalues of self-adjoint elliptic operators [J. Comp. Phys 184, 321 (2003)] is tested in numerical studies of the spectral properties of quantum billiards. To this extent, we study a time-reversal invariant quantum billiard of threefold symmetry, that undergoes a transformation in its symmetry properties from C_3 to C_3v. Thereby a transition from Gaussian orthogonal to Gaussian unitary ensemble statistics is observed, verifying earlier experimental indications and theoretical predictions. At the same time our numerical ansatz is shown to be unique in its applicability to arbitrary billiard shapes.
BibTeX:
@techreport{Dietz2004,
  author = {Dietz, B. and Heine, A. and Heuveline, V. and Richter, A.},
  title = {Test of New Numerical Approach to the Quantization of Billiards},
  year = {2004},
  number = {2004-08}
}
Galdi, G. and Heuveline, V. Lift and Sedimentation of Particles in the Flow of a Viscoelastic Liquid in a Channel 2004 (2004-34) techreport
BibTeX:
@techreport{Galdi2004,
  author = {Galdi, G.P. and Heuveline, V.},
  title = {Lift and Sedimentation of Particles in the Flow of a Viscoelastic Liquid in a Channel},
  year = {2004},
  number = {2004-34}
}
Heuveline, V. Numerical Methods for Instationary Control Problems 2004 Oberwolfach Reports
Vol. 1, pp. 191-217 
article 
BibTeX:
@article{Heuveline2004,
  author = {Heuveline, V.},
  title = {Numerical Methods for Instationary Control Problems},
  journal = {Oberwolfach Reports},
  year = {2004},
  volume = {1},
  pages = {191-217}
}
Heuveline, V. Self-Adaptive Methods for PDE 2004 Oberwolfach Reports
Vol. 1, pp. 829-872 
article 
BibTeX:
@article{Heuveline2004a,
  author = {Heuveline, V.},
  title = {Self-Adaptive Methods for PDE},
  journal = {Oberwolfach Reports},
  year = {2004},
  volume = {1},
  pages = {829-872}
}
Heuveline, V. and Schieweck, F. An Interpolation Operator for H1 Functions on General Quadrilateral and Hexahedral Meshes with Hanging Nodes 2004 (2004-21) techreport
Abstract: We propose a Scott-Zhang type interpolation operator for the approximation of non-smooth functions in H^1 by means of continuous piecewise polynomials of low order. The novelty of the proposed interpolation operator is that it is defined for a general non-affine family of quadrilateral and hexahedral meshes with possibly hanging nodes. The derived interpolator exhibits optimal approximation properties for functions in H^1 and preserves homogeneous Dirichlet boundary conditions naturally.
BibTeX:
@techreport{Heuveline2004b,
  author = {Heuveline, V. and Schieweck, F.},
  title = {An Interpolation Operator for H1 Functions on General Quadrilateral and Hexahedral Meshes with Hanging Nodes},
  year = {2004},
  number = {2004-21}
}
Thäter, G. Natural convection, dissipation & power-law rheology: mathematical models & results 2004 book URL
BibTeX:
@book{thäter2004natural,
  author = {Thäter, G.},
  title = {Natural convection, dissipation \& power-law rheology: mathematical models \& results},
  publisher = {Schr.},
  year = {2004},
  url = {http://books.google.de/books?id=ndaOZwEACAAJ}
}
Augustin, W. and Worsch, T. Usefulness and Usage of SKaMPI-Bench 2003
Vol. 2840, PVM/MPI, pp. 63-70
inproceedings URL
Abstract: SKaMPI is a benchmark for measuring the performance of MPI implementations. Some examples of surprising behaviour of MPI libraries are presented. These result in certain new requirements for MPI benchmarks and will lead to major extensions in the new SKaMPI-Bench.
BibTeX:
@inproceedings{Augustin2003,
  author = {Werner Augustin and Thomas Worsch},
  title = {Usefulness and Usage of SKaMPI-Bench},
  booktitle = {PVM/MPI},
  publisher = {Springer},
  year = {2003},
  volume = {2840},
  pages = {63--70},
  url = {http://springerlink.metapress.com/openurl.asp?genre=article&issn=0302-9743&volume=2840&spage=63}
}
Bönisch, S., Heuveline, V. and Wittwer, P. Adaptive Boundary Conditions for Exterior Flow Problems 2003 (2003-02) techreport
Abstract: We consider the problem of solving numerically the stationary incompressible Navier-Stokes equations in an exterior domain in two dimensions. This corresponds to studying the stationary fluid flow past a body. The necessity to truncate for numerical purposes the infinite exterior domain to a finite domain leads to the problem of finding appropriate boundary conditions on the surface of the truncated domain. We solve this problem by providing a vector field describing the leading asymptotic behavior of the solution. This vector field is given in the form of an explicit expression depending on a real parameter. We show that this parameter can be determined from the total drag exerted on the body. Using this fact we set up a self-consistent numerical scheme that determines the parameter, and hence the boundary conditions and the drag, as part of the solution process. We compare the values of the drag obtained with our adaptive scheme with the results from using traditional constant boundary conditions. Computational times are typically reduced by several orders of magnitude.
BibTeX:
@techreport{Bonisch2003,
  author = {Bönisch, S. and Heuveline, V. and Wittwer, P.},
  title = {Adaptive Boundary Conditions for Exterior Flow Problems},
  year = {2003},
  number = {2003-02}
}
Nazarov, S. and Thäter, G. Asymptotics at infinity of solutions to the Neumann problem in a sieve-type layer 2003 Comptes Rendus Mecanique
Vol. 331, pp. 85-90
article DOI
BibTeX:
@article{CRMec.331...85N_2003,
  author = {Nazarov, S. and Thäter, G.},
  title = {Asymptotics at infinity of solutions to the Neumann problem in a sieve-type layer},
  journal = {Comptes Rendus Mecanique},
  year = {2003},
  volume = {331},
  pages = {85-90},
  doi = {10.1016/S1631-0721(02)00005-0}
}
Heuveline, V. On Higher-Order Mixed FEM for Low Mach Number Flows: Application to a Natural Convection Benchmark Problem 2003 International Journal for Numerical Methods in Fluids
Vol. 41(12), pp. 1339-1356 
article 
Abstract: We consider higher-order mixed finite elements with continuous pressures for the computation of stationary compressible flows at low Mach number. The proposed approach is based on a fully coupled treatment of the governing equations and therefore, for steady-state calculations, does not rely on time-stepping techniques. The non-linear problem is solved by means of a quasi-Newton iteration. The strongly coupled system resulting from higher-order discretization of the linearized equations requires adequate solvers. We propose a new scheme based on multigrid methods with varying FEM ansatz orders on the grid hierarchy as well as multiplicative smoothers based on blocking techniques. Computational results are described for a benchmark configuration including a flow with heat transfer in the low Mach number regime. Furthermore, the issue of anisotropic grids is addressed in that context.
BibTeX:
@article{Heuveline2003,
  author = {Heuveline, V.},
  title = {On Higher-Order Mixed FEM for Low Mach Number Flows: Application to a Natural Convection Benchmark Problem},
  journal = {International Journal for Numerical Methods in Fluids},
  year = {2003},
  volume = {41},
  number = {12},
  pages = {1339-1356}
}
Heuveline, V. On the Computation of a Very Large Number of Eigenvalues for Selfadjoint Elliptic Operators by Means of Multigrid Methods 2003 Journal of Computational Physics
Vol. 184(1), pp. 321-337 
article 
Abstract: Recent results in the study of quantum manifestations in classical chaos raise the problem of computing a very large number of eigenvalues of selfadjoint elliptic operators. The standard numerical methods for large eigenvalue problems cover the range of applications where a few of the leading eigenvalues are needed. They are not appropriate and generally fail to solve problems involving a number of eigenvalues exceeding a few hundred. Further, the accurate computation of a large number of eigenvalues leads to a much larger problem dimension in comparison with the usual case dealing with only a few eigenvalues. A new method is presented which combines multigrid techniques with the Lanczos process. The resulting scheme requires O(mn) arithmetic operations and O(n) storage, where n is the number of unknowns and m the number of needed eigenvalues. The discretization of the considered differential operators is realized by means of p-finite elements and is applicable on general geometries. Numerical experiments validate the proposed approach and demonstrate that it makes it possible to tackle problems considered to be beyond the range of standard iterative methods, at least on current workstations. The ability to compute more than 9000 eigenvalues of an operator of dimension exceeding 8 million on a PC shows the potential of this method. Practical applications are found, e.g. in the numerical simulation of quantum billiards.
BibTeX:
@article{Heuveline2003a,
  author = {Heuveline, V.},
  title = {On the Computation of a Very Large Number of Eigenvalues for Selfadjoint Elliptic Operators by Means of Multigrid Methods},
  journal = {Journal of Computational Physics},
  year = {2003},
  volume = {184},
  number = {1},
  pages = {321-337}
}
Heuveline, V. and Rannacher, R. Adaptive FEM for eigenvalue problems 2003 Numerical mathematics and advanced applications. Proceedings of Enumath 2001, the 4th European conference, Ischia, pp. 713-722 inproceedings
BibTeX:
@inproceedings{Heuveline2003c,
  author = {Heuveline, V. and Rannacher, R.},
  title = {Adaptive FEM for eigenvalue problems},
  booktitle = {Numerical mathematics and advanced applications. Proceedings of Enumath 2001, the 4th European conference, Ischia},
  year = {2003},
  pages = {713-722}
}
Heuveline, V. and Rannacher, R. Duality-Based Adaptivity in the hp-Finite Element Method 2003 Journal of Numerical Mathematics
Vol. 11(2), pp. 95-113 
article 
Abstract: A posteriori error estimates for the hp-finite element method are proved. Numerical tests are given.
BibTeX:
@article{Heuveline2003d,
  author = {Heuveline, V. and Rannacher, R.},
  title = {Duality-Based Adaptivity in the hp-Finite Element Method},
  journal = {Journal of Numerical Mathematics},
  year = {2003},
  volume = {11},
  number = {2},
  pages = {95-113}
}
Rannacher, R. and Heuveline, V. A numerical tool for flow simulation in a Wankel motor 2003 Mathematics - Key Technology for the Future. Joint Projects between Universities and Industry, pp. 33-43 incollection
BibTeX:
@incollection{Rannacher2003,
  author = {Rannacher, R. and Heuveline, V.},
  title = {A numerical tool for flow simulation in a Wankel motor},
  booktitle = {Mathematics - Key Technology for the Future. Joint Projects between Universities and Industry},
  publisher = {Springer},
  year = {2003},
  pages = {33-43}
}
Worsch, T., Reussner, R.H. and Augustin, W. On Benchmarking Collective MPI Operations 2002
Vol. 2474, Recent advances in parallel virtual machine and message passing interface: 9th European PVM MPI Users' Group Meeting, Linz, Austria, September 29 -- October 02, 2002
inproceedings 
Abstract: This article concentrates on recent work on benchmarking collective operations with SKaMPI. The goal of the SKaMPI project is the creation of a database containing performance measurements of parallel computers in terms of MPI operations. These data support software developers in creating portable and fast programs. Existing algorithms for measuring the timing of collective operations are discussed and a new algorithm is presented, taking into account the differences of local clocks. Results of measurements on a Cray T3E/900 and an IBM RS 6000 SP are presented.
BibTeX:
@inproceedings{Augustin2002,
  author = {Thomas Worsch and Ralf H. Reussner and Werner Augustin},
  title = {On Benchmarking Collective MPI Operations},
  booktitle = {Recent advances in parallel virtual machine and message passing interface: 9th European PVM MPI Users' Group Meeting, Linz, Austria, September 29 -- October 02, 2002},
  publisher = {Springer-Verlag, Berlin, Germany},
  year = {2002},
  volume = {2474}
}
Becker, R., Heuveline, V. and Rannacher, R. An Optimal Control Approach to Adaptivity in Computational Fluid Mechanics 2002 International Journal for Numerical Methods in Fluids
Vol. 40(1-2), pp. 105-120 
article 
Abstract: We consider a typical design cycle in computational fluid mechanics. First, an output value such as the drag coefficient is computed. Then, this value is optimized by varying certain control parameters. Finally, the stability of the resulting optimized flow is analysed. For each of these tasks, we derive a posteriori error estimators within a uniform framework based on the optimal control approach described in R. Becker and R. Rannacher [Acta Numerica, Iserles A. (ed.), 1-102 CUP: Cambridge (2001)]. These estimators are used for adaptive mesh refinement in order to increase the accuracy up to a given tolerance.
BibTeX:
@article{Becker2002,
  author = {Becker, R. and Heuveline, V. and Rannacher, R.},
  title = {An Optimal Control Approach to Adaptivity in Computational Fluid Mechanics},
  journal = {International Journal for Numerical Methods in Fluids},
  year = {2002},
  volume = {40},
  number = {1-2},
  pages = {105-120}
}
Heuveline, V.Finite element approximations of eigenvalue problems for elliptic partial differential operators 2002School: Universität Heidelberg other 
BibTeX:
@other{Heuveline2002,
  author = {Heuveline, V.},
  title = {Finite element approximations of eigenvalue problems for elliptic partial differential operators},
  school = {Universität Heidelberg},
  year = {2002}
}
Becker, R., Braack, M., Heuveline, V. and Rannacher, R.Efficient Numerical Methods for the Simulation of Flow Reactors 2001Technical Proceedings of the Fourth International Conference on Modelling and Simulation of Microsystems, pp. 194-197 inproceedings 
Abstract: We describe recent developments in the design of numerical methods for the simulation of heat transport and chemical processes in flow reactors. For viscous flow in the low-Mach-number regime, we use a stable finite element method for discretizing the compressible Navier-Stokes equations which is oriented by the incompressible limit case. The flow and chemical equations are solved as a fully coupled system by an adaptive defect correction method. This approach is illustrated by several examples of flow reactors including a case with steep temperature gradients. The efficiency and predictive capability of our codes are verified by comparison with traditional ad-hoc methods and experimental data.
BibTeX:
@inproceedings{Becker2001,
  author = {Becker, R. and Braack, M. and Heuveline, V. and Rannacher, R.},
  title = {Efficient Numerical Methods for the Simulation of Flow Reactors},
  booktitle = {Technical Proceedings of the Fourth International Conference on Modelling and Simulation of Microsystems},
  year = {2001},
  pages = {194-197}
}
Heuveline, V.On Higher-Order Mixed FEM for Low Mach Number Flows: Applications to a Natural Convection Benchmark Problem 2001(2001-27) techreport 
BibTeX:
@techreport{Heuveline2001,
  author = {Heuveline, V.},
  title = {On Higher-Order Mixed FEM for Low Mach Number Flows: Applications to a Natural Convection Benchmark Problem},
  year = {2001},
  number = {2001-27}
}
Heuveline, V.On the Computation of a Very Large Number of Eigenvalues for Selfadjoint Elliptic Operators by Means of Multigrid Methods 2001(2001-37) techreport 
Abstract: Recent results in the study of quantum manifestations in classical chaos raise the problem of computing a very large number of eigenvalues of selfadjoint elliptic operators. The standard numerical methods for large eigenvalue problems cover the range of applications where a few of the leading eigenvalues are needed. They are not appropriate and generally fail to solve problems involving more than a few hundred eigenvalues. Further, the accurate computation of a large number of eigenvalues leads to a much larger problem dimension than the usual case dealing with only a few eigenvalues. A new method is presented which combines multigrid techniques with the Lanczos process. The resulting scheme requires O(mn) arithmetic operations and O(n) storage, where n is the number of unknowns and m the number of needed eigenvalues. The discretization of the considered differential operators is realized by means of p-finite elements and is applicable on general geometries. Numerical experiments validate the proposed approach and demonstrate that it makes it possible to tackle problems considered to be beyond the range of standard iterative methods, at least on current workstations. The ability to compute more than 9000 eigenvalues of an operator of dimension exceeding 8 million on a PC shows the potential of this method. Practical applications are found, e.g. in the numerical simulation of quantum billiards.
BibTeX:
@techreport{Heuveline2001a,
  author = {Heuveline, V.},
  title = {On the Computation of a Very Large Number of Eigenvalues for Selfadjoint Elliptic Operators by Means of Multigrid Methods},
  year = {2001},
  number = {2001-37}
}
Rannacher, R. and Heuveline, V.A Posteriori Error Control for Finite Element Approximations of Elliptic Eigenvalue Problems 2001Advances in Computational Mathematics
Vol. 15(1-4), pp. 107-138 
article 
Abstract: The authors present a posteriori error estimates for the Galerkin finite element approximations of the eigenelements of elliptic eigenvalue problems with Dirichlet boundary conditions. The a posteriori error estimates provide means for mesh refinement processes. The theory is illustrated by results of comprehensive numerical experiments.
BibTeX:
@article{Rannacher2001,
  author = {Rannacher, R. and Heuveline, V.},
  title = {A Posteriori Error Control for Finite Element Approximations of Elliptic Eigenvalue Problems},
  journal = {Advances in Computational Mathematics},
  year = {2001},
  volume = {15},
  number = {1-4},
  pages = {107-138}
}
Rannacher, R. and Heuveline, V.A Posteriori Error Control for Finite Element Approximations of Elliptic Eigenvalue Problems 2001(2001-08) techreport 
Abstract: The authors present a posteriori error estimates for the Galerkin finite element approximations of the eigenelements of elliptic eigenvalue problems with Dirichlet boundary conditions. The a posteriori error estimates provide means for mesh refinement processes. The theory is illustrated by results of comprehensive numerical experiments.
BibTeX:
@techreport{Rannacher2001a,
  author = {Rannacher, R. and Heuveline, V.},
  title = {A Posteriori Error Control for Finite Element Approximations of Elliptic Eigenvalue Problems},
  year = {2001},
  number = {2001-08}
}
Heuveline, V. and Bertsch, C.On Multigrid Methods for the Eigenvalue Computation of Nonselfadjoint Elliptic Operators 2000East-West Journal of Numerical Mathematics
Vol. 8(4), pp. 275-297 
article 
Abstract: The authors present two different approaches for the eigenvalue computation of nonselfadjoint operators. The authors first consider a pure multigrid approach very similar to the method proposed by W. Hackbusch [Multigrid method and applications (1985; Zbl 0595.65106)]. It relies on the knowledge of a good initial guess on coarse grids, which restricts the field of its applications. In the second approach, a new scheme is proposed which couples the Jacobi-Davidson method with a multigrid process based on a defect correction. Numerical experiments for the convection-diffusion equation considering various Péclet numbers are included and show a drastic overall cost reduction compared to standard pure algebraic methods.
BibTeX:
@article{Heuveline2000,
  author = {Heuveline, V. and Bertsch, C.},
  title = {On Multigrid Methods for the Eigenvalue Computation of Nonselfadjoint Elliptic Operators},
  journal = {East-West Journal of Numerical Mathematics},
  year = {2000},
  volume = {8},
  number = {4},
  pages = {275-297}
}
Heuveline, V. and Bertsch, C.On Multigrid Methods for the Eigenvalue Computation of Nonselfadjoint Elliptic Operators 2000(2000-38) techreport 
Abstract: The authors present two different approaches for the eigenvalue computation of nonselfadjoint operators. The authors first consider a pure multigrid approach very similar to the method proposed by W. Hackbusch [Multigrid method and applications (1985; Zbl 0595.65106)]. It relies on the knowledge of a good initial guess on coarse grids, which restricts the field of its applications. In the second approach, a new scheme is proposed which couples the Jacobi-Davidson method with a multigrid process based on a defect correction. Numerical experiments for the convection-diffusion equation considering various Péclet numbers are included and show a drastic overall cost reduction compared to standard pure algebraic methods.
BibTeX:
@techreport{Heuveline2000a,
  author = {Heuveline, V. and Bertsch, C.},
  title = {On Multigrid Methods for the Eigenvalue Computation of Nonselfadjoint Elliptic Operators},
  year = {2000},
  number = {2000-38}
}
Heuveline, V.Polynomial acceleration techniques for large non hermitian eigenvalue problems and pseudospectra 1997 techreport 
BibTeX:
@techreport{Heuveline1997,
  author = {Heuveline, V.},
  title = {Polynomial acceleration techniques for large non hermitian eigenvalue problems and pseudospectra},
  year = {1997}
}
Heuveline, V., Philippe, B. and Sadkane, M.Parallel Computation of Spectral Portrait of Large Matrices by Davidson Type Methods 1997Numerical Algorithms
Vol. 16(1), pp. 55-75 
article 
Abstract: The paper concerns spectral portrait computations and plotting for non-normal sparse and large matrices. The method chosen has been proposed by E. R. Davidson [J. Comput. Phys. 17, 87-94 (1975; Zbl 0293.65022)]. The authors develop an efficient parallelization of this method which allows the spectral portraits to be computed on distributed-memory parallel computer systems. For the general Davidson method, a selection of two preconditioners is shown which improves the performance of parallel implementations. The computational experiments performed on a Paragon computer confirm the efficiency of the parallelization approach.
BibTeX:
@article{Heuveline1997a,
  author = {Heuveline, V. and Philippe, B. and Sadkane, M.},
  title = {Parallel Computation of Spectral Portrait of Large Matrices by Davidson Type Methods},
  journal = {Numerical Algorithms},
  year = {1997},
  volume = {16},
  number = {1},
  pages = {55-75}
}
Heuveline, V. and Sadkane, M.Arnoldi-Faber Method for Large non-Hermitian Eigenvalue Problems 1997Electronic Transactions on Numerical Analysis
Vol. 5, pp. 62-76 
article 
Abstract: Y. Saad and others have used Chebyshev polynomials to accelerate the convergence of the Arnoldi method for computing the right-most eigenvalues of large non-Hermitian matrices. This paper describes how Faber polynomials, which were used by G. Starke and R. S. Varga for the solution of linear systems, may also be used for the eigenvalue problem instead of Chebyshev polynomials. Numerical tests confirm the usefulness of the proposed method for the computation of eigenvalues and eigenvectors.
BibTeX:
@article{Heuveline1997b,
  author = {Heuveline, V. and Sadkane, M.},
  title = {Arnoldi-Faber Method for Large non-Hermitian Eigenvalue Problems},
  journal = {Electronic Transactions on Numerical Analysis},
  year = {1997},
  volume = {5},
  pages = {62-76}
}
Heuveline, V. and Sadkane, M.Parallel Computation of Polynomials with Minimal Uniform Norm and its Application to Large Eigenproblems 1997Journal of Computational and Applied Mathematics
Vol. 82(1-2), pp. 185-198 
article 
Abstract: We investigate the parallelization of an algorithm that computes the polynomial with minimal uniform norm on polygonal domains. The obtained polynomial is used to accelerate large non-Hermitian eigenvalue problems. We report the performance results obtained on the Paragon machine and compare this method with the one based on Chebyshev acceleration techniques.
BibTeX:
@article{Heuveline1997c,
  author = {Heuveline, V. and Sadkane, M.},
  title = {Parallel Computation of Polynomials with Minimal Uniform Norm and its Application to Large Eigenproblems},
  journal = {Journal of Computational and Applied Mathematics},
  year = {1997},
  volume = {82},
  number = {1-2},
  pages = {185-198}
}
Heuveline, V. and Sadkane, M.Chebyshev Acceleration Techniques for Large Complex non-Hermitian Eigenvalue Problems 1996Reliable Computing
Vol. 2(2), pp. 111-117 
article 
Abstract: We propose an extension of the Arnoldi-Chebyshev algorithm to the large complex non-Hermitian case. We demonstrate the algorithm on two applied problems.
BibTeX:
@article{Heuveline1996,
  author = {Heuveline, V. and Sadkane, M.},
  title = {Chebyshev Acceleration Techniques for Large Complex non-Hermitian Eigenvalue Problems},
  journal = {Reliable Computing},
  year = {1996},
  volume = {2},
  number = {2},
  pages = {111-117}
}
Heuveline, V. and Sadkane, M.Parallel Computation of Spectral Portrait of Large Matrices 1996Lecture Notes in Computer Science
Vol. 1184, pp. 398-407 
article 
Abstract: This paper presents a parallel version of a method of the type presented by E. R. Davidson [J. Comput. Phys. 17, 87-94 (1975; Zbl 0293.65022)] for the computation of the spectral portrait of large non-Hermitian matrices. Performance results obtained on the Paragon machine are reported.
BibTeX:
@article{Heuveline1996a,
  author = {Heuveline, V. and Sadkane, M.},
  title = {Parallel Computation of Spectral Portrait of Large Matrices},
  journal = {Lecture Notes in Computer Science},
  year = {1996},
  volume = {1184},
  pages = {398-407}
}
Heuveline, V.Lösung von linearen Programmen durch Extrapolation bei dem Verfahren der analytischen Zentren und Reduzierung auf affine Teilräume 1993 mastersthesis 
BibTeX:
@mastersthesis{Heuveline1993,
  author = {Heuveline, V.},
  title = {Lösung von linearen Programmen durch Extrapolation bei dem Verfahren der analytischen Zentren und Reduzierung auf affine Teilräume},
  year = {1993}
}
Created by JabRef on 23/11/2017.