Automated calculation of matrix elements and physics motivated observables
Z. Was

TL;DR
This paper discusses the development of automated computational methods for calculating matrix elements and observables in high-energy physics, aiding precision tests of the Standard Model.
Contribution
It introduces automated calculation techniques for complex matrix elements and physics observables, enhancing accuracy and efficiency in high-energy physics research.
Findings
Automated calculations of spin effects in e+ e- to tau+ tau- gamma.
Phenomenological analysis of e+ e- to 4f and nu_e nu_e gamma gamma processes.
CP-sensitive observable analysis for Higgs boson parity.
Abstract
The central aspect of my personal scientific activity, has focused on calculations useful for interpretation of High Energy accelerator experimental results, especially in a domain of precision tests of the Standard Model. My activities started in early 80's, when computer support for algebraic manipulations was in its infancy. But already then it was important for my work. It brought a multitude of benefits, but at the price of some inconvenience for physics intuition. Calculations became more complex, work had to be distributed over teams of researchers and due to automatization, some aspects of the intermediate results became more difficult to identify. In my talk I will not be very exhaustive, I will present examples from my personal research only: (i) calculations of spin effects for the process e+ e- to tau+tau- gamma at Petra/PEP energies, calculations (with the help of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Automated calculation of matrix elements and physics motivated observables
Z. Was
Institute of Nuclear Physics, Polish Academy of Sciences 31 342 Krakow, Poland [email protected]
Abstract
The central aspect of my personal scientific activity, has focused on calculations useful for interpretation of High Energy accelerator experimental results, especially in a domain of precision tests of the Standard Model. My activities started in early 80’s, when computer support for algebraic manipulations was in its infancy. But already then it was important for my work. It brought a multitude of benefits, but at the price of some inconvenience for physics intuition. Calculations became more complex, work had to be distributed over teams of researchers and due to automatization, some aspects of the intermediate results became more difficult to identify.
In my talk I will not be very exhaustive, I will present examples from my personal research only: (i) calculations of spin effects for the process at Petra/PEP energies, calculations (with the help of the Grace system of Minami-tateya group) and phenomenology of spin amplitudes for (ii) and for (iii) processes, (iv) phenomenology of CP-sensitive observables for Higgs boson parity in cascade decays.
Presented at “4th Computational Particle Physics Workshop” 8 - 11 October 2016, Hayama Japan
Preprint IFJ PAN-IV-2017-2, January 2017
1 Introduction
Once computers equipped with algebraic languages became available, approaches to phenomenology of High Energy accelerator experiments changed substantially. For me personally, it all started in 1980. It was clear that numerous benefits appeared. Control of large expression became easier, in many cases it simply became possible for the first time. Nearly immediately some drawbacks appeared as well. For example, some seemingly obsolete expertise, like methods for special function expansions, started to disappear. At least for some years and for some communities.
This was all part of a complex and generally very fruitful development. In this presentation I will concentrate on my personal experience. I do not have any intentions to be systematic or balanced. A more balanced picture will hopefully appear together with other talks collected in the proceedings. That is also why, I think, I do not need to focus on successes of the field. These are well known. I will rather review difficulties or traps I have encountered myself. In fact traps, once resolved, turned out to be rewarding, often in an unexpected way.
The presentation is organized as follows. Section 2 will discuss computer algebra techniques which were applied in our work for Monte Carlo program KORALB [1] for the process at Petra/PEP energies. The following Section 3, is devoted to work on spin amplitudes for KKMC Monte Carlo [2]. In particular to studies of spin amplitude sub-structure for the process, necessary to accommodate resulting expressions to Yennie Frautschi Suura exponentiation [3]; variant of exclusive exponentiation of initial and final state radiation [4]. The t-channel (even though non-singular) exchange required careful attention. The following Section 4 is devoted to phenomenology of four-fermion state production at high energy and the study of unexpected formation of peaks due to conspiracy between spin and selection cut effects [5]. Finally, in Section 5, we turn our attention to methods of Machine Learning in applications for evaluation of (massively multi-dimensional) observables for Higgs parity [6]. Section 6 closes the paper with a Summary.
2 Year 1981: KORALB Monte Carlo for at Petra/PEP energies
At the time, Poland seemed to be an isolated place, but with enormous in-flow of young talents to research. In reality a lot of contacts existed, but it was not to be seen by me. Limited, and in fact quite awkward, access to computing existed. It looked like a hopeless loss of time, but a lot of bright minds were attracted to the computing center of Jagellonian University. I had access to algebraic manipulation language: Shoonship [7] too.
One of my first projects was to evaluate the spin density matrix for the process at Petra/PEP energies [1, 8] This work was performed under the guidance of Prof. S. Jadach. This was quite an experience in looking at spin amplitudes as (reducible) representations of (Lorentzgauge) groups. It was a great opportunity, programs were queuing for execution time. We could then concentrate and understand the details of what we were actually doing. Later, in all my work, it came as an enormous benefit: how to represent complicated formulas (moderately complicated for today standards) of spin amplitudes in a compact form. We have used a tree of reference frames to obtain the goal, see Fig. 1. Instead of spin projections of individual fermions, we used differences (or sums) of such projections for incoming and/or outgoing leptons. In this way we could visualize spin properties of intermediate (formally virtual) photons. Simplifications of lowest order amplitudes, remained for bremsstrahlung as well.
Nearly all calculations were done by hand, but every step was cross checked with the help of algebraic calculation with Shoonship. This was an enormous help. We could also observe how some features of amplitudes could be identified if calculations were done by hand, but how easily these features were overlooked if we relied too much on automatization. Only if we knew what we were looking for, could we confirm the patterns with the algebraic tool. It is important to keep this in mind. Similar problems are encountered in modern times with applications of Machine Learning as well. I will return to this point in Section 5.
3 The t-channel contribution to spin amplitudes of s-channel Exclusive Exponentiation and double gluon emission in QCD.
There is a multitude of factorization schemes available for Field Theory calculations. Over years, I was checking, on particular examples, if hints for that could be identified already at the spin amplitude level. The general principle of the searches was rather simple. One tries to identify in amplitudes, gauge invariat sub-structures (parts) responsible for Matrix Element enhancements: in some regions of the pase-space (collinear-soft etc.). This is fundamental, especially from the point of view of Monte Carlo algorithm construction. Discussions with Shimizu-sensei were important for some stages of that work. The principle was to start from the complete expression, coded in a numerical program, to identify the most singular term and then group some other terms necessary to obtain gauge invariant part of spin amplitude. Then, for the remaining part, the search was repeated, until all interesting parts were identified. Starting point for that work had to be amplitudes guaranteed to be correct. In the two cases [9] and [10] expressions obtained with the help of algebraic programs were used for that purpose.
Even though in principle there were no general rules to follow, separation of amplitudes into gauge invariant parts was a straightforward and seemingly unique procedure. However I was not able to perform the task automatically. In fact, only some of the patterns appeared as a consequence of ordering singular terms. Feynman diagrams 1 and 2 of Fig. 2, combined, complete the amplitude for production, that is why they form gauge invariant part of amplitude for .
Further, because these two diagrams represent initial state QED bremsstrahlung amplitude, they must divide into parts, corresponding to of Yennie-Frautshi-Suura exponentiation [3]. This has long been known, but the question was, whether it can be expanded to other cases, to higher orders or to terms of different singularities/enhancements than in case of QED photon emissions. Unexpectedly, from my experience the answer seemed to be always “yes”. I could observe it not only in QED and QCD cases but also for example, in scalar QED [11]. I was encouraged by Prof. Shimizu-sensei to follow this path, also in case of complete electroweak effects, and for loop contributions, but so far I have not found the solution.
Instead, let me present first, single photon emission amplitude for (formula 1) and later for double gluon emission in (formula 2). For notation conventions references [9, 10] are used.
[TABLE]
Once manipulations are completed, we separate the spin amplitude into six individually QED gauge invariant parts. This is rather easy to check, replacing photon polarization vector with its four-momentum. Each of the obtained parts has a well defined physical interpretation. It is also possible to verify that the gauge invariance of each part can be preserved in the case of the extrapolation, necessary in case of QED exclusive exponentiation (see Refs. [2, 4]). Then, because of additional photons, the condition is not valid.
Let us now turn to double gluon emission amplitude, that is for the process :
[TABLE]
For the -part, we find
[TABLE]
The part proportional to other order of SU(3) group generators , is obtained by a permutation of the momenta and polarization vectors of the gluons. Each line of the above expression is individually gauge invariant. Also, this expression for double gluon emission is rather compact too.
4 The process
The main purpose of my visit to KEK MinamiTateya group in 1996, was to work on Grace spin amplitudes [5] and to prepare them for use in our KORALW Monte Carlo [12] for the fermion processes at LEP II energies. In this work, because of Monte Carlo integration, phase space regions of collinear configurations, resulted in numerical difficulties. This required careful and painful work to avoid ‘trivial’ mistakes due to rounding errors damaging gauge cancellations. On the margins of this work, kinds of fake ‘New Physics’, phenomena appeared. Let me show, for myself at first rather unexpected example, see Ref. [13]. It illustrates that the interplay of theoretical effects and selection cuts can be confusing. In two jets were requested to be lost in the beam pipe the invariant mass of the other two was monitored. If all diagrams of the lowest order Standard Model amplitudes were taken into account, a clear peak is present (Fig. 3 left side). But there is another peak at a much higher mass, which do not disappear, even if only -pair production and decay amplitudes are taken into account (see Fig. 3 right side). It is a consequence of the veto cut, and peaks, and spin correlations as explained in Ref. [13].
Such phenomena need to be included in generators used for simulation of backgrounds. Otherwise intuition may fail to recognize importance for background estimation effects such as generally small spin correlations. Cross-check with Grace spin amplitudes was helpful to confirm the origin of the phenomenon.
5 Machine Learning for Higgs parity measurement in decay
Let us now turn to another example, where complex observables need to be defined and computer techniques of the Neural Network or Machine Learning (ML) type [14, 15, 16, 17, 18] are useful or even irreplaceable. Fifteen years ago, in Ref. [19], we have proposed to measure Higgs boson parity in its decay with the help of acoplanarity angle for planes built on visible decay products for consecutive decays. A variant of the observable where impact parameter was used, is presented in [20]. In attempt to extend the method to decays, one can construct 4 or 16 such angles for each event. Each of these distributions provide some CP sensitivity, but distributions are correlated and possible backgrounds will complicate future measurements even further. Such observables of multidimensional nature may be controlled today with ML techniques. An attempt in this direction was presented in Ref. [6]. In the following, let us recall some details of this work.
The or parity information, thanks to the sign difference, can be extracted from the correlations between and transverse spin components. The decay probability
[TABLE]
is sensitive to the polarization vectors and (defined in their respective rest frames). The symbols , denote components parallel/transverse to the Higgs boson momentum as seen from the respective rest frames.
Because of the narrow width, cross-section for the process reads:
[TABLE]
With only spin indices explicit reads:
[TABLE]
The expression for can be re-written into core formula of spin algorithms of -pair production and decay:
[TABLE]
To complete explanations, we need to first recall, details of general formalism of semileptonic -lepton decays. The Matrix Element used in TAUOLA Monte Carlo [21] for semileptonic decay can be written as follows: , the current depends on the momenta of all hadrons. Then
[TABLE]
In the following we will use .
The Higgs decay probability in the formalism of Refs. [22, 23] in case when both scalar and pseudo-scalar coupling are allowed , reads
[TABLE]
The denotes the operator for the rotation by angle around the direction.
As a consequence, spin weight for the combined production and decay of tau pair reads as:
[TABLE]
Naturally, the Higgs parity should reflects itself in some kind of correlations between the and decay products in the directions transverse to the axes.
Let us recall first the case when both leptons decay to . Then h^{i}={\cal N}\Bigl{(}2(q\cdot N)q^{i}-q^{2}N^{i}\Bigr{)}, where and four momentum is build from four momenta of ’s. The denote four momentum of neutrino again in the lepton rest-frame. Because may be either positive or negative, corresponding regions of phase space would contribute cancelling out parity effects. To separate we may use , variables. The can be calculated directly from ’s energies as measured in the laboratory frame. It is enough to take the sign of the only. The sensitivity to parity can be seen in Fig. 4. Acoplanarity angle (see Fig. 5 for definition) was used. Already in this case, the observable is in principle of 3-dimensional nature.
In case of decay products, four distinct planes can be spanned on its visible decay products, thus if both and decay in this chanel, 16 acoplanarity angles can be defined, see Fig. 6, and also 8 variables similar to . The observable is thus built on 24 dimensional space. Fortunately, with the modern techniques, such as used in Ref. [6] an overall sensitivity can be evaluated. On the other hand, as it may be in general difficult to develop intuition, the risk of misinterpretation can not be ignored. Difficulties of all pattern recognition projects are well known, see Fig. 7. These challenges, were explored already by Giuseppe Arcimboldo (1572 - 1593) in his paintings.
6 Summary
Inspired by Shimizu-sensei conference, I have focused on an aspect of work for Monte Carlo generators and on phenomenology of High Energy Physics experiments related to the use of algebraic manipulation programs. My aim was to show several simple examples of challenges, resulting from complexity: how automated calculations were of help, but also sources of difficulties. Some of the important examples originate from my work in Minami Tateya group I was visiting over the last 25 years. Each example deserve substantial introduction. This was impossible for a short talk. Whenever possible, I delegate the reader to references. Inescapably, I have presented only scattered projects, where use of computer algebraic methods or pattern recognition techniques (Machine Learning) were necessary. Collecting material for my talk was inspiring and educative for myself as I could look back at particular context of my scientific activities over all these years.
\ack
This talk would not be possible without contribution and collaboration with the co-authors of the presented papers: S. Jadach, E. Richter-Was, M. Skrzypek and B. F. L. Ward. I would like to stress importance of research which was performed in collaboration with T. Ishikawa and Y. Kurihara at times when I was visiting Minanmi Tateya group of Prof. Y. Shimizu. This collaboration was essential part of my experience with algebraic manipulation programs and programs constructed with their help.
Work supported in part by funds of Polish National Science Centre under decisions UMO-2014/15/ST2/00049 and by the Research Executive Agency (REA) of the European Union under the Grant Agreement PITNGA2012316704 (HiggsTools). I would like to express gratitude to the conference organizers for the invitation.
References
- [1]
Jadach S and Was Z 1985 Comput. Phys. Commun. 36 191–211
- [2]
Jadach S, Ward B F L and Was Z 2000 Comput. Phys. Commun. 130 260–325 (Preprint hep-ph/9912214)
- [3]
Yennie D R, Frautschi S and Suura H 1961 Ann. Phys. (NY) 13 379
- [4]
Jadach S, Ward B F L and Was Z 2001 Phys. Rev. D63 113009 (Preprint hep-ph/0006359)
- [5]
Fujimoto J et al. 2003 Comput. Phys. Commun. 153 106–134 (Preprint hep-ph/0208036)
- [6]
Józefowicz R, Richter-Was E and Was Z 2016 Phys. Rev. D94 093001 (Preprint 1608.02609)
- [7]
Veltman M 1965, an IBM Programme for Symbolic Manipulation of Algebriac Expressions, Especially Feynman Diagrams (CERN, unpublished).
- [8]
Jadach S and Was Z 1984 Acta Phys. Polon. B15 1151 [Erratum: Acta Phys. Polon.B16,483(1985)]
- [9]
Was Z 2005 Eur. Phys. J. C44 489–503 (Preprint hep-ph/0406045)
- [10]
van Hameren A and Was Z 2009 Eur. Phys. J. C61 33–49 (Preprint 0802.2182)
- [11]
Nanava G and Was Z 2007 Eur. Phys. J. C51 569–583 (Preprint hep-ph/0607019)
- [12]
Jadach S, Placzek W, Skrzypek M, Ward B F L and Was Z 2001 Comput. Phys. Commun. 140 475–512 (Preprint hep-ph/0104049)
- [13]
Ishikawa T, Kurihara Y, Skrzypek M and Was Z 1998 Eur. Phys. J. C4 75–84 (Preprint hep-ph/9702249)
- [14]
LeCun Y, Bengio Y and Hinton G 2015 Nature 521 436–444
- [15]
Kingma D and Ba J 2014 arXiv preprint arXiv:1412.6980
- [16]
Ioffe S and Szegedy C 2015 arXiv preprint arXiv:1502.03167
- [17]
Srivastava N, Hinton G E, Krizhevsky A, Sutskever I and Salakhutdinov R 2014 Journal of Machine Learning Research 15 1929–1958
- [18]
Abadi M, Agarwal A, Barham P, Brevdo E, Chen Z, Citro C, Corrado G S, Davis A, Dean J, Devin M et al. 2015 Software available from tensorflow. org 1
- [19]
Bower G R, Pierzchala T, Was Z and Worek M 2002 Phys. Lett. B543 227–234 (Preprint hep-ph/0204292)
- [20]
Desch K, Was Z and Worek M 2003 Eur. Phys. J. C29 491–496 (Preprint hep-ph/0302046)
- [21]
Jadach S, Was Z, Decker R and Kuhn J H 1993 Comput. Phys. Commun. 76 361–380
- [22]
Kramer M, Kuhn J H, Stong M L and Zerwas P M 1994 Z. Phys. C64 21–30 (Preprint hep-ph/9404280)
- [23]
Desch K, Imhof A, Was Z and Worek M 2004 Phys. Lett. B579 157–164 (Preprint hep-ph/0307331)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Jadach S and Was Z 1985 Comput. Phys. Commun. 36 191–211
- 2[2] Jadach S, Ward B F L and Was Z 2000 Comput. Phys. Commun. 130 260–325 ( Preprint hep-ph/9912214 )
- 3[3] Yennie D R, Frautschi S and Suura H 1961 Ann. Phys. (NY) 13 379
- 4[4] Jadach S, Ward B F L and Was Z 2001 Phys. Rev. D 63 113009 ( Preprint hep-ph/0006359 )
- 5[5] Fujimoto J et al. 2003 Comput. Phys. Commun. 153 106–134 ( Preprint hep-ph/0208036 )
- 6[6] Józefowicz R, Richter-Was E and Was Z 2016 Phys. Rev. D 94 093001 ( Preprint 1608.02609 )
- 7[7] Veltman M 1965, an IBM Programme for Symbolic Manipulation of Algebriac Expressions, Especially Feynman Diagrams (CERN, unpublished).
- 8[8] Jadach S and Was Z 1984 Acta Phys. Polon. B 15 1151 [Erratum: Acta Phys. Polon.B 16,483(1985)]
