WP4 Logbook: Frameworks and Visualisation.

WP Manager: Peter Sherwood

Objectives:

ATLAS Validation Framework

Purpose:
Provision of a validation framework for ATLAS developers (including the HLT)
Clients:
ATLAS offline release coordinator, ATLAS analysts, ATLAS developers, ATLAS commissioning, TDAQ
Success:
Adoption of validation framework by significant communities and adoption into the release cycle
Risks:
Failure to provide robustness to the validation framework.
Instability of computing cluster(s) used to run the tests

L1 Deliverables

 DescriptionMetricDue dateReference
WP4.1.1.1 Requirement capture and preliminary design document document 01/11/04 requirements document
preliminary design document
WP4.1.1.2 Prototype framework release Code in software release 01/02/05  
WP4.1.1.3 Prototype evaluation document Document 01/09/05 RunTimeTester Prototype Evaluation
WP4.1.1.4 Heavy use of framework by the ATLAS release Packages Tested > 50% 01/04/06  
WP4.1.1.5 Heavy use of revised framework by the ATLAS release Packages Tested > 70% 01/04/07  
WP4.1.1.6 Detailed assessments of framework performance Journal publication 01/12/07  

L2 deliverables

Physics framework and tools

Purpose:
Development of fast simulation tools
Clients:
ATLAS analysts
Success:
Use by ATLAS members to perform physics analysis
Risks:
Full simulation is used exclusively to perform physics studies

We are adopting a cyclical approach, which is driven by the offline release schedule This is currently known through release 14.0.

The L2 objectives for each L1 are:

L1 Deliverables

 DescriptionMetricDue dateReference
WP4.1.2.1 Release of Framework and Tools document 30/09/04  
WP4.1.2.2 Cycle for release 12.0 code in release 16/05/06  
WP4.1.2.3 Cycle for release 13.0 code in release 21/09/06  
WP4.1.2.4 Cycle for release 14.0 code in release 06/12/06  
WP4.1.2.5 Cycle for release 1450 code in release 30/06/07  
WP4.1.2.6 Cycle for release 16.0 code in release 15/12/07  

Event Visualisation

Purpose:
ATLAS event display
Clients:
ATLAS analysts, ATLAS developers, ATLAS commissioning, TDAQ
Success:
Adoption of ATLANTIS by client communities
Risks:
Demands from various clients overwhelming the Atlantis effort

L1 Deliverables

 DescriptionMetricDue dateReference
WP4.2.1 Atlantis used in the 2004 Combined Testbeam Demonstration, event displays from CTB04 data 01/10/04 Atlantis
WP4.2.2 Use of standard protocols for online data access. Use as a web service Document 01/04/05
WP4.2.3 Event/Geometry data access: Task assessment and initial design Slides 01/07/05
WP4.2.4 Requirements analysis for interactive algorithm execution Document 01/10/05  
WP4.2.5 First implementation of data access interface Package release 12/12/05  
WP4.2.6 First prototype of mechanism for interactive algorithm execution Package release 01/04/06  
WP4.2.7 Use in ATLAS commissioning and Cosmic data taking Demonstration 01/10/06  
WP4.2.8 Final version of data access mechanism and interactive algorithm execution facility and demonstration; documentation Package release, document 01/04/07  
WP4.2.9 Evaluation with available real data Document 01/07/07  
WP4.2.10 Final release with evaluation document Package release, document 01/04/08  
WP4.3.10 Final release with evaluation document Package release, document 01/04/08  
L2 deliverables

Commentary

04Q3

Validation framework: A major step forward was the adoption by the ATLAS software management of a policy of mandatory validation and testing before releases. This was in large part prompted by the development of the Run Time Testing within this project. One of our personnel has been asked to prepare a work plan for this whole area and to consider a formal management role in its execution. As a result, the priority for the project in the UK programme has been increased, and we are particularly concerned with ensuring a robust service is in place for the collaboration. This has had implications for other areas in WP4, notably the analysis framework and tools, where our impact can be sustained with less effort.

The first deliverable for the validation activity is due in one month, being a requirement capture and design document for the broader validation service. Both aspects exist in draft, and RTT represents a major component already being deployed. This RTT prototype has consumed much of the effort, and is now providing a service for target communities. The detailed level-2 deliverables can be seen at http://www.hep.ucl.ac.uk/atlas/EScience/Planning/EScValidation.html. An important issue is the robustness of the product, and we are attempting to identify other hardware resources that can run the testing as a fail-over option. In the medium term, the testing will be done on Grid resources.

Analysis Framework and Tools: The Artemis activity now uses the AOD framework as decided upon after the Artemis workshop in April 2004. This framework is also receiving support from BNL and CERN. We have aimed our efforts (which are for the moment reduced to allow more effort in the validation area) at producing real demonstrators in physics groups, both to encourage adoption and to learn of the real features required. The primary demonstration is in the UK-lead B-physics area (Catmore, Smizanska), where example code and documentation has been produced. http://msmizans.home.cern.ch/msmizans/production/phys_analysesdc2.html The remit has been broadened to include the first use of the Grid for analyst-users, which has also been extensively documented. http://msmizans.home.cern.ch/msmizans/production/dc2/grid3.html The first deployments in both cases are workable but unsatisfactory, and a strategy and design review is planned over the next few months.

Visualisation: Most of the effort over the last six months has been focused on the support of the ATLAS Combined Testbeam (CTB). This involved developing an automatic procedure for accessing the CTB geometry from athena and implementing it into atlantis. A server-client communication between athena and atlantis was also developed, based on the standard xmlrpc protocol, for using atlantis as an online event display at the CTB control room. Atlantis is being used successfully in the CTB and there are several cases where it helped both in spotting problems in the hardware and in fixing bugs in the detector reconstruction software. Significant effort has also been devoted to user support and developers documentation, as well as in testing and following up updates in the rest of the ATLAS software.

04Q4 and 05Q1

WP4.1 Software Validation

The two L1 requirements for the period 2004-Q4 and 2005-Q1 were the requirements document, and the the prototype release. Both were delivered on time. During the prototype development work, a number of of unanticipated situations arose and were dealt with. The most important of these was the dropping of Linux Red Hat 7, which was replaced by Linux SLC3, and the necessity to run from ATLAS software distribution kits.

The transition to SLC3 was smooth. Prior to the switch over of the UCL farm, where the nightly tests are run, the validation software was run nightly in on a desktop machine with a cut down work load that was chosen to have at least one job from each type found in the full work load. The only feature required specifically by the validation software was a build of Python v2.3 on the SLC3 platform.

Until the end of 2004, the official ATLAS stance was that the release on AFS was the official release. Due to kit patching, the distributed software could, in fact, deviate from the official releases. For this reason it became clear that running from kits is necessary validator functionality. A further reason is that the kit itself needs validation as it is being constructed. A battery of small tests are run using the KitValidation package, but the full power of the validator is also required.

While gaining experience with the prototype it became clear that the validator, was very highly centralised. Complex tasks were carried out by specially written scripts. The flexibility of the software lay in its modularity, not in its control structures.

To remedy this, many variables were identified as input parameters, and were stripped out of the code and placed in configuration files, thus converting specialized functions to generic scripts. These configuration files are stored in the package they refer to, and a discovery mechanism to find which packages provide such information was set in place/ This greatly improved the validator flexibility as maintenance of the configuration files is now the responsibility of those who have the necessary expertise.

The ability to run user supplied scripts to carry out specialised tasks not handled by the validator is an important feature. Information which allows user supplied scripts is possible through the configuration files. The validator does not merely call the scripts but also passes information to the client script, allowing a two-way interaction between the validator framework and the client scripts.

The amount of information provided by the RTT can be bewildering. Jobs can be run form different job groups, with very run time needs, tests, and results files. The same jobs can be run under different circumstances including different releases, debug or optimised builds or from kits. Our efforts in making the information available in an absorbable manner are aided by user feedback from the prototype.

The validator software was reviewed under the general ATLAS Software Infrastructure review in February 2005 (see )

WP4.2 Graphics

The L1 milestones of using the Atlantis event display used in the 2004 Combined test beam (1/10/04) and documenting the use of standard protocols for online data access and use as a web service (1/04/05) were both met.

Many new features have been introduced to the ATLANTIS event display program. For the calorimetery, energy colour maps, log energy scale, and lego plots were provided. Missing Et is now displayed. Events, which were transferred through the network can be saved as XML files. A user's configuration can be saved, and used it at the following start-up. The GUI has been improved.

Online data access and Atlantis-Athena communication has been implemented using XMLRPC protocol as described at

http://www.hep.ucl.ac.uk/atlas/EScience/Documents/Visualisation/jiveXML-atlantis-communication_design.pdf

Data can be send from ATLANTIS to ATHENA allowing interactive ATHENA "driven" from ATLANTIS. Execution of ATHENA algorithms is now possible from ATLANTIS

Work to ensure ATLANTIS will be usable for ATLAS commissioning has started, and includes a first look at simulated cosmic data for inner detector, and discussions with Tile commissioning team.

A number of ATLANTIS tutorials have been given, and the next one is scheduled for the 27th May, during the ATLAS software week. We have Atlantis developer's guide, which can be seen at :

https://uimon.cern.ch/twiki/bin/view/Atlas/DetDescAndGraphicsReview/atlantis_developers_guide.pdf

and the Atlantis start-up guide, which is viewable at http://cern.ch/atlantis/docs/AtlantisStartupGuide-v12.pdf

ATLANTIS was reviewed in the ATLAS software graphics review. The proceedings of the review can be found at:

https://uimon.cern.ch/twiki/bin/view/Atlas/DetDescAndGraphicsReview

WP4.3 Physics Analysis and Tools

After discussions with the ATLAS ESRAT (Event Selection, Reconstruction and Analysis Tools) co-ordinator, it became clear that there was a need to develop tools beyond the idea of a simple event data model, and accompanying helper functions. The work at UCL on the ATLANTIS event display, in particular interaction with ATHENA provided an opportunity to provide a vital set of tools that allow physicists to visualise the reconstructed entities, in order to develop, debug and tune reconstruction and physics algorithms. Early work to exploit the XML-RPC transport mechanism put in place as part of WP4.2 were presented at the Tucson Physics Analysis Tools workshop in April 2005.

Other work being carried out in the context of the ATLAS AOD is the development of event views. An event view provides a coherent, exhaustive and mutually exclusive handling of reconstruction entities.

05Q2

WP4.1 Software Validation

Over the last quarter there has been a growing usage of the validator process. By the end of June, 12 job groups being processed for two build branches.

As part of the co-ordinated ATLAS offline testing effort, the RTT post processing tools into separate python modules which are installed as part of the offline software build process. The post processing tools now conform to a common interface. The combination of these changes allow RTT tools to be freely used by other testing framworks, and brings a greater degree of coherency to the RTT itself.

Down time on the UCL cluster used to run the validation jobs underlined two vulnerabilies of the system: jobs were not run for the period the cluster was down, and for some time afterwards the system suffered instability. The second effect was to show that more extended quality control was required. The backlog of untested - because tests were impossible during the doen time - made working conditions very difficult. The second problem has been cured by instituting a pipeline of tests of increasing completeness. When the pipeline is full, code development must stop until the backlog is cleared. This means the production system is protected, and bugs are in new code are identified and fixed far more rapidly than before.

Two approaches are being followed to solve the problem of vulnerablity to hardware failure or unavailablity: the first if to find a back-up batch system which would run the validation framework. The other approach being investigated is to run the validation jobs on the grid.

Work has been put into the web pages to improve usablity and clarity. Basic monitoring scripts have been implemented.

WP4.2 Graphics

The work in this quarter involved:

Atlantis is already being successfully used in the commissioning of the Tile Calorimeter, the first detector taking cosmic data in the ATLAS underground cavern. Atlantis pictures of cosmic events in the TileCal were shown in all the major international summer conferences and appeared in publications, such as the CERN bulletin and "Nature". Examples of event displays can be seen on Tile tests Wiki page ; The CERN bulletin front page; Nature magazine .

WP4.3 Physics Analysis and Tools The physics analysis effort is concentrating developing and extending existing areas of expertise at UCL, namely fast simulation (Atlfast) and interactive analysis using ATLANTIS. Creation to export AOD objects for Atlfast has been updated and Atlfast track objects have been added. The Atlfast jobs have been updated to use the DC2 data format. Test jobs have been added to the RTT. Modifications to Atlfast to allow the use of the WP3 comparator aer underway. Work on providing fast simulation for monopoles has started.

05Q3

WP4.1 Software Validation

The validation software has demonstrated a much higher level of stability this quarter due to the efforts made during 05Q2 to improve our self-testing protocals. The RunTimeTester still experiences residual instability due partly to failure in systems on which it depends, and, at a very low rate, due to a number of reasons apparantly due to the cluster and batch queues on which it runs. We are working to identify the causes more precisely. Work on the installation of a parallel backup version of the validation package has been started at Lancaster, to cover periods when the UCL cluster is unavailable. The number of client offline packages has ridden to 19. The success rate for running test jobs varies, put was high (approx. 80\%) at times towards the end of the quarter. The number of user provided post job tests remains low. Effort has been put into providing RTT monitoring. This enables better control of the RTT for the development team, and a better understanding of the state of software testing in ATLAS for managers, in particular the SIT co-ordinator. The RTT continues to work to improve monitoring. Internal reorganisation of how test results are stored has been performed. This was necessary to accomodate new clients who report their results via linked web pages. To improve throughput on the cluster in the 24 hour testing period (the tests are run once every 24 hours after the nightly builds have completed) have motivated us to use threading to increase parallelism in the system. As a results of the change in the operating system (CERN moved to SLC) we lost the ability to run the RTT interactively, which is vital to allow package testers to develop their tests. The reasons were identified by the RTT team - the problems included incorrect python versions for ATLAS, and incorect PyXML installation by the LCG - and interactive running has been restablished. As a further aid to testers we have set up an interactive web based configuration file tester to further aid developers to integrate their tests into the RTT.

WP4.2 Graphics

The work during this quarter has focus on the following areas:

WP4.3 Physics Analysis and Tools

Physics analysis work this quarter has centred around developing and maintaining the ATLAS fast simulation package, Atlfast. A new scheme of jobOptions has been introduced to afford easier access to new users, thus extending the user base within the collaboration. The electron and muon classes have been adapted to work effectively with the WP3 comparator tool, enabling accurate future calibrations to real data and/or full simulation. New classes have been designed and integrated to accommodate monopole production. Progress has also been made towards graphical analysis with Atlantis, the ATLAS event display. JiveXML has been successfully run on DC2 data and extra functionality has been introduced to allow AOD objects to be displayed in Atlantis.

05Q4

WP4.1 Software Validation

Increased functionality
User Control Improvements
Services
New Job Groups
Self Testing

WP4.2 Graphics

There have been three major activities during this quarter:

WP4.3 Physics Analysis and Tools

06Q1

WP4.1 Software Validation

New Functionality
Creation of scripts to automatically copy user requested data to UCL using Grid Tools
Major Changes
Transition to ATLAS project builds. Considerable structural changes due to improved understanding of the use of CMT. These also lead to marked performance improvements.
Improvements
Refactoring
New Job Groups:
User Control additions
Self Testing

WP4.2 Graphics

Intense development following requests and feedback from users has continued. The most significant items in this period have been:

During this period, the online help of atlantis was also updated and a tutorial was given during the ATLAS-UK EScience training course. Also, the atlantis-JiveXML project was presented at the CHEP-06 conference and a paper was submitted to the proceedings.

WP4.3 Physics Analysis and Tools

The work of the ATLAS Atlfast Task Force (ATF) started this quarter. The goal of the task force is to ensure that Atlfast provides is adequate for performing physics analysis. The task force is composed mainly of physicists concerned with detector performance, simuulation and reconstruction, together with the Atlfast team.

The work is being carried out in two phases. In the first phase, the parameterisations used in Atlfast are being examined and, where necessary, updated to reflect changes in detector layout and reconstruction since those developed almost ten years ago.

In particular, changes have been made to:

06Q2

WP4.1 Software Validation

WP4.2 Graphics

Work continued on supporting the InDet commissioning, interactive vertexing, improving the displaying of AOD types, improving the functionality of the GUI and the configuration of atlantis, and on training. Specifically:

WP4.3 Physics Analysis and Tools

06Q3

WP4.1 Software Validation

WP4.2 Graphics

The main activities in this quarter included:

07Q1

WP4.1 Software Validation

WP4.2 Graphics

The main activities in this quarter included:

Meetings & Papers

Konstantanidis et al. "The Atlantis event visualisation program for the ATLAS experiment" 2004

Effort Delivered

Effort funded by ATLAS E-Science

E. Nzuobontane (UCL)

QuarterValidation frameworkAnalysis frameworkVisualisationTotalGrand total
04Q2 0.0 0.0 0.0 0.0  
04Q3 1.0 0.0 0.0 1.0  
04Q4 1.0 0.0 0.0 1.0  
05Q1 1.0 0.0 0.0 1.0  
05Q2 1.0 0.0 0.0 1.0  
05Q3 1.0 0.0 0.0 1.0  

Z. Maxa (UCL)

QuarterValidation frameworkAnalysis frameworkVisualisationTotalGrand total
04Q2 0.0 0.0 1.0 1.0  
04Q3 0.0 0.0 1.0 1.0  
04Q4 0.0 0.0 1.0 1.0  
05Q1 0.0 0.0 1.0 1.0  
05Q2 0.0 0.0 1.0 1.0  
05Q3 0.0 0.0 1.0 1.0  

B Simmons (UCL)

QuarterValidation frameworkAnalysis frameworkVisualisationTotalGrand total
04Q2 0.8 0.2 0.0 1.0  
04Q3 1.0 0.0 0.0 1.0  
04Q4 1.0 0.0 0.0 1.0  
05Q1 1.0 0.0 0.0 1.0  
05Q2 1.0 0.0 0.0 1.0  
05Q3 1.0 0.0 0.0 1.0  

Qiang Lu (Birmingham)

QuarterValidation frameworkAnalysis frameworkVisualisationTotalGrand total
04Q2 0.0 0.0 0.0 0.0  
04Q3 0.0 0.0 0.0 0.0  
04Q4 0.0 0.0 0.0 0.0  
05Q1 0.0 0.0 0.7 0.7  
05Q2 0.0 0.0 1.0 1.0  
05Q3 0.0 0.0 1.0 1.0  

Effort not funded by ATLAS E-Science

QuarterValidation frameworkAnalysis frameworkVisualisationTotalGrand total
04Q2 0.5(PS) 0.5(PS) 1.3(JD, NK) 2.3  
04Q3 1.0(PS) 0.0 1.3(JD, NK) 2.3  
04Q4 1.0(PS) 0.15(JC) 0.45(NK, JT) 1.6  
05Q1 1.0(PS) 0.5(SD) 0.45(NK, JT) 1.95  
05Q2 1.0(PS) 0.5(SD) 0.45(NK, JT) 1.95  
05Q3 1.0(PS) 0.5(SD) 0.45(NK, JT) 1.95  

PSPeter Sherwood
NKNikos Konstantinidis
JDJanice Drohan
JCJames Catmore
JTJurgen Thomas
SDSimon Dean