Contact: comp-dc-operations @ belle2.org
Individual Services and Resources
Production Plans
- MC10
- MC11
- SKIM9x2, 40 TB
- GCR2c
- prod6
Production Status
Most productions have finished. New productions will be started when updated beam background files are ready.
Some small signal MC samples were submitted on Nov 11 (~06:00 JST), Nov 13, and Nov 14.
Phase 2 reprocessing with the distributed computing system (prod6b) started Nov 15 ~00:00 JST.
Large phase 3 BGx1 generic MC samples were submitted recently (~Nov 20). These should be relatively short jobs (though there are many of them). They should run for the next few weeks.
Central Services
Dirac (dirac.cc.kek.jp, b2dchsv01-b2dchsv06.cc.kek.jp, b2dchsv08.cc.kek.jp)
- Date, Issue, Tickets...
- Memory usage has rapidly increased on b2dchsv04. - BIIDCO-1545
- Network downtime - BIIDCO-1395
- The 1-min CPU load has rapidly increased and gone over the redline. - BIIDCO-1487
DB Production (b2dchdb1.cc.kek.jp, b2dchdb2.cc.kek.jp, b2dcsdb1.cc.kek.jp, b2dcsdb2.cc.kek.jp)
- Date, Issue, Tickets...
- Network downtime - BIIDCO-1395
- b2dchdb1.cc.kek.jp is down - BIIDCO-1492
DDM (bldirac01.sdcc.bnl.gov)
- BNL network interruption 2018-Dec-18 14:00-15:00 UTC - BIIDCO-1462
- DDM ReplicateAndRegister increasing Queued tasks since 2018-07-17 - BIIDCO-1175
- DDM is stalled - BIIDCO-1140
- 2018-03-01 DDM deletion task seems stuck - BIIDCO-808
Conditions DB ()
- BNL network interruption 2018-Dec-18 14:00-15:00 UTC - BIIDCO-1462
- Planning to migrate to BNL servers on May 31. The following IP addresses will be used:
192.33.128.4
192.33.128.5
192.33.128.6
192.33.128.9
192.33.128.10
192.33.128.11
Monitor
LFC
File Transfers and Replication Status
See also DDM for related issues
FTS
Any problems in the FTS service or FTS monitoring are to be recorded here. Site/SE-specific issues are to be recorded under each Site/SE.
Note that the FTS dashboard we use is an "old" instance and is not well maintained. Belle II members in general do not have access to the "new" monitoring. When the dashboard is down, shifters only need to notify the expert and skip the corresponding part of their work. The expert should then check the new monitoring, since access to that page is limited.
- FTS transfer stuck since 2018-11-25 around 7:00 UTC - BIIDCO-1458
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=138490 was submitted 2018-11-26
- 18/11/08 No activity in "Throughput" and "Successful" plots. - BIIDCO-1423
- 18/10/12 01:30 UTC Low activity in "Throughput" and "Successful" plots. - BIIDCO-1367
Replication Status
- BNL network interruption 2018-Dec-18 14:00-15:00 UTC - BIIDCO-1462
- Replication and DDM plots have not been updated since 2018-11-25 7:00 UTC - BIIDCO-1460, related to BIIDCO-1458
- 2018-10-11 Sharp drop in 'done' jobs and increase in 'waiting' - BIIDCO-1365
- 2018-09-29 The numbers have been almost zero. - BIIDCO-1339
- 2018-09-22 The number of "Done Jobs" is lower than the number of "Scheduled Jobs" during the last 6 hours or more
- 2018-07-02 No 'Done' transfers, several scheduled, and a rapid increase of Waiting replication - BIIDCO-1125
Job Status Plot
Job Summary
SEs
SE Common Issues
- Issues with individual SEs should be recorded below (Primary SEs or Other SEs).
Primary SEs
Primary SE: BNL-TMP-SE (dcblsrm.sdcc.bnl.gov)
- BNL network interruption 2018-Dec-18 14:00-15:00 UTC - BIIDCO-1462
- SRM_AUTHORIZATION_FAILURE for users - BIIDCO-1303
- UNAVAILABLE files - BIIDCO-1302
Primary SE: CESNET-TMP-SE (dpm1.egee.cesnet.cz)
Primary SE: CNAF-TMP-SE (storm-fe-archive.cr.cnaf.infn.it)
- Replication status: Increasing 'Scheduled' with zero 'done' since 2018-12-05 13:00 UTC. - BIIDCO-1473
- Continuous timeout failures between NTU-CC-TMP-SE and CNAF-TMP-SE - BIIDCO-1310
Primary SE: DESY-TMP-SE (dcache-se-desy.desy.de)
- SE Health check by DDM : download, upload do not work since 2018-12-15 21:46:02 UTC https://agira.desy.de/browse/BIIDCO-1490
- SE Health check by DDM : download, upload do not work since 2018-12-16 07:08:04 UTC.
- Date, Issue, Tickets...
Primary SE: KEK2-TMP-SE (kek2-se03.cc.kek.jp)
- File transfer failures: file transfer efficiency is too low from KEK2-TMP-SE since about 2018-12-18 1:00 (UTC) - BIIDCO-1511
- 2018-10-22 FTS transfer and upload failures have been observed since 2018-10-21 20:53:36 UTC. - BIIDCO-1388
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=137862 was submitted 2018-10-22 02:59 UTC
- Firewall issue found in user activities https://ggus.eu/index.php?mode=ticket_info&ticket_id=136643
- 2018-07-01: Scheduled jobs are increasing from about 05:00 UTC. FTS jobs fail with Timeout error GGUS-135874 - BIIDCO-1129
Primary SE: KISTI-TMP-SE (belle-se-head.sdfarm.kr)
- NOTE: This site is banned; there is no need to create a ticket related to SE Health checks
- SE Health check by DDM - BIIDCO-1364
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=137825 was submitted 2018-10-18 14:21
- No new assignment of MC production data blocks to this destination - BIIDCO-848
Primary SE: KIT-TMP-SE (dcachesrm-kit.gridka.de)
Primary SE: KMI-TMP-SE (nsrmfe01.hepl.phys.nagoya-u.ac.jp)
- Replication status: Zero 'done' with non-zero 'queued' 2018-8-24 06:30 UTC - BIIDCO-1233
Primary SE: Napoli-TMP-SE (belle-dpm-01.na.infn.it)
- SE Health check by DDM : upload does not work since 2018-11-14 19:37:18 UTC. - BIIDCO-1435
- SE Health check by DDM : checksum, download, upload do not work since 2018-09-20 05:24:11 UTC - BIIDCO-1306
Primary SE: SIGNET-TMP-SE (dcache.ijs.si)
- SE Health check by DDM : checksum, remove file, remove directory, download, upload, ls do not work since 2018-10-31 16:09:44 UTC. - BIIDCO-1407
Other SEs
Adelaide-TMP-SE (coepp-dpm-01.ersa.edu.au)
CYFRONET-TMP-SE (dpm.cyf-kr.edu.pl)
CINVESTAV-TMP-SE (jaguar-se.fis.cinvestav.mx)
- Failed file transfers observed after 7:00 UTC on 22/11/2018, ticket updated - BIIDCO-1340
- Low transfer efficiency observed at 21:00 UTC on 23/10/2018, added to the ticket - BIIDCO-1340
- Low transfer efficiency observed again after 9:00 UTC on 10/10/18; ticket updated. - BIIDCO-1340
- The problem raised by https://agira.desy.de/browse/BIIDCO-1340 seems to have been solved.
- Low transfer efficiency. - BIIDCO-1340
- FTS authentication credential error - BIIDCO-1319
Frascati-TMP-SE (atlasse.lnf.infn.it)
- FTS authentication credential error - BIIDCO-1318
GGUS ticket https://ggus.eu/?mode=ticket_info&ticket_id=137376 was submitted 2018-09-25 08:43
HEPHY-TMP-SE (hephyse.oeaw.ac.at)
IPHC-TMP-SE (sbgse1.in2p3.fr)
LAL-TMP-SE (grid05.lal.in2p3.fr)
Melbourne-TMP-SE (b2se.mel.coepp.org.au)
- Transfer rate is zero - BIIDCO-896
- Melbourne-DATA-SE banned for write - BIIDCO-927
McGill-TMP-SE (storm02.clumeq.mcgill.ca)
MPPMU-TMP-SE (grid-srm.rzg.mpg.de)
- Downtime : ARC.MPPMU.de/MPPMU-TMP-SE from 2018-11-26 06:00 to 2018-11-28 06:00 (UTC) - BIIDCO-1461
NTU-TMP-SE, NTU-CC-TMP-SE (bgrid3.phys.ntu.edu.tw, belle2grid3.cc.ntu.edu.tw)
- File transfer failure and cancellation to NTUCC-DATA-SE happened 2018-12-22 - BIIDCO-1551
- Frequent timeouts have been observed between NTU-CC-TMP-SE and CNAF-TMP-SE - BIIDCO-1310
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=137334 was submitted 2018-09-22 05:10 UTC
- NTUCC-TMP-SE banned for write - BIIDCO-1333
Pisa-TMP-SE (stormfe1.pi.infn.it)
- The problem raised by https://agira.desy.de/browse/BIIDCO-1355 seems to have been solved.
- Low Transfer Efficiency for source observed since 17:00 UTC on 7/10/18. - BIIDCO-1355
GGUS ticket: https://ggus.eu/index.php?mode=ticket_info&ticket_id=130905 was submitted at 2017-10-04 05:37 UTC. No response from site.
GGUS ticket: "File Transfer failure to stormfe1.pi.infn.it" (129865) was submitted at 07:01:38 UTC on 2017/08/01. No response from site.
PNNL-TMP-SE (se.hep.pnnl.gov)
Roma3-TMP-SE (storm-01.roma3.infn.it)
- Date, Issue, Tickets...
TAU-TMP-SE (tau-se.hep.tau.ac.il)
- Low transfer efficiency - BIIDCO-1362
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=138882
- FTS transfer failure due to authentication credential error since 2018-09-22 - BIIDCO-1317
Solved and verified 2018-10-25 : GGUS ticket https://ggus.eu/?mode=ticket_info&ticket_id=137335 was submitted at 2018-09-22 13:40 UTC
- FTS transfer failure happened as SOURCE since 2018-09-03 - BIIDCO-1248
Solved and verified 2018-09-04 : GGUS ticket https://ggus.eu/?mode=ticket_info&ticket_id=136986 was submitted at 2018-09-03
Torino-TMP-SE (se-srm-00.to.infn.it)
- File transfers from Torino-TMP-SE have been inefficient for about 4 hours, as observed at 1:30 pm (UTC) on 2018-12-21. - BIIDCO-1549
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=138968 was submitted at 2018-12-21
ULAKBIM-TMP-SE (torik1.ulakbim.gov.tr)
- File transfer failures to Napoli-TMP-SE, KEK2-TMP-SE, and BNL-TMP-SE observed from 2018-12-05 02:00 - BIIDCO-1468
GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=138882
UMiss-TMP-SE (umiss005.hep.olemiss.edu)
UVic-TMP-SE (charon01.westgrid.ca)
- File transfer failures: file transfer efficiency is too low from UVic-DATA-SE since about 2018-12-18 1:00 (UTC) - BIIDCO-1491
- File transfer failures from Source UVic-TMP-SE observed on 26 Oct 2018 - BIIDCO-1397
- FTS connection timeout from UVic to KEK (kek2-se03.cc.kek.jp) - BIIDCO-1314
Solved and verified at 2018-10-25 : GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=137332 was submitted 2018-09-22 04:28 UTC
Sites
Sites Common Issue
- DIRAC SSH sites are not being filled with jobs for MC11 - BIIDCO-1231
- Several sites: Pilot submission failures/short pilots (4x) and Software could not be installed (2x) - Common JIRA ticket issued: BIIDCO-1443
→ See below info for individual sites referring to the same JIRA ticket (BIIDCO-1443)
- Several sites: Short Pilot Jobs (17x). - BIIDCO-1484
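The per-site entries recorded on this page follow a fairly regular pattern, so cross-site counts such as "Short Pilot Jobs (17x)" could in principle be tallied with a small script. The following is only a minimal sketch: it assumes the entries are available as plain-text lines, and the names `ENTRY` and `tally_issues` are hypothetical helpers, not part of any Belle II or DIRAC tool.

```python
import re
from collections import Counter

# Matches the quoted-issue entries used on this page, for example:
#   Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/12/20.
# Entries without a quoted issue name (e.g. "Pilot submission failure") are not matched.
ENTRY = re.compile(
    r'"(?P<issue>[^"]+)"\s+has been found\s+(?P<mode>at|since)\s+'
    r'(?P<time>\d{2}:\d{2}:\d{2}) UTC on (?P<date>\d{4}/\d{2}/\d{2})'
)


def tally_issues(lines):
    """Count how often each quoted issue name appears in the given log lines."""
    counts = Counter()
    for line in lines:
        match = ENTRY.search(line)
        if match:
            counts[match.group("issue").strip()] += 1
    return counts


# Sample lines copied from the site sections of this page.
sample = [
    '- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/12/20.(details)',
    '- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/12/13.(details)',
    '- Health checker info. : "Aborted pilot jobs" has been found since 21:20:00 UTC on 2018/12/22.',
    '- Job submission check : Pilot submission failure has been found since 05:25:00 UTC on 2018/12/20.',
]
counts = tally_issues(sample)
for issue, n in counts.most_common():
    print(f"{issue}: {n}x")
```

The regular expression only covers entries that quote the issue name; unquoted entries would need a second pattern.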
ARC.DESY.de
- Job status check: Input Data Resolution issues (still 100% of the jobs) on 2018/12/22 at 8:00 UTC.
- Job status check: Input Data Resolution issues (100% of the jobs) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found since 08:20:00 UTC on 2018/12/16.(details)
- All jobs are in "Input data resolution" status since 17:00:00 UTC on 2018/12/18. - BIIDCO-1541
- 100% Jobs fail at ARC.DESY.DE - BIIDCO-1504 - BIIDCO-1486
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/12/13.(details). - BIIDCO-1518 - BIIDCO-1486
- Job submission check : Pilot submission failure has been found since 05:42:00 UTC on 2018/10/25. (details)
- Health checker info. : "Aborted pilot jobs" has been found since 16:20:00 UTC on 2018/10/23. - BIIDCO-1391
- Reconfiguration for the site queues: - BIIDCO-1392
ARC.DESY-test.de
- A test queue for the new CE. - BIIDCO-1469
- Please report any issues.
- Date, Issue, Ticket
ARC.KIT.de
- Job status check: Application finished with errors (5% of the jobs in last 24 hours) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found at 07:20:00 UTC on 2018/11/19.
- Health checker info. : "Failed pilot jobs" has been found since 20:20:00 UTC on 2018/10/20 - BIIDCO-1384
- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/10/06.(details)
ARC.LMU.de
This is a test site. There is no need to report any issues.
ARC.LMU2.de
- 2018/08/13 Downtime: Start time: 2018-08-12 13:00 (UTC) End time: 2018-08-13 15:00 (UTC) - BIIDCO-1212
- Banned as currently no resource behind the CE - BIIDCO-239
ARC.Melbourne.au
ARC.MPPMU.de
- Health checker info. : "Failed pilot jobs" has been found at 09:20:00 UTC on 2018/10/25.(details) - BIIDCO-1537
- Job submission check : Pilot submission failure has been found since 13:26:00 UTC on 2018/10/21. - BIIDCO-1386
ARC.SIGNET.si
- Job status check: Application finished with errors (5% of the jobs) at 11:15 UTC on 2018/12/21.
- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/12/20.(details)
- Job submission check : Pilot submission failure has been found at 13:26:00 UTC on 2018/12/19. - BIIDCO-1547
- "Failed to install DIRAC on " has been found since 20:20:00 UTC on 2018/11/03. - BIIDCO-1420
- "Short pilot jobs" has been found at 14:20:00 UTC on 2018/10/29. - BIIDCO-1519
- Health checker info. : "Aborted pilot jobs" has been found since 20:20:00 UTC on 2018/10/20. - BIIDCO-1383
- Health checker info. : "Failed pilot jobs" has been found since 06:20:00 UTC on 2018/10/03.(details) - BIIDCO-1350
- Health checker info. : "Belle II software could not be installed on " has been found since 17:20:00 UTC on 2018/09/23 - BIIDCO-1321
- Health checker info. : "Short pilot jobs" has been found at 22:20:00 UTC on 2018/09/14.(details) - BIIDCO-1288
- Job submission check : Pilot submission failure has been found at 05:29:00 UTC on 2018/09/15. (details) - BIIDCO-1289
- Job submission check : Pilot submission failure has been found since 01:31:00 UTC on 2018/09/08. (details) - BIIDCO-1128
CLOUD.CC1_Krakow.pl
- Not used in production yet. Seeing no jobs (no plot) is not a problem
DIRAC.Beihang.cn
- Health checker info. : "Short pilot jobs" has been found at 06:20:00 UTC on 2018/12/08. - BIIDCO-1520 - BIIDCO-1534
- Job status check: "application finished with errors" (100% currently) on 2018/10/26.
- Job submission check : Pilot submission failure has been found since 09:24:00 UTC on 2018/09/21. (details) - BIIDCO-1312
- Many MCProduction jobs failed at file upload stage for fail-over SEs 2017-12-24 - BIIDCO-647
- The number of jobs limited. - BIIDCO-289
- All the upload trials are failing against all the SEs configured: OutputSE (KMI-TMP-SE, PNNL-TMP-SE), Fail-over SEs(DESY-TMP-SE, Napoli-TMP-SE, PNNL-TMP-SE, KIT-TMP-SE)
- Large % of failed jobs in DIRAC status plot (Added 2016-11-03 22:45:00 UTC)
DIRAC.BINP.ru
- Job status check: Application finished with errors (27% of the jobs over the last 24h) at 8:00 UTC on 2018/12/22.
- Job submission check : Pilot submission failure has been found since 17:26:00 UTC on 2018/10/21. - BIIDCO-1387
- Health checker info. : "Failed to install DIRAC on " has been found at 22:20:00 UTC on 2018/09/15
DIRAC.BINP-VM.ru
- Job status check: Application finished with errors (34% of the jobs in last 24 hours) and Stalled (8%) on 2018/12/21 at 8:48 UTC.
- Job status check: "Application Finished with Errors " (episodically, 10% in total), on 2018/12/20.
- Job Status Plots "Application Finished with Errors " (100 %) on 9/10/18. https://agira.desy.de/browse/BIIDCO-1358
- Job status plots, "Application Finished With Errors" (2018-02-11 but lasting for at least a month) - BIIDCO-749
DIRAC.CINVESTAV.mx
- BIIDCO-1476 - BIIDCO-1524
- Job status plots, "Application Finished With Errors" & "Watchdog identified this job as Stalled" (2018-02-12) - BIIDCO-755
DIRAC.DESY.de
- Test site. Not in use in MC production
DIRAC.IITG.in
- Health checker info. : "Aborted pilot jobs" has been found since 21:20:00 UTC on 2018/12/22.
- Job status check: Application finished with errors (95% of the jobs over the last 24h) at 8:00 UTC on 2018/12/22.
- Health checker info. : "Short pilot jobs" has been found since 03:20:00 UTC on 2018/12/22.(details)
- Job status check: Application finished with errors (100% of the jobs) since 6:00 UTC on 2018/12/21.
- Job status check: Input Data Resolution issues (100% of the jobs) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found at 06:20:00 UTC on 2018/12/21.(details)
- All jobs are in "Input data resolution" status (2018/12/20).
- 100% Jobs fail at DIRAC.IITG.in - BIIDCO-1505
- Health checker info. : "Aborted pilot jobs" has been found at 14:20:00 UTC on 2018/12/15.
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/12/14. - BIIDCO-1521
- Job submission check : Pilot submission failure has been found at 22:23:00 UTC on 2018/12/05. - BIIDCO-1474
- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/11/02. - BIIDCO-1409
- Job submission check : Pilot submission failure has been found at 13:24:00 UTC on 2018/09/26. (details)
- Health checker info. : "Aborted pilot jobs" has been found since 14:20:00 UTC on 2018/04/22 - BIIDCO-977
DIRAC.IITH.in
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/12/14.
- Job status check: "input Data Resolution" issues (36%) on 2018/10/26.
- BIIDCO-1378
- Job submission check : Pilot submission failure has been found since 11:27:00 UTC on 2018/10/06. (details)
- Job submission check : Pilot submission failure has been found since 11:28:00 UTC on 2018/10/03. - BIIDCO-1349
DIRAC.LMU.de
DIRAC.MIPT.ru
- 100% Jobs fail - BIIDCO-1506
- Health checker info. : "Belle II software could not be installed on " has been found since 08:20:00 UTC on 2018/12/14.
- Health checker info. : "Belle II software could not be installed on " has been found at 22:20:00 UTC on 2018/12/05.
- Health checker info. : "Belle II software could not be installed on " has been found since 01:20:00 UTC on 2018/11/24.
- Health checker info. : "Short pilot jobs" has been found since 21:20:00 UTC on 2018/11/23.(details)
- Health checker info. : "Belle II software could not be installed on " has been found since 21:20:00 UTC on 2018/11/21.
- Health checker info. : "Belle II software could not be installed on " has been found at 14:20:00 UTC on 2018/11/20. - BIIDCO-1443
- Job Status Plots "Application Finished with Errors " (100 %) on 9/10/18. https://agira.desy.de/browse/BIIDCO-1358
- Health checker info. : "Aborted pilot jobs" has been found at 14:20:00 UTC on 2018/10/06.(details)
- Health checker info. : "Aborted pilot jobs" has been found at 07:20:00 UTC on 2018/09/21.(details) - BIIDCO-747
- Health checker info. : "Short pilot jobs" has been found at 20:20:00 UTC on 2018/09/13.(details)
- Health checker info. : "Short pilot jobs" has been found at 15:20:00 UTC on 2018/07/23.(details)
- Health checker info. : "Short pilot jobs" has been found at 04:20:00 UTC on 2018/07/19.
- Health checker info. : "Aborted pilot jobs" has been found at 22:20:00 UTC on 2018/07/18.
- Job status plots, "Application Finished With Errors" has been found at about 04:00:00 JST on 2018/07/06. (details)
- Health checker info. : "Aborted pilot jobs" has been found at 12:20:00 UTC on 2018/02/11 - BIIDCO-747
DIRAC.Nagoya.jp
- Job status check: Application finished with errors (5.6% of the jobs in last 24 hours) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found at 07:20:00 UTC on 2018/12/21.(details)
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/12/13.(details)
- Health checker info. : "Short pilot jobs" has been found since 19:20:00 UTC on 2018/11/15.
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/10/21.
- Health checker info. : "Short pilot jobs" has been found since 19:20:00 UTC on 2018/09/30.(details)
- Health checker info. : "Short pilot jobs" has been found since 12:20:00 UTC on 2018/08/17.(details) - BIIDCO-1227
DIRAC.Nara-WU.jp
- Job status check: Application finished with errors (11% of the jobs in last 24 hours) on 2018/12/21 at 8:48 UTC.
- Under commissioning from 2018-11-13 - BIIDCO-1432
DIRAC.NDU.jp
- Health checker info. : Short pilot jobs has been found at 22:20:00 UTC on 2018/12/12 - BIIDCO-1525
DIRAC.Niigata.jp
- BIIDCO-1510
- Health checker info. : Short pilot jobs has been found since 17:20:00 UTC on 2018/12/12. - BIIDCO-1526
- Job submission check : Pilot submission failure has been found since 12:36:00 UTC on 2018/10/17. - BIIDCO-1376
- Health checker info. : "Aborted pilot jobs" has been found at 14:20:00 UTC on 2018/10/06.(details)
DIRAC.Osaka-CU.jp
- Job submission check : Pilot submission failure has been found since 07:23:00 UTC on 2018/12/04. (details)
- Pilot submission failure has been found since 18:32:00 UTC on 2018/11/24 - BIIDCO-1434
- Health checker info. : "Short pilot jobs" has been found since 22:20:00 UTC on 2018/03/17.
→ Ask site admin to check the status 2018-03-17 10:00 JST. (DB access failure again from DIRAC.Osaka-CU.jp to PNNL from 2018-03-16 11:00 UTC)
- BIIDCO-290
DIRAC.PNNL.us
DIRAC.PNNL2.us
DIRAC.PNNL-CASCADE.us
- Seeing no jobs (no plot) is not a problem
DIRAC.PNNL-PIC.us
- Seeing no jobs (no plot) is not a problem
DIRAC.RCNP.jp
- Health checker info. : "Short pilot jobs" has been found since 06:20:00 UTC on 2018/12/14. - BIIDCO-1497 - BIIDCO-1527
- Health checker info. : "Not enough disk space on " has been found since 05:20:00 UTC on 2018/10/25. - BIIDCO-1394
- Job Status : "Job has exceeded wall clock time" : Pink Colour : (100%) on 9/10/18. - BIIDCO-1358
- Health checker info. : "Aborted pilot jobs" has been found since 12:20:00 UTC on 2018/09/08.(details)
DIRAC.SSU.kr
- BIIDCO-1543
- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/11/01. - BIIDCO-1415
DIRAC.TIFR.in
- Health checker info. : "Short pilot jobs" has been found since 13:20:00 UTC on 2018/10/22.
- Job status plots, "Application Finished With Errors" has been found at about 00:00:00 JST on 2018/07/06. (details) - BIIDCO-1132
- Health checker info. : "Short pilot jobs" -- Already reported: BIIDCO-971
- RunningLimit is set for MCProduction=1 - BIIDCO-1006
- Job stalled at input data resolution - BIIDCO-714
DIRAC.TMU.jp
- Health checker info. : "Short pilot jobs" has been found since 10:20:00 UTC on 2018/12/14. - BIIDCO-1529
- Health checker info. : "Short pilot jobs" has been found since 10:20:00 UTC on 2018/11/02 - BIIDCO-1409 - BIIDCO-1522
- Job status check: "Application finished with errors" (60%) on 2018/10/26.
- Health checker info. : "Belle II software could not be installed on " has been found since 18:20:00 UTC on 2018/10/17.
- Health checker info. : "Short pilot jobs" has been found since 01:20:00 UTC on 2018/10/15. - BIIDCO-1373
- Job submission check : Pilot submission failure has been found since 10:24:00 UTC on 2018/09/30. (details)
DIRAC.Tokyo.jp
- Date, Issue, Tickets..
DIRAC.UAS.mx
- 100% of jobs fail with errors - BIIDCO-1508
- Health checker info. : "Belle II software could not be installed on " has been found since 04:20:00 UTC on 2018/12/17. - BIIDCO-1508
- Health checker info. : "Belle II software could not be installed on " has been found since 16:20:00 UTC on 2018/11/14.
- Job submission check : Pilot submission failure has been found since 01:26:00 UTC on 2018/09/21. (details)
- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/09/17. Added to BIIDCO-1286
- Job submission check : Pilot submission failure has been found since 15:24:00 UTC on 2018/09/16 (emailed comp-dc-operations, create JIRA ticket when able)
DIRAC.UVic.ca
- GGUS ticket : "CA-VICTORIA-WESTGRID-T2 : FTS connection timeout to srm://kek2-se03.cc.kek.jp" (137332) was submitted at 04:28:09 UTC on 2018/09/22.
- Health checker info. : "Short pilot jobs" has been found since 20:20:00 UTC on 2018/10/07.(details)
- Health checker info. : "Short pilot jobs" has been found since 08:20:00 UTC on 2018/08/16.(details)
DIRAC.UVic-local.ca
- Health checker info. : "Short pilot jobs" has been found since 05:20:00 UTC on 2018/09/29.(details)
DIRAC.Yamagata.jp
- Job status check: Application finished with errors (13% of the jobs at 11:15 UTC, but 100% in the last hours) on 2018/12/21.
- Health checker info. : "Short pilot jobs" has been found since 04:20:00 UTC on 2018/12/12.(details)
- Job submission check : Pilot submission failure has been found since 01:27:00 UTC on 2018/09/16. (details) - BIIDCO-1290
- Health checker info. : "Short pilot jobs" has been found since 15:20:00 UTC on 2018/05/21.(details)
DIRAC.Yonsei.kr
- Health checker info. : "Short pilot jobs" has been found at 06:20:00 UTC on 2018/12/08.
- BIIDCO-1416
LCG.CESNET.cz
- Health checker info. : "Short pilot jobs" has been found at 07:20:00 UTC on 2018/12/21.(details)
- Job submission check : Pilot submission failure has been found at 07:21:00 UTC on 2018/11/19.
- Job submission check : Pilot submission failure has been found since 18:22:00 UTC on 2018/11/16.
- Health checker info. : "Short pilot jobs" has been found since 18:20:00 UTC on 2018/11/02. - BIIDCO-1409 - BIIDCO-1523
- Job submission check : Pilot submission failure has been found since 19:40:00 UTC on 2018/05/23. (details)
- Need some intervention to run Merge jobs - BIIDCO-771
- Job submission check : Pilot submission failure has been found since 17:34:00 UTC on 2018/05/16. (details)
LCG.CNAF.it
- Health checker info. : "Aborted pilot jobs" has been found since 04:20:00 UTC on 2018/12/13 - BIIDCO-1488
- "Short pilot jobs" has been found since 21:20:00 UTC on 2018/11/21 - BIIDCO-1455
LCG.Cosenza.it
- Health checker info. : "Short pilot jobs" has been found since 12:20:00 UTC on 2018/10/30.
- Downtime 2018-10-25 13:00 (UTC) - 2018-10-26 13:00 (UTC) and 2018-10-23 13:00 (UTC) - 2018-10-25 13:00 (UTC)
- BIIDCO-1393
- Health checker info. : "Short pilot jobs" has been found since 16:20:00 UTC on 2018/10/08.(details)
LCG.CYFRONET.pl
- Job status check: Stalled (9% of the jobs in last 24 hours) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found since 13:20:00 UTC on 2018/12/13.(details)
- BIIDCO-1246
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/10/21.
- Health checker info. : "Aborted pilot jobs" has been found at 18:20:00 UTC on 2018/09/13.(details)
- Downtime (Decommissioning cream.grid.cyf-kr.edu.pl and cream02.grid.cyf-kr.edu.pl): Start time: 2018-02-27 23:00 (UTC), End time: 2018-12-31 00:00 (UTC) - BIIDCO-694
LCG.DESY.de
- The site is to be retired - BIIDCO-1240. No more jobs are to be submitted.
- Downtime, Start time: 2018-09-01 00:00 (UTC), End time: 2018-09-30 23:59 (UTC) - BIIDCO-1276
LCG.Frascati.it
- Health checker info. : "Failed pilot jobs" has been found at 14:20:00 UTC on 2018/10/21.
- Health checker info. : "Short pilot jobs" has been found since 12:20:00 UTC on 2018/07/10.(details) - BIIDCO-1153
- Health checker info. : "Short pilot jobs" has been found since 06:20:00 UTC on 2018/06/30.(details)
LCG.HEPHY.at
- Health checker info. : Short pilot jobs has been found since 21:20:00 UTC on 2018/12/12 - BIIDCO-1532
- Health checker info. : "BLAH ERROR" has been found since 13:20:00 UTC on 2018/10/09.(details)
- Job submission check : Pilot submission failure has been found since 16:25:00 UTC on 2018/06/21. (details) - BIIDCO-1107
- MCProduction = 680 - BIIDCO-281
LCG.IPHC.fr
- Health checker info. : "Failed pilot jobs" has been found at 00:20:00 UTC on 2018/06/18.(details)
LCG.KEK.jp
- Job submission check : Pilot submission failure has been found since 05:25:00 UTC on 2018/12/20. - BIIDCO-1548
- Health checker info. : "Short pilot jobs" has been found since 17:20:00 UTC on 2018/11/12.(details) - BIIDCO-1431
- Health checker info. : "Failed pilot jobs" has been found since 06:20:00 UTC on 2018/10/10.(details)
- Health checker info. : "Belle II software could not be installed on cb512.cc.kek.jp" has been found since 21:20:00 UTC on 2018/10/01.
- Health checker info. : "Failed pilot jobs" has been found at 22:20:00 UTC on 2018/09/24.(details)
- Performance degraded with "Input data resolution" status since 2018-07-24 around 20:00 UTC - BIIDCO-1191
LCG.KEK2.jp
- Health checker info. : "Failed pilot jobs" has been found since 13:20:00 UTC on 2018/12/21. - BIIDCO-1559
- Job submission check : Pilot submission failure has been found at 06:25:00 UTC on 2018/12/20. - BIIDCO-1548
- All jobs are in "Input data resolution" status since 12:00 UTC on 2018/12/18 - BIIDCO-1542
- Health checker info. : "Short pilot jobs" has been found since 21:20:00 UTC on 2018/11/23.(details)
- Health checker info. : "Short pilot jobs" has been found since 19:20:00 UTC on 2018/11/22.(details) - BIIDCO-1450
- Health checker info. : "Failed pilot jobs" has been found since 06:20:00 UTC on 2018/10/10.(details)
- Health checker info. : "Failed pilot jobs" has been found at 06:20:00 UTC on 2018/09/28.(details)
LCG.KISTI.kr
- Job slots are disabled for SE maintenance from 2018-10-19 to 2018-10-23 - BIIDCO-1380
- Health checker info. : "BLAH ERROR" has been found since 06:20:00 UTC on 2018/10/19.(details)
- "Short pilot jobs" has been found at 06:20:00 UTC on 2018/10/09.(details)
- BLAH error seems to happen if jobs exceed the allocated number of queues; not a problem (site-specific feature)
- BIIDCO-1259
- MCProduction = 280 - BIIDCO-280
- A large number of Merge jobs in waiting status - BIIDCO-773
LCG.KMI.jp
- Job status check: Application finished with errors (7% of the jobs in last 24 hours) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/12/13.(details)
- BIIDCO-1533
- Health checker info. : "Belle II software could not be installed on pwn22.local" has been found since 21:20:00 UTC on 2018/11/22.
- Health checker info. : "Belle II software could not be installed on pwn22.local" has been found since 05:20:00 UTC on 2018/11/22.
- Job submission check : Pilot submission failure has been found since 21:24:00 UTC on 2018/10/02. (details)
LCG.LAL.fr
Site under commissioning. Issues to be reported.
LCG.Legnaro.it
- Downtime: 2018-10-16 06:30 to 2018-10-16 17:00, SE software update - BIIDCO-1372
- Health checker info. : "Failed pilot jobs" has been found since 13:20:00 UTC on 2018/03/16.(details)
- Downtime: 2018-10-29 07:00 to 2018-10-29 11:00 - BIIDCO-1400
LCG.Napoli.it
- Job submission check : Pilot submission failure has been found since 07:21:00 UTC on 2018/11/10. - BIIDCO-1267
- Health checker info. : "Short pilot jobs" has been found at 22:20:00 UTC on 2018/11/04.
- Health checker info. : "Short pilot jobs" has been found at 06:20:00 UTC on 2018/10/09.(details) - BIIDCO-1398
- Health checker info. : "Failed pilot jobs" has been found at 14:20:00 UTC on 2018/10/06.(details)
- This site is in scheduled downtime from 2018-10-02 16:00 (UTC) to 2018-10-08 18:00 (UTC) - BIIDCO-1348
- Health checker info. : "Failed pilot jobs" has been found since 16:20:00 UTC on 2018/09/27.(details)
- Health checker info. : "Failed pilot jobs" has been found since 08:20:00 UTC on 2018/09/19.(details) - BIIDCO-825
- Job submission check : Pilot submission failure has been found at 05:32:00 UTC on 2018/09/11. (details) - BIIDCO-1267
- Job submission check : Pilot submission failure has been found since 01:21:00 UTC on 2018/11/07.
- Stalled jobs - BIIDCO-1255
- "Failed pilot jobs" has been found at 14:20:00 UTC on 2018/03/17. - BIIDCO-825
LCG.NTU.tw
- Health checker info. : "Failed pilot jobs" has been found since 16:20:00 UTC on 2018/11/20.(details) - BIIDCO-1453
- Job submission check : Pilot submission failure has been found since 12:25:00 UTC on 2018/11/20. (details) - BIIDCO-1443
- Health checker info. : "CRL has expired" has been found since 08:20:00 UTC on 2018/11/04.
- BIIDCO-1430 - Solved and verified : GGUS ticket : https://ggus.eu/index.php?mode=ticket_info&ticket_id=138235
- Job submission check : Pilot submission failure has been found since 15:23:00 UTC on 2018/11/03.
- BIIDCO-1377 - Solved and verified : GGUS ticket : https://ggus.eu/?mode=ticket_info&ticket_id=138290 was submitted 2018-11-14 03:28 UTC
- Job submission check : Pilot submission failure has been found since 08:28:00 UTC on 2018/10/15. - BIIDCO-1377
- Solved and verified 2018-11-01: GGUS ticket : https://ggus.eu/index.php?mode=ticket_info&ticket_id=137827 was submitted 2018-10-18 14:39 UTC
- Job submission check : Pilot submission failure has been found since 02:26:00 UTC on 2018/10/07. (details)
- GGUS ticket : "TW-NTU-HEP : FTS transfer timeout to from/to belle2grid3.cc.ntu.edu.tw"(137334) has been submitted at 05:10:14 UTC on 2018/09/22.
- Health checker info. : "Failed to install DIRAC on node29" has been found since 16:20:00 UTC on 2018/09/24. - BIIDCO-1324
- Health checker info. : "BLAH ERROR" has been found at 18:20:00 UTC on 2018/09/15.(details)
- Health checker info. : "CRL has expired" has been found since 11:20:00 UTC on 2018/08/16. Created JIRA ticket - BIIDCO-1217
LCG.Pisa.it
- Health checker info. : "Short pilot jobs" has been found since 02:20:00 UTC on 2018/12/22.(details)
- Job status check: Application finished with errors (5% of the jobs in last 24 hours) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Short pilot jobs" has been found since 06:20:00 UTC on 2018/12/21.(details)
- 100% of jobs fail at LCG.Pisa.it - BIIDCO-1509
- Health checker info. : "Short pilot jobs" has been found since 16:20:00 UTC on 2018/12/04
- "Failed pilot jobs" has been found at 07:20:00 UTC on 2018/11/19.
- "Short pilot jobs" has been found at 07:20:00 UTC on 2018/11/19.
- Health checker info. : "Short pilot jobs" has been found since 19:20:00 UTC on 2018/11/15. - BIIDCO-1157
- Health checker info. : "Short pilot jobs" has been found since 16:20:00 UTC on 2018/11/14. - BIIDCO-1157
- Health checker info. : "Short pilot jobs" has been found at 14:20:00 UTC on 2018/11/11.
- Job submission check : Pilot submission failure has been found since 10:24:00 UTC on 2018/11/11. - BIIDCO-1315
- GGUS ticket :
- "INFN-PISA: possible issue in CA certificate directory"(136751) has been submitted at 05:26:08 UTC on 2018/08/17.
- "INFN-PISA: Disk space on WNs"(136750) has been submitted at 04:17:17 UTC on 2018/08/17.
- "INFN-PISA: CVMFS availability on WNs"(136749) has been submitted at 03:20:57 UTC on 2018/08/17.
- "INFN-PISA : Pilot failed at gridce3.pi.infn.it"(130815) has been submitted at 10:11:45 UTC on 2017/09/28.
- Health checker info. : "Short pilot jobs" has been found since 05:20:00 UTC on 2018/11/06.
- Health checker info. : "Short pilot jobs" has been found since 03:20:00 UTC on 2018/10/31. - BIIDCO-1157
- Health checker info. : "Short pilot jobs" has been found since 22:20:00 UTC on 2018/10/29.(details)
- Health checker info. : "Short pilot jobs" has been found at 07:20:00 UTC on 2018/10/27.(details)
- Health checker info. : "Short pilot jobs" has been found since 21:20:00 UTC on 2018/10/25.(details)
- Health checker info. : "Short pilot jobs" has been found since 03:20:00 UTC on 2018/10/24.
- BIIDCO-1157
- Health checker info. : "Short pilot jobs" has been found at 15:20:00 UTC on 2018/10/23.
- Health checker info. : "Short pilot jobs" has been found since 19:20:00 UTC on 2018/10/22.
- Health checker info. : "Short pilot jobs" has been found since 12:20:00 UTC on 2018/10/21.
- Health checker info. : "Short pilot jobs" has been found since 18:20:00 UTC on 2018/10/20.(details)
- Health checker info. : "Failed pilot jobs" has been found since 06:20:00 UTC on 2018/10/09.(details)
- Health checker info. : "Short pilot jobs" has been found at 06:20:00 UTC on 2018/10/02.(details)
- "Failed pilot jobs" has been found since 21:20:00 UTC on 2018/09/22.(details)
- "Short pilot jobs" has been found since 02:20:00 UTC on 2018/09/21.(details)
- BIIDCO-1157
- Possible issue in CA certificate directory on WN se1wn26.pi.infn.it - BIIDCO-1220
- "Not enough disk space on <various servers>" has been found since <various times> UTC on 2018/08/12 onwards. - BIIDCO-1211
- Failed to install DIRAC on ... - BIIDCO-1152
- "BLAH ERROR" and "Aborted pilot jobs" have been found since 03:20:00 UTC on 2018/06/20.(details) - BIIDCO-1106; related GGUS ticket : "INFN-PISA : Pilot failed at gridce3.pi.infn.it"(130815)
LCG.Roma3.it
- Job status check: Application finished with errors (25% of the jobs in last 24 hours) and Stalled (36%) on 2018/12/22 at 8:00 UTC.
- Health checker info. : "Failed pilot jobs" has been found since 13:20:00 UTC on 2018/12/21.
- Job status check: Application finished with errors (10% of the jobs in last 24 hours) and Stalled (76%) on 2018/12/21 at 8:48 UTC.
- Health checker info. : "Failed pilot jobs" has been found at 07:20:00 UTC on 2018/12/21.(details)
- Stalled jobs on 2018/12/20.
- Health checker info. : "Failed pilot jobs" has been found at 06:20:00 UTC on 2018/12/08. - BIIDCO-1538
- Health checker info. : "Aborted pilot jobs" has been found at 16:20:00 UTC on 2018/10/09.(details)
- "BLAH ERROR" has been found at 04:20:00 UTC on 2018/06/20. - BIIDCO-1106
- Roma3 commissioning - BIIDCO-111 (NOTE: This ticket seems obsolete; it should be closed and removed from the operation status)
LCG.TAU.il
- Health checker info. : "Short pilot jobs" has been found at 22:20:00 UTC on 2018/11/01.
- Downtime - Start time: 2018-09-26 01:00(UTC), End time: 2018-09-28 20:00(UTC) - BIIDCO-1328
- Health checker info. : "Failed pilot jobs" has been found at 18:20:00 UTC on 2018/09/14.(details)
LCG.Torino.it
- Health checker info. : "BLAH ERROR" has been found since 21:20:00 UTC on 2018/12/06.
- Health checker info. : "BLAH ERROR" has been found since 03:20:00 UTC on 2018/11/23. - BIIDCO-1451
- Health checker info. : "BLAH ERROR" has been found at 23:20:00 UTC on 2018/11/22.(details)
- Health checker info. : "Failed pilot jobs" has been found since 13:20:00 UTC on 2018/11/11.
- Health checker info. : "Failed pilot jobs" has been found at 22:20:00 UTC on 2018/11/07.
- Job submission check : Pilot submission failure has been found at 22:22:00 UTC on 2018/11/04.
- Health checker info. : "Failed pilot jobs" has been found since 13:20:00 UTC on 2018/11/04
- Health checker info. : "Failed pilot jobs" has been found since 20:20:00 UTC on 2018/11/01. - BIIDCO-1417
- Health checker info. : "Failed pilot jobs" has been found since 17:20:00 UTC on 2018/10/31. - BIIDCO-252
- Health checker info. : "BLAH ERROR" has been found since 21:20:00 UTC on 2018/10/22.
LCG.ULAKBIM.tr
- "BLAH ERROR" has been found since 20:20:00 UTC on 2018/12/24
- BIIDCO-1558 - GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=138991 was submitted at 2018-12-25 03:06 UTC
- Solved and verified 2018-11-02 : GGUS ticket : "TR-10-ULAKBIM : pilot job failure by CRL expiration error"(136985) has been submitted at 01:35:16 UTC on 2018/09/03.
- Health checker info. : "BLAH ERROR" has been found since 12:20:00 UTC on 2018/07/06.(details) - BIIDCO-1091
OSG.BNL.us
- Health checker info. : "Short pilot jobs" has been found since 15:20:00 UTC on 2018/12/21.(details)
- Health checker info. :
- Health checker info. : "Short pilot jobs" has been found since 18:20:00 UTC on 2018/12/18.
- Health checker info. : "Short pilot jobs" has been found since 07:20:00 UTC on 2018/12/14.
- Health checker info. : "Short pilot jobs" has been found since 02:20:00 UTC on 2018/12/11.
- BNL network interruption 2018-Dec-18 14:00-15:00 UTC - BIIDCO-1462
- Health checker info. : "Short pilot jobs" has been found since 13:20:00 UTC on 2018/07/13.(details)
- BIIDCO-1164
- Recurring issue 2018/11/23 through 09/30
- Health checker info. : "Aborted pilot jobs" has been found since 09:20:00 UTC on 2018/09/19.(details)
- BIIDCO-950
- Recurring issue 2018/11/04 through 09/23
- Production jobs: UNAVAILABLE files - BIIDCO-1302
- User jobs: SRM_AUTHORIZATION_FAILURE - BIIDCO-1303
- Number of concurrent MCProduction jobs restricted - BIIDCO-1256
- MCProduction jobs are mostly stalled - BIIDCO-1253
OSG.CORI.us
- OSG.CORI.us resource has been removed because CY18 allocation was not approved
OSG.UMiss.us
- Health checker info. : "Aborted pilot jobs" has been found since 01:20:00 UTC on 2018/12/22.(details)
- BIIDCO-1550 - GGUS ticket https://ggus.eu/index.php?mode=ticket_info&ticket_id=138979 was submitted at 2018-12-22 14:45 UTC.
- Health checker info. : "Short pilot jobs" has been found since 22:20:00 UTC on 2018/12/14.
- Health checker info. : "Short pilot jobs" has been found since 18:20:00 UTC on 2018/11/02. - BIIDCO-1142
SSH.KMI.jp
- Job status check: Application finished with errors (12% of the jobs in last 24 hours) on 2018/12/22 at 11:30 UTC.
- Health checker info. : "Short pilot jobs" has been found at 20:20:00 UTC on 2018/08/13.
VCYCLE.Napoli.it, VCYCLE.HNSC01.it, VCYCLE.HNSC02.it
- Opportunistic site (Empty plot is not a problem)
Links
- Computing ShiftManual
- DistributedComputingSite