Uploaded image for project: 'FTS'
  1. FTS
  2. FTS-1702

Destination file integrity check when file exists and Archive Monitoring is requested

    XMLWordPrintable

    Details

    • Type: New Feature
    • Status: Closed
    • Priority: Medium
    • Resolution: Fixed
    • Affects Version/s: fts 3.10.1
    • Fix Version/s: fts 3.11.0, fts-rest 3.11.0
    • Component/s: MySQL, REST API, Server, URL Copy
    • Security Level: Public Data (This ticket is visible to anyone on the internet and will be indexed by search engines)
    • Labels:
      None

      Description

      A problematic pattern in the Rucio / FTS interaction was discovered during CMS workflow when writing to TAPE:

      • If FTS encounters a problem that would lead to data being successfully copied to tape, but the transfer marked as FAILED, Rucio will keep retrying it
      • Rucio doesn't use overwrite when writing to TAPE (for good reason)
      • FTS won't be able to copy the file again because it exists already (expected behavior)

      This leaves us in a bad spot where the file will be retried continuously.

      Background
      The exact problem which lead to this behavior was the FTS process was online but not processing status messages anymore from fts_url_copy process. Eventually, another node recognized these transfers as stalled and put them inĀ FAILED file state)

      Proposal
      To get out of this loop, the proposal is to have "file-reuse" functionality when Archive
      Monitoring feature is requested:

      • Attempting a transfer and the destination file already exists
      • Verify the checksum
      • If checksum is valid, consider the transfer part complete
      • Move to Archive Monitoring

      This feature will be available only when Archive Monitoring is requested.
      For disk endpoints, it's preferable to use overwrite and recopy the file.
      However, for ape endpoints, overwriting has larger implications

      The aim of this feature is to help avoid deleting valid files from tape.

        Attachments

          Issue Links

            Activity

              People

              Assignee:
              batistal Joao Pedro Lopes
              Reporter:
              mipatras Mihai Patrascoiu
              Component Watchers:
              Votes:
              1 Vote for this issue
              Watchers:
              9 Start watching this issue

                Dates

                Created:
                Updated:
                Resolved: