Planet4 / PLANET-4987

Prevent sync of non-pdf attachments to ElasticSearch

    Details

    • Type: Task
    • Status: CLOSED
    • Priority: Should have
    • Resolution: Released
    • Affects Version/s: None
    • Fix Version/s: 2.34.2
    • Labels:
    • Story Points: 2
    • Sprint: Sprint #132, Sprint #133, Sprint #134, Sprint #135, Sprint #136, Sprint #137, Sprint #138, Sprint #139
    • Section: Search
    • P4 site: All sites
    • Track: Development
    • P4 Test Environment: atlas
    • Repositories: planet4-master-theme

      Description

      The search query already filters out all attachments that are not PDFs. However, the ElasticPress plugin still syncs these attachments to the ES cluster, even though they are never used. This increases the size of the ES index, which affects query performance. The size issue is aggravated by the fact that attachments sometimes contain a lot of duplicate garbage data.

      Probably worse, it slows down ES syncing. We want to avoid that because ES is unavailable during a manual sync and search falls back to MySQL search, which is slow and breaks a couple of things.

      Tasks

      • Investigate whether non-PDF attachments can be skipped during the sync
      • Check how the ElasticPress plugin handles other MIME types
      • If it's an easy solution that fits in the estimated time, please implement it; otherwise open a follow-up ticket
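
      The rule the tasks above aim for can be sketched as a small predicate: sync every non-attachment post, and sync an attachment only when it is a PDF. This is an illustrative sketch in Python, not the plugin's PHP code or its API; the `Post` class, `should_sync`, and `filter_for_sync` are hypothetical names, and it assumes PDFs are identified by the `application/pdf` MIME type.

```python
from dataclasses import dataclass
from typing import Iterable, List, Optional

# Assumption: PDFs are identified by this MIME type.
PDF_MIME_TYPE = "application/pdf"


@dataclass(frozen=True)
class Post:
    """Hypothetical stand-in for a post record about to be synced."""
    post_id: int
    post_type: str
    mime_type: Optional[str] = None


def should_sync(post: Post) -> bool:
    """Return True if the post should be sent to the search index."""
    # Non-attachment content (posts, pages, ...) is always synced.
    if post.post_type != "attachment":
        return True
    # Attachments are synced only when they are PDFs.
    return post.mime_type == PDF_MIME_TYPE


def filter_for_sync(posts: Iterable[Post]) -> List[Post]:
    """Keep only the posts that pass the sync rule, preserving order."""
    return [post for post in posts if should_sync(post)]


if __name__ == "__main__":
    candidates = [
        Post(1, "post"),
        Post(2, "attachment", "application/pdf"),
        Post(3, "attachment", "image/jpeg"),
    ]
    print([post.post_id for post in filter_for_sync(candidates)])  # [1, 2]
```

      Applying the rule at sync time rather than only at query time is what keeps the skipped attachments out of the index, which is the size and sync-duration concern raised in the description.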

              People

              Assignee: Pieter Vincent (pvincent)
              Reporter: Pieter Vincent (pvincent)
