Extract Metadata has inconsistent behaviour #24520

@dangarmol

I have searched the existing issues, both open and closed, to make sure this is not a duplicate report.

  • Yes

The bug

I started a fresh Immich instance with my photo library mounted read-only via NFS, the database on a local SSD, and /data on a local HDD. Most jobs seem to run fine, except for the "Extract Metadata" job. I read that there was an issue with high concurrency, so I limited its concurrency to 2.

On the first container boot, the job appeared to get stuck after a while: the number of remaining assets stopped going down. I let it run for about a day before restarting the container.
The second time I started the container, the number of assets waiting for metadata extraction grew from 0 to ~650k, although my library only has ~460k assets, which Immich correctly reports on the External Libraries page. After a while the number stopped growing, but it never started going down. I let it run for several hours and the count didn't move. I then clicked the button to clear the job: Waiting went down to 0, but Active is stuck at 2, so I can't start a new job unless I restart the container.

I haven't restarted the containers in case it can be helpful for troubleshooting.
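
To make the two symptoms above concrete, here is a minimal sketch that checks the reported counters against the library size. The `QueueSnapshot` type and `diagnose` function are hypothetical names for illustration, not part of Immich, and the figures are the approximate ones quoted in this report. It only states the arithmetic; it does not identify a cause.

```python
from dataclasses import dataclass


@dataclass
class QueueSnapshot:
    """Counters as read off the Jobs page (hypothetical container type)."""
    waiting: int
    active: int
    library_assets: int
    concurrency: int


def diagnose(snap: QueueSnapshot) -> list[str]:
    """Return human-readable findings for the two symptoms in this report."""
    findings = []

    # Symptom 1: more queued jobs than there are assets in the library.
    surplus = snap.waiting + snap.active - snap.library_assets
    if surplus > 0 and snap.library_assets > 0:
        pct = 100 * surplus / snap.library_assets
        findings.append(
            f"over-queued by {surplus} jobs ({pct:.0f}% above the library size)"
        )

    # Symptom 2: queue is empty but every worker slot still shows as active.
    if snap.waiting == 0 and snap.active >= snap.concurrency > 0:
        findings.append("all worker slots report active jobs but nothing is waiting")

    return findings


# After the second restart: ~650k waiting against ~460k assets.
print(diagnose(QueueSnapshot(waiting=650_000, active=0,
                             library_assets=460_000, concurrency=2)))
# After clearing the queue: Waiting 0, Active stuck at 2.
print(diagnose(QueueSnapshot(waiting=0, active=2,
                             library_assets=460_000, concurrency=2)))
```

With the numbers above, the surplus is about 190k jobs, roughly 41% more than the library contains.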

The OS that Immich Server is running on

Ubuntu 24.04.2 inside Proxmox VM

Version of Immich Server

v2.3.1

Version of Immich Mobile App

Irrelevant

Platform with the issue

  • Server
  • Web
  • Mobile

Device make and model

VM with 24 cores and 80GB RAM

Your docker-compose.yml content

name: immich

services:
  immich-server:
    container_name: immich_server
    image: ghcr.io/immich-app/immich-server:${IMMICH_VERSION:-release}
    # extends:
    #   file: hwaccel.transcoding.yml
    #   service: cpu # set to one of [nvenc, quicksync, rkmpp, vaapi, vaapi-wsl] for accelerated transcoding
    volumes:
      # Do not edit the next line. If you want to change the media storage location on your system, edit the value of UPLOAD_LOCATION in the .env file
      - ${UPLOAD_LOCATION}:/data
      - /etc/localtime:/etc/localtime:ro
      - syno-nfs-homes:/syno-homes:ro
    env_file:
      - .env
    ports:
      - '2283:2283'
    depends_on:
      - redis
      - database
    restart: always
    healthcheck:
      disable: false

  immich-machine-learning:
    container_name: immich_machine_learning
    # For hardware acceleration, add one of -[armnn, cuda, rocm, openvino, rknn] to the image tag.
    # Example tag: ${IMMICH_VERSION:-release}-cuda
    image: ghcr.io/immich-app/immich-machine-learning:${IMMICH_VERSION:-release}
    # extends: # uncomment this section for hardware acceleration - see https://docs.immich.app/features/ml-hardware-acceleration
    #   file: hwaccel.ml.yml
    #   service: cpu # set to one of [armnn, cuda, rocm, openvino, openvino-wsl, rknn] for accelerated inference - use the `-wsl` version for WSL2 where applicable
    volumes:
      - model-cache:/cache
    env_file:
      - .env
    restart: always
    healthcheck:
      disable: false

  redis:
    container_name: immich_redis
    image: docker.io/valkey/valkey:8@sha256:81db6d39e1bba3b3ff32bd3a1b19a6d69690f94a3954ec131277b9a26b95b3aa
    healthcheck:
      test: redis-cli ping || exit 1
    restart: always

  database:
    container_name: immich_postgres
    image: ghcr.io/immich-app/postgres:14-vectorchord0.4.3-pgvectors0.2.0@sha256:bcf63357191b76a916ae5eb93464d65c07511da41e3bf7a8416db519b40b1c23
    environment:
      POSTGRES_PASSWORD: ${DB_PASSWORD}
      POSTGRES_USER: ${DB_USERNAME}
      POSTGRES_DB: ${DB_DATABASE_NAME}
      POSTGRES_INITDB_ARGS: '--data-checksums'
      # Uncomment the DB_STORAGE_TYPE: 'HDD' var if your database isn't stored on SSDs
      # DB_STORAGE_TYPE: 'HDD'
    volumes:
      # Do not edit the next line. If you want to change the database storage location on your system, edit the value of DB_DATA_LOCATION in the .env file
      - ${DB_DATA_LOCATION}:/var/lib/postgresql/data
    shm_size: 128mb
    restart: always

volumes:
  model-cache:
  syno-nfs-homes:
    driver: local
    driver_opts:
      type: nfs
      o: addr=<redacted>,nolock
      device: :<redacted>
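
The compose file above mounts the library through an NFS-backed volume at /syno-homes. One thing that might be worth ruling out, purely as an assumption and not a confirmed cause, is slow or hanging reads on that mount. The sketch below times reading the first 64 KiB of a small sample of files; `probe_reads` is a hypothetical helper, and it would need to run wherever Python and the same NFS export are available (the Immich container may not ship Python).

```python
import os
import time
from pathlib import Path


def probe_reads(root: str, sample_size: int = 20,
                chunk: int = 64 * 1024) -> list[tuple[str, float]]:
    """Time reading the first `chunk` bytes of up to `sample_size` files under `root`."""
    timings = []
    for dirpath, _dirnames, filenames in os.walk(root):
        for name in sorted(filenames):
            if len(timings) >= sample_size:
                return timings
            path = Path(dirpath) / name
            start = time.perf_counter()
            with open(path, "rb") as fh:
                fh.read(chunk)  # only the file header, not the whole asset
            timings.append((str(path), time.perf_counter() - start))
    return timings
```

For example, `for path, secs in probe_reads("/syno-homes"): print(f"{secs:.3f}s {path}")` prints one timing per sampled file. If a read blocks on the mount, the script blocks as well, which is itself a useful signal.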

Your .env content

UPLOAD_LOCATION=/storage/immich
DB_DATA_LOCATION=./postgres

TZ=<redacted>
IMMICH_VERSION=v2

DB_PASSWORD=<redacted>

DB_USERNAME=postgres
DB_DATABASE_NAME=immich

Reproduction steps

  1. Fresh Immich install
  2. Add read-only external library (NFS) with ~460k assets
  3. Run all jobs so the library assets are added to the timeline
  4. Extract Metadata behaves erratically: it reports too many assets after a container restart, gets stuck, and its active jobs cannot be cleared

Relevant log output

I searched the log for errors but none are related to the Extract Metadata job. I'll be happy to report back if you give me something to search for in the logs or elsewhere. Thanks!
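
As a starting point for a broader log search than errors alone, here is a small sketch that filters log lines by keyword, case-insensitively. The keyword list is a guess at terms that might be relevant and is not taken from Immich's actual log format; `matching_lines` is a hypothetical helper.

```python
import re
from typing import Iterable, Iterator

# Assumed search terms; adjust to whatever the maintainers suggest.
KEYWORDS = ("metadata", "exiftool", "timeout", "stalled")


def matching_lines(lines: Iterable[str],
                   keywords: tuple[str, ...] = KEYWORDS) -> Iterator[str]:
    """Yield every line containing any keyword, ignoring case."""
    pattern = re.compile("|".join(re.escape(k) for k in keywords), re.IGNORECASE)
    for line in lines:
        if pattern.search(line):
            yield line.rstrip("\n")
```

One way to use it is to save the output of `docker logs immich_server` to a file and pass the open file object to `matching_lines`, printing each result.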

Additional information

No response

Metadata

Assignees: No one assigned
Labels: No labels
Type: No type
Project status: To triage
Milestone: No milestone
Relationships: None yet
Development: No branches or pull requests