VictoriaMetrics

mirror of https://github.com/VictoriaMetrics/VictoriaMetrics.git synced 2026-05-17 08:36:55 +03:00

Author	SHA1	Message	Date
andriibeee	2bb03f6e34	lib/storage, lib/mergeset: properly account inmemoryPart refCount Previously inmemoryPart refCount was not properly decremented. Previous behavior: * createInmemoryPart called newPartWrapperFromInmemoryPart and returns a partWrapper with refCount=1 * multiple parts are merged in mustMergeInmemoryPartsFinal, which creates a new merged part * the source partWrappers are never decRef'd * Since refCount never reaches 0, putInmemoryPart and (*part).MustClose are never called This commit properly decrements refCount at mustMergeInmemoryPartsFinal. Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10086	2026-03-17 10:54:08 +01:00
Aliaksandr Valialkin	804d77ffc5	all: run `go fix -reflecttypefor`	2026-02-19 14:05:06 +01:00
Aliaksandr Valialkin	84dc5453ad	all: run `go fix -minmax`	2026-02-18 19:24:27 +01:00
Aliaksandr Valialkin	809f9471df	all: run `go fix -fmtappendf`	2026-02-18 18:21:02 +01:00
Aliaksandr Valialkin	5ea7314912	lib/mergeset: run `go fix -rangeint`	2026-02-18 14:28:29 +01:00
Aliaksandr Valialkin	2e9bda2bff	lib/{mergeset,storage}: add a comment explaining why the strange construct with anonymous function is needed This is a follow-up for the commit `2a0e382a99` Updates https://github.com/VictoriaMetrics/VictoriaLogs/issues/1020	2026-01-27 19:44:49 +01:00
Aliaksandr Valialkin	e35a9a366c	all: consistently use sync.WaitGroup.Go() instead of sync.WaitGroup.Add(1) + sync.WaitGroup.Done() This improves code readability a bit.	2026-01-27 00:29:47 +01:00
Phuong Le	2a0e382a99	lib/storage, lib/mergeset: avoid deadlock on panic while merging Related to https://github.com/VictoriaMetrics/VictoriaLogs/issues/1020#issuecomment-3763912067	2026-01-20 21:43:12 +01:00
Nikolay	2056e5b46d	lib/mergeset: do no cache inmemoryBlock with single item indexDB mergeset has an edge for single item inmemoryBlock. It stores such items blocks in-memory at blockheader firstItem. So there is no need to perform on-disk read operations and storing copy of it at cache. It also may result in incorrect search results, inmemoryBlock with a single item has always zero index block offset. Which causes collisions if it's cached with the next index block at part. Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10239 Probably fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10063	2026-01-15 12:12:08 +01:00
Cancai Cai	a244750bc6	doc/table: fix typo (#10243 ) ### Describe Your Changes Please provide a brief description of the changes you made. Be as specific as possible to help others understand the purpose and impact of your modifications. ### Checklist The following checks are mandatory: - [ ] My change adheres to [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/victoriametrics/contributing/#pull-request-checklist). - [ ] My change adheres to [VictoriaMetrics development goals](https://docs.victoriametrics.com/victoriametrics/goals/). Signed-off-by: cancaicai <2356672992@qq.com>	2026-01-05 21:42:01 +02:00
Artem Fetishev	f97f627f79	lib/storage: implement partition index (#8134 ) This should reduce disk space occupied by indexDBs as they get deleted along with the corresponding partitions once those partitions become outside the retention window. - Motivation: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7599 - What to expect: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8134 Signed-off-by: Artem Fetishev <rtm@victoriametrics.com> Co-authored-by: Andrei Baidarov <baidarov@nebius.com>	2025-12-24 18:53:49 +01:00
Artem Fetishev	85367cae38	Idb blockcache metrics unittest (#10050 ) indexDB has 3 block caches. These caches export metrics. Storage collects these metrics for each indexDB it has (currently prev and curr only). There is a potential problem: - These caches are shared by all indexDBs - Each indexDB reports the block cache metrics. - Storage collects the metrics of all indexDBs by adding them together. I.e. it is possible to count block cache metrics several times. It is not the case in current implementation because the addition of the metrics is not performed intentionally. The added unit test 1) demonstrates that the resulting counts are reported correctly and 2) protects from future unintentional changes in this behavior. Additionally a code comment is added to explain why block cache metrics are not summed up. Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>	2025-12-06 18:14:52 +01:00
Aliaksandr Valialkin	9725ee50ec	lib/mergeset: verify that Table parts are no longer used at Table.MustClose() This should catch possible errors related to improper release of Table parts. Fix such an error at TestTableCreateSnapshotAt by properly closing all the initialized TableSearch instances. Thanks to @rtm0 for pointing to this issue.	2025-11-05 13:25:16 +01:00
Aliaksandr Valialkin	7967ad661e	lib/fs: remove fsync for the parent directory from MustMkdirIfNotExist(), MustMkdirFailIfExist(), MustHardLinkFiles() and MustCopyDirectory() This allows performing a single MustFsyncPath() for the parent directory after multiple calls to these functions. This clarifies code paths, which call these functions, and makes them more maintainable. This also removes a redundant fsync() call for the parent directory when creating a file-based part. Previously the first fsync() was indirectly called when the directory was created via MustMkdirFailIfExist() and the second fsync() was called via MustSyncPathAndParentDir() after all the data is written to the part.	2025-08-30 01:53:54 +02:00
Aliaksandr Valialkin	65251c8fe2	lib/{mergeset,storage}: open files inside parts in parallel This should reduce the time needed for opening the parts on high-latency storage systems such as NFS or Ceph. Updates https://github.com/VictoriaMetrics/VictoriaLogs/issues/517	2025-07-28 13:41:02 +02:00
Aliaksandr Valialkin	8e155da0ac	lib/{mergeset,storage}: close in parallel part files opened for reading This should reduce the time needed for closing the part on high-latency storage systems such as NFS or Ceph. Updates https://github.com/VictoriaMetrics/VictoriaLogs/issues/517	2025-07-27 19:23:55 +02:00
Aliaksandr Valialkin	cc58d659c3	lib/{fs,filestream}: move lib/filestream.MustCloseWritersParallel() to lib/fs.MustCloseParallel() The lib/fs.MustCloseParallel() accepts a slice of MustWriter items, which must implement only a single method - MustWrite(). The previous lib/filestream.MustCloseWritersParallel() was accepting CloseWriter items, which must implement Write() and Path() methods additionally to MustClose() method. This was adding artificial restrictions on the applicability of the MustCloseWritersParallel() method. Remove these restrictions.	2025-07-27 19:12:34 +02:00
Aliaksandr Valialkin	a05e4cf67b	lib/{mergeset,storage}: store files inside in-memory parts to the persistent storage in parallel This should reduce the time needed for converting in-memory parts to file-based parts on high-latency storage systems such as NFS or Ceph. Updates https://github.com/VictoriaMetrics/VictoriaLogs/issues/517	2025-07-27 18:19:22 +02:00
Aliaksandr Valialkin	3c7c3a5b0d	lib/{mergeset,storage}: flush and close files inside the newly created parts in parallel This should reduce the time needed for closing the newly created parts on high-latency storage systems such as NFS or Ceph. This should help https://github.com/VictoriaMetrics/VictoriaLogs/issues/517	2025-07-27 17:51:56 +02:00
Aliaksandr Valialkin	83da33d8cf	lib/fs: simplify the code for directory removal and make it compatible with object storage (S3) and NFS - Drop the code needed for asynchronous removal of the directory on NFS shares. This code was needed when VictoriaMetrics could keep open files after their deletion or renaming. This is no longer the case after the commit `43b24164ef` . Now files are deleted only after all the readers close them. This updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/61 - Unify MustRemoveAll() and MustRemoveDirAtomic() into MustRemoveDir() and MustRemovePath() functions: - The MustRemoveDir() deletes the given directory with all its contents, in an "atomic" way: it creates a special `.delete-this-dir` file in the directory, then removes all its contents except of this file, and later removes the `.delete-this-dir` file together with the directory itself. This makes possible easily determining whether the given directory needs to be deleted after unclean shutdown - if it contains the `.delete-this-dir` file or if it is empty, it must be deleted. Add IsPartiallyRemovedDir() function, which can be used for detecting whether the given directory must be removed at starup. Previously the MustRemoveDirAtomic() was using a "trick" for atomic directory removal: it was "atomically" renaming the directory to a temporary directory with '.must-remove.' marker in the directory name, and after that it was removing the renamed directory. On startup all the directories with the `.must-remove.` marker were deleted if they are left after unclean shutdown. This "trick" doesn't work for NFS and object storage such as S3, since these storage systems do not support atomic renaming of directories with multiple entries inside. The new MustRemoveDir() function doesn't use this "trick", so it can be safely used in NFS and S3-like storage systems. This is based on the pull request from @func25 - https://github.com/VictoriaMetrics/VictoriaMetrics/pull/9486/files . - The MustRemovePath() deletes the given file or an empty directory. - Delete the existing parts and partitions at startup if they were partially deleted. - Consistently use fs.MustRemoveDir() and fs.MustRemovePath() instead of os.RemoveAll() across the codebase. This reduces the amounts of bolierplate code related to error handling. - Consistently use fs.MustWriteSync() instead of os.WriteFile() across the codebase.	2025-07-25 19:54:03 +02:00
Aliaksandr Valialkin	9d9bea0348	lib/{storage,mergeset}: make sure the newly created data part is visible in the parent directory before storing it in parts.json The newly created data part could become missing after unclean shutdown (such as hardware power off), since the contents of the parent directory wasn't synced to disk before storing the newly created data part in the parts.json file. Fix this by syncing the parent directory contents before storing the newly created part in the parts.json file. This commit is based on https://github.com/VictoriaMetrics/VictoriaLogs/pull/507	2025-07-18 14:56:06 +02:00
Nikolay	eb1164278e	lib/mergeset: reduce memory allocations on blockcache misses This commit adds tmp inmemory and data blocks buffers for index search requests. It allows to reduce memory allocations on block cache misses. Since block cache puts block into cache only on after configured number of cache misses. Related PR https://github.com/VictoriaMetrics/VictoriaMetrics/pull/9324	2025-07-18 10:47:30 +02:00
Alexander Frolov	f9aa74c367	lib/mergeset: misuse of unsafe.Pointer It appears the `lib/mergeset.Item` violates the 3rd rule https://pkg.go.dev/unsafe#Pointer > (3) Conversion of a Pointer to a uintptr and back, with arithmetic. > > Unlike in C, it is not valid to advance a pointer just beyond the end of its original allocation: > ``` > // INVALID: end points outside allocated space. > b := make([]byte, n)` > end = unsafe.Pointer(uintptr(unsafe.Pointer(&b[0])) + uintptr(n)) > ``` `CGO_ENABLED=0 go test -trimpath -gcflags=all=-d=checkptr -run TestTableAddItemsSerial` ``` fatal error: checkptr: pointer arithmetic result points to invalid allocation This commit adds a special path to item encoding, that prevents invalid memory references. See this PR for details: https://github.com/VictoriaMetrics/VictoriaMetrics/pull/9264	2025-06-30 16:56:28 +02:00
Aliaksandr Valialkin	539498058e	lib/atomicutil: add CacheLineSize const equal to the size of CPU cache line, and use this const for padding against false sharing across the code base This should reduce the waste of memory on the padding from 128 bytes to 64 bytes on GOARCH=amd64, while preserving bigger padding for platforms with bigger cache line sizes. See https://stackoverflow.com/questions/68320687/why-are-most-cache-line-sizes-designed-to-be-64-byte-instead-of-32-128byte-now Thanks to @tIGO for the hint	2025-06-06 10:21:40 +02:00
Aliaksandr Valialkin	23fd269ccf	lib/{storage,mergeset}: reduce the multi-CPU contention on global stats vars, which are updated during background merge Background merge updates the global stats on the number of merged / deleted items. This may result in slowdown when multiple goroutines update these global stats at frequent rate, since every goroutine must fetch the actual value for the updated stats from slow memory on every update. It is much faster to count the needed stats locally per every goroutine and then periodically updating the global stats (once per ~second). Thanks to @tIGO for the intial implementation of this idea at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/8683/files#diff-95e28ae911944708f94f3bb31fa9ba8bc185dedc23ae6fb02a272c34b8f83244 This should help improving scalability of background merges on multi-CPU systems. See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8682	2025-06-05 12:24:03 +02:00
Aliaksandr Valialkin	1f5d02e059	lib: make sure that frequently updated global counters are padded in order to protect from false sharing issues on multi-CPU systems Go linker packs global variables close to each other in the memory. This may lead to false sharing (https://en.wikipedia.org/wiki/False_sharing) among these variables if frequently updated vars are put close to mostly read-only vars like described at https://github.com/VictoriaMetrics/VictoriaMetrics/issues/8682 . This commit adds padding to frequently updated global vars. This guarantees that these variables are put into distinct CPU cache lines comparing to the rest of global variables. See https://github.com/VictoriaMetrics/VictoriaMetrics/pull/8683#issuecomment-2943254119 Thanks to @tIGO for the intial attempt to fix the issue at https://github.com/VictoriaMetrics/VictoriaMetrics/pull/8683	2025-06-05 11:40:20 +02:00
Artem Fetishev	06c26315df	lib/storage: test wasMetricIDsMissingBefore with "testing/synctest" (#8740 ) Using this package lets to manipulate time. In this particular case, it lets to advance the time 61 second forward instantly. A few side changes were necessary: - Do not use fasttime in unit tests. The fasttime package starts a goroutine outside the test bubble which causes the clock to be real, not fake. - Stop the time.Ticker explicitly and also stop idbNext. These two create goroutines with infinite loops which causes the unit tests that use synctest to hang forever. All goroutines created inside the bubble must exit in order for the syntest to finish. - synctest is an experimental package and requires an environment variable to be set. The Makefile was changed to set it. Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>	2025-04-17 16:58:47 +02:00
Artem Fetishev	2f0796ff40	lib/storage: When creating and listing snapshots, panic instead of returning an error (#8585 ) When creating and listing snapshots, panic instead of returning an error since errors are not recoverable anyway. Also do not cleanup the filesystem on panic. Leave as is for further manual inspection. Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>	2025-04-02 15:47:23 +02:00
Dan Dascalescu	0a49d8c930	chore: minor grammar fix in error messages (#8580 ) ### Describe Your Changes `its'` -> `its` ### Checklist The following checks are mandatory: - [x] My change adheres [VictoriaMetrics contributing guidelines](https://docs.victoriametrics.com/contributing/).	2025-03-27 10:21:52 +01:00
Artem Fetishev	9537594971	lib/{mergeset,storage}: Update MustClose() method comments with the condition then the method must be called (#8568 ) Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>	2025-03-25 14:29:04 +01:00
Guillem Jover	76d205feae	spelling and grammar fixes via codespell (#8497 ) ### Describe Your Changes Fix many spelling errors and some grammar, including misspellings in filenames. The change also fixes a typo in metric `vm_mmaped_files` to `vm_mmapped_files`. While this is a breaking change, this metric isn't used in alerts or dashboards. So it seems to have low impact on users. The change also deprecates `cspell` as it is much heavier and less usable. --------- Co-authored-by: Andrii Chubatiuk <achubatiuk@victoriametrics.com> Co-authored-by: Andrii Chubatiuk <andrew.chubatiuk@gmail.com>	2025-03-17 16:32:10 +01:00
Aliaksandr Valialkin	13ff9a8ebd	lib/{mergeset,storage,logstorage}: use chunked buffer instead of bytesutil.ByteBuffer as a storage for in-memory parts This commit adds lib/chunkedbuffer.Buffer - an in-memory chunked buffer optimized for random access via MustReadAt() function. It is better than bytesutil.ByteBuffer for storing large volumes of data, since it stores the data in chunks of a fixed size (4KiB at the moment) instead of using a contiguous memory region. This has the following benefits over bytesutil.ByteBuffer: - reduced memory fragmentation - reduced memory re-allocations when new data is written to the buffer - reduced memory usage, since the allocated chunks can be re-used by other Buffer instances after Buffer.Reset() call Performance tests show up to 2x memory reduction for VictoriaLogs when ingesting logs with big number of fields (aka wide events) under high speed.	2025-03-15 20:58:33 +01:00
Aliaksandr Valialkin	bc69d5f1a4	lib/mergeset: explicitly pass the interval for flushing in-memory data to disk at MustOpenTable() This allows using different intervals for flushing in-memory data among different mergeset.Table instances. The initial user of this feature is lib/logstorage.Storage, which explicitly passes Storage.flushInterval to every created mereset.Table instance. Previously mergeset.Table instances were using 5 seconds flush interval, which didn't depend on the Storage.flushInterval. Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/4775	2025-02-24 15:23:50 +01:00
Nikolay	e9f86af7f5	lib/storage: add a hint for merge about type of parts in merge (#7998 ) Hint allows to choose type of cache to be used for index search: - in-memory parts are storing recently ingested samples and should use main cache. This improves ingestion speed and cache hit ration for queries accessing recently ingested samples. - merges of file parts is performed in background, using a separate cache allows avoiding pollution of the main cache with irrelevant entries. Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7182 --------- Signed-off-by: f41gh7 <nik@victoriametrics.com>	2025-01-10 16:01:39 +04:00
Zakhar Bessarab	9f9cc24e4c	Revert "lib/mergeset: add sparse indexdb cache (#7269 )" This reverts commit `837d0d136d`.	2024-11-04 10:29:14 -03:00
Zakhar Bessarab	837d0d136d	lib/mergeset: add sparse indexdb cache (#7269 ) Related issue: https://github.com/VictoriaMetrics/VictoriaMetrics/issues/7182 - add a separate index cache for searches which might read through large amounts of random entries. Primary use-case for this is retention and downsampling filters, when applying filters background merge needs to fetch large amount of random entries which pollutes an index cache. Using different caches allows to reduce effect on memory usage and cache efficiency of the main cache while still having high cache hit rate. A separate cache size is 5% of allowed memory. - reduce size of indexdb/dataBlocks cache in order to free memory for new sparse cache. Reduced size by 5% and moved this to a separate cache. - add a separate metricName search which does not cache metric names - this is needed in order to allow disabling metric name caching when applying downsampling/retention filters. Applying filters during background merge accesses random entries, this fills up cache and does not provide an actual improvement due to random access nature. Merge performance and memory usage stats before and after the change: - before ![image](https://github.com/user-attachments/assets/485fffbb-c225-47ae-b5c5-bc8a7c57b36e) - after ![image](https://github.com/user-attachments/assets/f4ba3440-7c1c-4ec1-bc54-4d2ab431eef5) --------- Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-10-24 15:21:17 +02:00
hagen1778	1154f90d2d	lib/mergeset: fix typos in comments Signed-off-by: hagen1778 <roman@victoriametrics.com>	2024-08-07 15:54:15 +02:00
Aliaksandr Valialkin	832e088659	lib/mergeset: properly update TableMetrics.TooLongItemsDroppedTotal inside Table.UpdateMetrics Substitute '+=' with '=', since tooLongItemsTotal is global counter, which doesn't belong to the Table struct. This is a follow-up for `69d244e6fb` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6297	2024-07-15 23:39:10 +02:00
Aliaksandr Valialkin	c995ccad93	lib/{storage,mergeset}: do not allow setting dataFlushInterval to values smaller than pending{Items,Rows}FlushInterval Pending rows and items unconditionally remain in memory for up to pending{Items,Rows}FlushInterval, so there is no any sense in setting dataFlushInterval (the interval for guaranteed flush of in-memory data to disk) to values smaller than pending{Items,Rows}FlushInterval, since this doesn't affect the interval for flushing pending rows and items from memory to disk. This is a follow-up for `4c80b17027` Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/6221	2024-07-15 10:08:15 +02:00
Aliaksandr Valialkin	3c02937a34	all: consistently use 'any' instead of 'interface{}' 'any' type is supported starting from Go1.18. Let's consistently use it instead of 'interface{}' type across the code base, since `any` is easier to read than 'interface{}'.	2024-07-10 00:20:37 +02:00
Nikolay	69d244e6fb	lib/mergeset: adds tracking for indexdb records drop (#6297 ) It allows to create alert for possible item drops at indexdb. It may happen, if ingested metric size exceeds max indexdb item size. --------- Signed-off-by: hagen1778 <roman@victoriametrics.com> Co-authored-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: hagen1778 <roman@victoriametrics.com>	2024-05-24 14:55:20 +02:00
Aliaksandr Valialkin	cc2647d212	lib/encoding: optimize UnmarshalVarUint64, UnmarshalVarInt64 and UnmarshalBytes a bit Change the return values for these functions - now they return the unmarshaled result plus the size of the unmarshaled result in bytes, so the caller could re-slice the src for further unmarshaling. This improves performance of these functions in hot loops of VictoriaLogs a bit.	2024-05-14 01:23:54 +02:00
Hui Wang	4c80b17027	storage: correctly apply `-inmemoryDataFlushInterval` when it's set t… (#6221 ) …o minimum supported value 1s pendingRowsFlushInterval was bumped to 2s in `73f0a805e2`	2024-05-13 16:44:30 +02:00
Aliaksandr Valialkin	590160ddbb	lib/slicesutil: add helper functions for setting slice length and extending its capacity The added helper functions - SetLength() and ExtendCapacity() - replace error-prone code with simple function calls.	2024-05-12 11:32:17 +02:00
Zakhar Bessarab	329c3cbdf0	lib/mergeset: improve test coverage (#6118 ) Add test to cover the code path with overflowing shards buffers and triggering merge to partition. This test covers the code path which leaded to https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5959 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com>	2024-04-30 10:21:37 +02:00
Aliaksandr Valialkin	85d09e5a2d	lib/{mergeset,storage}: log deleting directories inside partitions if they are missing in parts.json This should improve debuggability of unexpected deletion of directories inside partitions. While at it, log the proper path to parts.json when the directory for big part is missing in the partition. parts.json is located inside directory with small parts, and there is no parts.json file inside directory with big parts.	2024-04-16 19:11:32 +02:00
Zakhar Bessarab	2205de2391	lib/mergeset: fix flushing incorrect set of inmemoryBlocks (#6089 ) Follow-up for `bace9a2501` Related: - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/6069 - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/5959 Signed-off-by: Zakhar Bessarab <z.bessarab@victoriametrics.com> Co-authored-by: Aliaksandr Valialkin <valyala@victoriametrics.com>	2024-04-11 09:26:06 +02:00
Aliaksandr Valialkin	6a8dc74ee7	lib/mergeset: use unsafe.Slice and unsafe.String instead of deprecated reflect.SliceHeader with unsafe conversion from slice header to string header	2024-02-29 17:29:33 +02:00
Aliaksandr Valialkin	f81b480905	lib/mergeset: consistently use atomic.* types instead of atomic.* function calls on ordinary types See `ea9e2b19a5`	2024-02-23 23:29:35 +02:00
Aliaksandr Valialkin	5c89150fc9	lib/mergeset: consistently use atomic.* type for refCount and mustDrop fields in table struct in the same way as it is used in lib/storage See `ea9e2b19a5` and `a204fd69f1`	2024-02-23 22:59:23 +02:00

1 2 3 4 5 ...

272 Commits