Since the first connection is not closed, vmstorage will never
terminate gracefully, which causes all caches to be reset on the next
start-up.
Follow-up for 244769a00d (#10136)
Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>
The `err` may contain information about request cancellation performed by the server code.
In such cases the error must be logged. The error must be ignored only if the client canceled the request.
This is a follow-up for the commit c9596a0364
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10078
This is to debug cases when the metric name tracker resets the tsid cache
after restart. It could be due to vmstorage not having enough time to stop
gracefully. Logs should provide this info.
Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>
This fixes the following corner case: if all instances of a cache have
zero size, the stats won't be set at all. This results in some weird
graphs if the cache is reset very often (such as tfssCache): the cache
sizeMaxBytes alternates between the actual value and zero.
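The corner case above can be sketched in Go; the type and field names below are illustrative, not the actual VictoriaMetrics code. The sketch assumes an UpdateStats method that accumulates per-instance stats, and shows why it must add them unconditionally, even when the instance size is zero:

```go
package main

import "fmt"

// cacheStats and workingSetCache are illustrative names.
type cacheStats struct {
	SizeBytes    uint64
	SizeMaxBytes uint64
}

type workingSetCache struct {
	sizeBytes    uint64
	sizeMaxBytes uint64
}

func (c *workingSetCache) UpdateStats(s *cacheStats) {
	// A buggy variant would skip the update when sizeBytes == 0:
	//
	//   if c.sizeBytes == 0 { return }
	//
	// The fix is to always accumulate the stats, so sizeMaxBytes doesn't
	// flap between the real value and 0 after frequent cache resets.
	s.SizeBytes += c.sizeBytes
	s.SizeMaxBytes += c.sizeMaxBytes
}

func main() {
	caches := []*workingSetCache{
		{sizeBytes: 0, sizeMaxBytes: 1024}, // just reset, but still has a max size
		{sizeBytes: 0, sizeMaxBytes: 1024},
	}
	var s cacheStats
	for _, c := range caches {
		c.UpdateStats(&s)
	}
	fmt.Println(s.SizeMaxBytes) // 2048, not 0
}
```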
Follow-up for f62893c151
Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>
Commit 5a587f2006 was not properly ported
to the single-node branch. Since the single-node version is able to perform both
promscrape and self-scrape, the metadata-add methods must be added to both
of those paths.
This commit fixes the missing metadata addition to the storage.
Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10175
Previously a short spike in the number of concurrent requests immediately led to `429 Too Many Requests` errors
when the number of concurrent requests exceeded -maxConcurrentRequests or -maxConcurrentPerUserRequests.
This commit allows processing short spikes in the number of concurrent requests during the -maxQueueDuration timeout.
Requests are rejected only if they cannot be served according to the concurrency limits within -maxQueueDuration.
See https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10078
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/10112
- Introduce backendURLs struct, which holds all the backend urls and allows stopping
all the health checkers across all the backend urls with a single call to backendURLs.stopHealthChecks().
- Immediately cancel the pending Dial call to the backend when backendURLs.stopHealthChecks() is called.
Use lib/netutil.Dialer.DialContext() for this.
- Replace the fragile closing of the stopHealthCheckCh channel via stopHealthCheckOnce.Do()
  with an easier-to-maintain call to the cancel() func of the corresponding healthChecksContext.
- Wait until the health checker goroutines are finished before returning from UserInfo.stopHealthChecks().
  Previously the health checker goroutines could keep running for some time, trying to dial the backend
  after the return from UserInfo.stopHealthChecks().
- Try dialing the broken backend for https urls. It is better if the broken backend logs the error
instead of routing client requests to the broken backend.
- Log dial errors to the broken backend, so users could troubleshoot the backend connectivity issue with more details.
- Refer to the correct issue - https://github.com/VictoriaMetrics/VictoriaMetrics/issues/9997 -
  in the comments explaining why periodic dialing of the broken backend is needed.
  Previously https://github.com/VictoriaMetrics/VictoriaMetrics/issues/9890 was incorrectly referenced.
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/issues/9997
Updates https://github.com/VictoriaMetrics/VictoriaMetrics/pull/10147
Follow-up for https://github.com/VictoriaMetrics/VictoriaMetrics/pull/10177
Add `vmauth_user_request_backend_requests_total` and
`vmauth_unauthorized_user_request_backend_requests_total`, which track
the number of requests sent to backends and are aligned with
`vmauth_user_requests_total`.
The existing `vmauth_http_request_errors_total` currently only counts
requests with `invalid_auth_token`. Once authorization has passed, any
subsequent request errors are tracked under
`xxx_user_request_backend_requests_total`.
This commit introduces the global `sampleLimit` setting to restrict the number
of samples accepted per scrape target, mirroring the behavior of
Prometheus.
Motivation:
1) The existing `-promscrape.seriesLimitPerTarget` flag currently takes
precedence over any `sample_limit` setting defined directly on the
scrape target. The new `sampleLimit` implementation ensures that the
target configuration is able to override the global setting, allowing
users to define specific limits per target.
2) The existing series limit flag uses memory-intensive Bloom filters,
resulting in high RAM consumption under high-cardinality scraping
scenarios. The `sampleLimit` provides a much simpler, low-overhead
alternative.
Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/10145
The encoding.DecompressZSTD* functions consistently update the vm_zstd_block_decompress_calls_total metric.
Also make the following improvements after the commit 10f7cd2ffc:
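The precedence described in point 1 can be sketched with a hypothetical helper (the function name is illustrative, not the actual vmagent code): a per-target `sample_limit` overrides the global setting, and the global setting applies only when the target doesn't define its own limit.

```go
package main

import "fmt"

// effectiveSampleLimit returns the sample limit to enforce for a scrape
// target: the per-target sample_limit wins over the global setting.
func effectiveSampleLimit(perTarget, global int) int {
	if perTarget > 0 {
		return perTarget
	}
	return global
}

func main() {
	fmt.Println(effectiveSampleLimit(0, 5000))    // 5000: the global setting applies
	fmt.Println(effectiveSampleLimit(1000, 5000)) // 1000: the target config overrides it
}
```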
- Add encoding.DecompressZSTDLimited() function and use it instead of zstd.DecompressLimited,
so it properly updates vm_zstd_block_decompress_calls_total metric.
- Clarify description for the encoding.DecompressZSTD* and zstd.Decompress* functions.
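The pattern above can be sketched as follows. The counter and the decompressBlock stand-in below are illustrative (the real code lives in lib/encoding and calls the actual zstd decompressor); the point is that every Decompress* entry point bumps the same counter, so the metric stays consistent regardless of which variant is called.

```go
package main

import (
	"fmt"
	"sync/atomic"
)

// zstdBlockDecompressCalls mirrors the vm_zstd_block_decompress_calls_total metric.
var zstdBlockDecompressCalls atomic.Uint64

// decompressBlock stands in for the real zstd decompression call.
func decompressBlock(dst, src []byte, maxSize int) ([]byte, error) {
	return append(dst, src...), nil
}

// DecompressZSTD decompresses src and updates the calls counter.
func DecompressZSTD(dst, src []byte) ([]byte, error) {
	zstdBlockDecompressCalls.Add(1)
	return decompressBlock(dst, src, 0)
}

// DecompressZSTDLimited is like DecompressZSTD, but limits the size of the
// decompressed data - and it updates the same counter, so the metric
// doesn't miss calls made through the limited variant.
func DecompressZSTDLimited(dst, src []byte, maxSize int) ([]byte, error) {
	zstdBlockDecompressCalls.Add(1)
	return decompressBlock(dst, src, maxSize)
}

func main() {
	_, _ = DecompressZSTD(nil, []byte("a"))
	_, _ = DecompressZSTDLimited(nil, []byte("b"), 64)
	fmt.Println(zstdBlockDecompressCalls.Load()) // 2
}
```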
Currently, `dateMetricIDCache` is reset when it is full, but it is never
reset when it is not full and the data it stores is no longer needed. This leads
to the following problems:
- During regular data ingestion the cache sizeBytes may exceed max
allowed size and the cache gets reset which may potentially slow down
data ingestion (see #10064)
- The cache is per-indexDB. This means that in partition index (#8134)
there will be as many instances of this cache as the number of
partitions. If someone performs a backfill across all partitions, this
will fill all caches and they will never get reset even if no more
historical data is ingested.
So the solution is to periodically rotate the cache. After the first
rotation the data is not deleted but moved to `prev` storage. After the
second rotation `prev` gets deleted. This gives the cache an opportunity
to restore the `prev` data if it is still in use. Based on #10167.
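A minimal Go sketch of this two-generation rotation scheme (illustrative names, not the actual dateMetricIDCache code): a periodic rotation moves curr to prev and drops the old prev, while a lookup that hits prev promotes the entry back into curr, keeping data that is still in use alive across rotations.

```go
package main

import "fmt"

// rotatingCache keeps two generations of entries: curr and prev.
type rotatingCache struct {
	curr map[string]bool
	prev map[string]bool
}

func newRotatingCache() *rotatingCache {
	return &rotatingCache{curr: map[string]bool{}, prev: map[string]bool{}}
}

func (c *rotatingCache) Set(k string) { c.curr[k] = true }

func (c *rotatingCache) Has(k string) bool {
	if c.curr[k] {
		return true
	}
	if c.prev[k] {
		c.curr[k] = true // still in use - restore into curr
		return true
	}
	return false
}

// Rotate is called periodically: the old prev is deleted and curr becomes prev.
func (c *rotatingCache) Rotate() {
	c.prev = c.curr
	c.curr = map[string]bool{}
}

func main() {
	c := newRotatingCache()
	c.Set("2024-05-01/metric1")
	c.Rotate()
	fmt.Println(c.Has("2024-05-01/metric1")) // true: restored from prev
	c.Rotate()
	c.Rotate()
	fmt.Println(c.Has("2024-05-01/metric1")) // false: unused for two rotations, deleted
}
```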
This PR also removes the recently introduced
`-storage.cacheSizeIndexDBDateMetricID` flag (see #10135). This should
be safe since the flag is new and its use case is very niche, i.e. hardly
anyone uses it yet.
---------
Signed-off-by: Artem Fetishev <rtm@victoriametrics.com>
We have `vmauth_user_requests_total` and
`vmauth_unauthorized_user_requests_total` to track requests from the
user side. However, in scenarios such as request timeouts or when the
response code matches `retry_status_code`, a single request may be
retried across multiple backends.
Exposing counters `vmauth_user_request_backend_requests_total` and
`vmauth_unauthorized_user_request_backend_requests_total` that track the
number of requests sent to backends provides insight into the routing
logic and can help identify if requests are being consistently retried,
which may contribute to increased request duration.
Related PR https://github.com/VictoriaMetrics/VictoriaMetrics/pull/10171
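The retry relationship described above can be sketched with a toy counter set (illustrative only - the real vmauth uses the github.com/VictoriaMetrics/metrics package): one user request that is retried across backends produces a single increment of the user-requests counter but several increments of the backend-requests counter, so the ratio between the two exposes the retry behavior.

```go
package main

import (
	"fmt"
	"sync"
)

// counterSet maps a (metric name, username) pair to a counter value.
type counterSet struct {
	mu sync.Mutex
	m  map[string]uint64
}

func newCounterSet() *counterSet { return &counterSet{m: map[string]uint64{}} }

func (cs *counterSet) inc(name, user string) {
	cs.mu.Lock()
	cs.m[fmt.Sprintf(`%s{username=%q}`, name, user)]++
	cs.mu.Unlock()
}

func (cs *counterSet) get(name, user string) uint64 {
	cs.mu.Lock()
	defer cs.mu.Unlock()
	return cs.m[fmt.Sprintf(`%s{username=%q}`, name, user)]
}

func main() {
	cs := newCounterSet()
	// One user request which is retried on a second backend:
	cs.inc("vmauth_user_requests_total", "alice")
	cs.inc("vmauth_user_request_backend_requests_total", "alice")
	cs.inc("vmauth_user_request_backend_requests_total", "alice") // retry
	fmt.Println(cs.get("vmauth_user_requests_total", "alice"))                 // 1
	fmt.Println(cs.get("vmauth_user_request_backend_requests_total", "alice")) // 2
}
```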
Currently, backendErrors may be counted twice if a request to the
backend fails due to context.DeadlineExceeded.
9bc7a17d80/app/vmauth/main.go (L328)
9bc7a17d80/app/vmauth/main.go (L294)
And we increment this counter in a way that is somewhat inconsistent.
Given that the counter's name is `xx_request_backend_errors_total`, it
should only increase when a backend request returns an error. This value
can exceed the user request error count if multiple backend requests
fail for a single user request.
The `xxx_request_backend_errors_total` counter should be used in
conjunction with the `xxx_request_backend_requests_total` introduced in
https://github.com/VictoriaMetrics/VictoriaMetrics/pull/10171.
There is no reason to send a request to the first backend if all
backends are marked as broken.
Also,
> // getFirstAvailableBackendURL returns the first available backendURL, which isn't broken.
The fix only skips a redundant request when all backends are
unavailable; it doesn't introduce any changes from the user's perspective,
so the changelog entry is skipped.
When time series deletion is performed, some of the storage caches
need to be reset while others do not. This PR reviews all storage caches,
documents why each is reset or not, and places all the resetting
logic (and comments) in one place.
### Describe Your Changes
Previously, a backend was considered healthy as soon as its
'bu.brokenDeadline' deadline expired, even if it was still unavailable.
This caused avoidable request failures and retries.
Now vmauth performs a TCP dial (1s timeout) before restoring the backend
to the healthy pool. This avoids routing traffic to backends that are still down.
The dial check also covers cases where a route to the backend cannot be
resolved. Without this check, user requests would hang until the
connection timeout, leading to long waits or errors. The new check
fails fast and doesn't impact real user requests.
Fixes https://github.com/VictoriaMetrics/VictoriaMetrics/issues/9997
### Checklist
The following checks are **mandatory**:
- [ ] My change adheres to [VictoriaMetrics contributing
guidelines](https://docs.victoriametrics.com/victoriametrics/contributing/#pull-request-checklist).
- [ ] My change adheres to [VictoriaMetrics development
goals](https://docs.victoriametrics.com/victoriametrics/goals/).