go-ethereum

Commit Graph

Author	SHA1	Message	Date
rjl493456442	a7f9523ae1	all: implement state history v2 (#30107 ) This pull request delivers the new version of the state history, where the raw storage key is used instead of the hash. Before the cancun fork, it's supported by protocol to destruct a specific account and therefore, all the storage slot owned by it should be wiped in the same transition. Technically, storage wiping should be performed through storage iteration, and only the storage key hash will be available for traversal if the state snapshot is not available. Therefore, the storage key hash is chosen as the identifier in the old version state history. Fortunately, account self-destruction has been deprecated by the protocol since the Cancun fork, and there are no empty accounts eligible for deletion under EIP-158. Therefore, we can conclude that no storage wiping should occur after the Cancun fork. In this case, it makes no sense to keep using hash. Besides, another big reason for making this change is the current format state history is unusable if verkle is activated. Verkle tree has a different key derivation scheme (merkle uses keccak256), the preimage of key hash must be provided in order to make verkle rollback functional. This pull request is a prerequisite for landing verkle. Additionally, the raw storage key is more human-friendly for those who want to manually check the history, even though Solidity already performs some hashing to derive the storage location. --- This pull request doesn't bump the database version, as I believe the database should still be compatible if users degrade from the new geth version to old one, the only side effect is the persistent new version state history will be unusable. --------- Co-authored-by: Zsolt Felfoldi <zsfelfoldi@gmail.com>	2025-01-17 02:59:02 +01:00
rjl493456442	37c0e6992e	cmd, core, miner: rework genesis setup (#30907 ) This pull request refactors the genesis setup function, the major changes are highlighted here: (a) Triedb is opened in verkle mode if `EnableVerkleAtGenesis` is configured in chainConfig or the database has been initialized previously with `EnableVerkleAtGenesis` configured. A new config field `EnableVerkleAtGenesis` has been added in the chainConfig. This field must be configured with True if Geth wants to initialize the genesis in Verkle mode. In the verkle devnet-7, the verkle transition is activated at genesis. Therefore, the verkle rules should be used since the genesis. In production networks (mainnet and public testnets), verkle activation always occurs after the genesis block. Therefore, this flag is only made for devnet and should be deprecated later. Besides, verkle transition at non-genesis block hasn't been implemented yet, it should be done in the following PRs. (b) The genesis initialization condition has been simplified There is a special mode supported by the Geth is that: Geth can be initialized with an existing chain segment, which can fasten the node sync process by retaining the chain freezer folder. Originally, if the triedb is regarded as uninitialized and the genesis block can be found in the chain freezer, the genesis block along with genesis state will be committed. This condition has been simplified to checking the presence of chain config in key-value store. The existence of chain config can represent the genesis has been committed.	2025-01-14 11:49:30 +01:00
rjl493456442	82e963e5c9	triedb/pathdb: configure different node hasher in pathdb (#31008 ) As the node hash scheme in verkle and merkle are totally different, the original default node hasher in pathdb is no longer suitable. Therefore, this pull request configures different node hasher respectively.	2025-01-10 20:51:19 +08:00
Ceyhun Onur	5065e6c935	triedb/pathdb: fix tester generator (#30972 ) This change fixes is a rare bug in test generator: If the run is very unlucky it can use `modifyAccountOp` / `deleteAccountOp` without creating any account, leading to have a trie root same as the parent. This change makes the first operation always be a creation.	2025-01-07 11:49:13 +01:00
gitglorythegreat	85ffbde427	all: use cmp.Compare (#30958 )	2025-01-02 14:06:47 +01:00
rjl493456442	bc1ec69008	trie/pathdb: state iterator (snapshot integration pt 4) (#30654 ) In this pull request, the state iterator is implemented. It's mostly a copy-paste from the original state snapshot package, but still has some important changes to highlight here: (a) The iterator for the disk layer consists of a diff iterator and a disk iterator. Originally, the disk layer in the state snapshot was a wrapper around the disk, and its corresponding iterator was also a wrapper around the disk iterator. However, due to structural differences, the disk layer iterator is divided into two parts: - The disk iterator, which traverses the content stored on disk. - The diff iterator, which traverses the aggregated state buffer. Checkout `BinaryIterator` and `FastIterator` for more details. (b) The staleness management is improved in the diffAccountIterator and diffStorageIterator Originally, in the `diffAccountIterator`, the layer’s staleness had to be checked within the Next function to ensure the iterator remained usable. Additionally, a read lock on the associated diff layer was required to first retrieve the account blob. This read lock protection is essential to prevent concurrent map read/write. Afterward, a staleness check was performed to ensure the retrieved data was not outdated. The entire logic can be simplified as follows: a loadAccount callback is provided to retrieve account data. If the corresponding state is immutable (e.g., diff layers in the path database), the staleness check can be skipped, and a single account data retrieval is sufficient. However, if the corresponding state is mutable (e.g., the disk layer in the path database), the callback can operate as follows: ```go func(hash common.Hash) ([]byte, error) { dl.lock.RLock() defer dl.lock.RUnlock() if dl.stale { return nil, errSnapshotStale } return dl.buffer.states.mustAccount(hash) } ``` The callback solution can eliminate the complexity for managing concurrency with the read lock for atomic operation.	2024-12-16 21:10:08 +08:00
Martin HS	9045b79bc2	metrics, cmd/geth: change init-process of metrics (#30814 ) This PR modifies how the metrics library handles `Enabled`: previously, the package `init` decided whether to serve real metrics or just dummy-types. This has several drawbacks: - During pkg init, we need to determine whether metrics are enabled or not. So we first hacked in a check if certain geth-specific commandline-flags were enabled. Then we added a similar check for geth-env-vars. Then we almost added a very elaborate check for toml-config-file, plus toml parsing. - Using "real" types and dummy types interchangeably means that everything is hidden behind interfaces. This has a performance penalty, and also it just adds a lot of code. This PR removes the interface stuff, uses concrete types, and allows for the setting of Enabled to happen later. It is still assumed that `metrics.Enable()` is invoked early on. The somewhat 'heavy' operations, such as ticking meters and exp-decay, now checks the enable-flag to prevent resource leak. The change may be large, but it's mostly pretty trivial, and from the last time I gutted the metrics, I ensured that we have fairly good test coverage. --------- Co-authored-by: Felix Lange <fjl@twurst.com>	2024-12-10 13:27:29 +01:00
rjl493456442	05148d972c	triedb/pathdb: track flat state changes in pathdb (snapshot integration pt 2) (#30643 ) This pull request ports some changes from the main state snapshot integration one, specifically introducing the flat state tracking in pathdb. Note, the tracked flat state changes are only held in memory and won't be persisted in the disk. Meanwhile, the correspoding state retrieval in persistent state is also not supported yet. The states management in disk is more complicated and will be implemented in a separate pull request. Part 1: https://github.com/ethereum/go-ethereum/pull/30752	2024-11-29 19:30:45 +08:00
rjl493456442	6485d5e3ff	core, triedb: remove destruct flag in state snapshot (#30752 ) This pull request removes the destruct flag from the state snapshot to simplify the code. Previously, this flag indicated that an account was removed during a state transition, making all associated storage slots inaccessible. Because storage deletion can involve a large number of slots, the actual deletion is deferred until the end of the process, where it is handled in batches. With the deprecation of self-destruct in the Cancun fork, storage deletions are no longer expected. Historically, the largest storage deletion event in Ethereum was around 15 megabytes—manageable in memory. In this pull request, the single destruct flag is replaced by a set of deletion markers for individual storage slots. Each deleted storage slot will now appear in the Storage set with a nil value. This change will simplify a lot logics, such as storage accessing, storage flushing, storage iteration and so on.	2024-11-22 16:55:43 +08:00
rjl493456442	74ef47462f	core/state, triedb/database: refactor state reader (#30712 ) Co-authored-by: Martin HS <martin@swende.se>	2024-11-09 08:08:06 +08:00
Martin HS	da17f2d65b	all: fix issues with benchmarks (#30667 ) This PR fixes some issues with benchmarks - [x] Removes log output from a log-test - [x] Avoids a `nil`-defer in `triedb/pathdb` - [x] Fixes some crashes re tracers - [x] Refactors a very resource-expensive benchmark for blobpol. NOTE: this rewrite touches live production code (a little bit), as it makes the validator-function used by the blobpool configurable. - [x] Switch some benches over to use pebble over leveldb - [x] reduce mem overhead in the setup-phase of some tests - [x] Marks some tests with a long setup-phase to be skipped if `-short` is specified (where long is on the order of tens of seconds). Ideally, in my opinion, one should be able to run with `-benchtime 10ms -short` and sanity-check all tests very quickly. - [x] Drops some metrics-bechmark which times the speed of `copy`. --------- Co-authored-by: Sina Mahmoodi <itz.s1na@gmail.com>	2024-11-04 15:10:12 +01:00
rjl493456442	b6c62d5887	core, trie, triedb: minor changes from snapshot integration (#30599 ) This change ports some non-important changes from https://github.com/ethereum/go-ethereum/pull/30159, including interface renaming and some trivial refactorings.	2024-10-18 17:06:31 +02:00
rjl493456442	eff0bed91b	core/rawdb: freezer index repair (#29792 ) This pull request removes the `fsync` of index files in freezer.ModifyAncients function for performance gain. Originally, fsync is added after each freezer write operation to ensure the written data is truly transferred into disk. Unfortunately, it turns out `fsync` can be relatively slow, especially on macOS (see https://github.com/ethereum/go-ethereum/issues/28754 for more information). In this pull request, fsync for index file is removed as it turns out index file can be recovered even after a unclean shutdown. But fsync for data file is still kept, as we have no meaningful way to validate the data correctness after unclean shutdown. --- But why do we need the `fsync` in the first place? As it's necessary for freezer to survive/recover after the machine crash (e.g. power failure). In linux, whenever the file write is performed, the file metadata update and data update are not necessarily performed at the same time. Typically, the metadata will be flushed/journalled ahead of the file data. Therefore, we make the pessimistic assumption that the file is first extended with invalid "garbage" data (normally zero bytes) and that afterwards the correct data replaces the garbage. We have observed that the index file of the freezer often contain garbage entry with zero value (filenumber = 0, offset = 0) after a machine power failure. It proves that the index file is extended without the data being flushed. And this corruption can destroy the whole freezer data eventually. Performing fsync after each write operation can reduce the time window for data to be transferred to the disk and ensure the correctness of the data in the disk to the greatest extent. --- How can we maintain this guarantee without relying on fsync? Because the items in the index file are strictly in order, we can leverage this characteristic to detect the corruption and truncate them when freezer is opened. Specifically these validation rules are performed for each index file: For two consecutive index items: - If their file numbers are the same, then the offset of the latter one MUST not be less than that of the former. - If the file number of the latter one is equal to that of the former plus one, then the offset of the latter one MUST not be 0. - If their file numbers are not equal, and the latter's file number is not equal to the former plus 1, the latter one is valid And also, for the first non-head item, it must refer to the earliest data file, or the next file if the earliest file is not sufficient to place the first item(very special case, only theoretical possible in tests) With these validation rules, we can detect the invalid item in index file with greatest possibility. --- But unfortunately, these scenarios are not covered and could still lead to a freezer corruption if it occurs: All items in index file are in zero value It's impossible to distinguish if they are truly zero (e.g. all the data entries maintained in freezer are zero size) or just the garbage left by OS. In this case, these index items will be kept by truncating the entire data file, namely the freezer is corrupted. However, we can consider that the probability of this situation occurring is quite low, and even if it occurs, the freezer can be considered to be close to an empty state. Rerun the state sync should be acceptable. Index file is integral while relative data file is corrupted It might be possible the data file is corrupted whose file size is extended correctly with garbage filled (e.g. zero bytes). In this case, it's impossible to detect the corruption by index validation. We can either choose to `fsync` the data file, or blindly believe that if index file is integral then the data file could be integral with very high chance. In this pull request, the first option is taken.	2024-10-01 18:16:16 +02:00
rjl493456442	623b17ba20	core/state: state reader abstraction (#29761 ) This pull request introduces a state.Reader interface for state accessing. The interface could be implemented in various ways. It can be pure trie only reader, or the combination of trie and state snapshot. What's more, this interface allows us to have more flexibility in the future, e.g. the archive reader (for accessing archive state). Additionally, this pull request removes the following metrics - `chain/snapshot/account/reads` - `chain/snapshot/storage/reads`	2024-09-05 13:10:47 +03:00
rjl493456442	bfda8ae0c6	core: add metrics for state access (#30353 ) This pull request adds a few more performance metrics, specifically: - The average time cost of an account read - The average time cost of a storage read - The rate of account reads - The rate of storage reads	2024-08-26 20:02:10 +08:00
rjl493456442	4ad88e9463	triedb/pathdb: print out all trie owner and hash information (#30200 ) This pull request explicitly prints out the full hash for debugging purpose.	2024-07-24 20:32:28 +08:00
rjl493456442	f59d013e40	core/rawdb, triedb, cmd: create an isolated disk namespace for verkle (#30105 ) * core, triedb/pathdb, cmd: define verkle state ancient store * core/rawdb, triedb: add verkle namespace in pathdb	2024-07-16 16:17:58 +03:00
rjl493456442	b530d8e455	trie, triedb: remove unnecessary child resolver interface (#30167 )	2024-07-16 18:52:19 +08:00
lilasxie	19c3c1e205	triedb/pathdb: fix flaky test in pathdb (#29901 )	2024-06-28 21:15:54 +08:00
rjl493456442	045b9718d5	trie: relocate state execution logic into pathdb package (#29861 )	2024-06-27 20:30:39 +08:00
maskpp	c736b04d9b	triedb/pathdb: use maps.Clone and maps.Keys (#29985 )	2024-06-17 17:09:29 +02:00
maskpp	115d154392	trie, triedb/pathdb: prealloc capacity for map and slice (#29986 )	2024-06-17 11:42:41 +02:00
Darioush Jalali	fd5078c779	trie/triedb: add Reader to backend interface (#29988 )	2024-06-14 14:52:46 +08:00
jwasinger	69351e8b0f	core/state, eth/protocols, trie, triedb/pathdb: remove unused error from trie Commit (#29869 ) * core/state, eth/protocols, trie, triedb/pathdb: remove unused error return from trie Commit * move set back to account-trie-update block scoping for easier readability * address review * undo tests submodule change * trie: panic if BatchSerialize returns an error in Verkle trie Commit * trie: verkle comment nitpicks --------- Co-authored-by: Péter Szilágyi <peterke@gmail.com>	2024-06-12 12:23:16 +03:00
rjl493456442	b88051ec83	core/rawdb, triedb/pathdb: fix freezer read-only option (#29823 )	2024-05-28 14:41:11 +02:00
maskpp	2c67fab0d7	trie/pathdb: preallocate map capacity (#29690 ) * preallocated capacity for map's certain usege of memory * preallocated capacity for map's certain usege of memory	2024-05-02 12:35:45 +03:00
rjl493456442	9f96e07c1c	core/rawdb, trie: improve db APIs for accessing trie nodes (#29362 ) * core/rawdb, trie: improve db APIs for accessing trie nodes * triedb/pathdb: fix	2024-04-30 16:25:35 +02:00
rjl493456442	f46c878441	core/rawdb: implement in-memory freezer (#29135 )	2024-04-30 11:33:22 +02:00
Martin HS	853e0c23f3	eth/catalyst, trie/pathdb: fix flaky tests (#29571 ) This change fixes three flaky tests `TestEth2AssembleBlock`,`TestEth2NewBlock`, `TestEth2PrepareAndGetPayload` and `TestDisable`. --------- Co-authored-by: Gary Rong <garyrong0905@gmail.com>	2024-04-23 10:33:36 +02:00
Abirdcfly	b179b7b8e7	all: remove duplicate word in comments (#29531 ) This change removes some duplicate words in in comments	2024-04-15 08:34:31 +02:00
Guillaume Ballet	da7469e5c4	core: add an end-to-end verkle test (#29262 ) core: add a simple verkle test triedb, core: skip hash comparison in verkle core: remove legacy daoFork logic in verkle chain maker fix: nil pointer in tests triedb/pathdb: add blob hex core: less defensive Co-authored-by: Ignacio Hagopian <jsign.uy@gmail.com> Co-authored-by: Martin HS <martin@swende.se> Co-authored-by: Gary Rong <garyrong0905@gmail.com>	2024-03-26 21:25:41 +01:00
Aaron Chen	723b1e36ad	all: fix mismatched names in comments (#29348 ) * all: fix mismatched names in comments * metrics: fix mismatched name in UpdateIfGt	2024-03-26 21:01:28 +01:00
Martin HS	14cc967d19	all: remove dependency on golang.org/exp (#29314 ) This change includes a leftovers from https://github.com/ethereum/go-ethereum/pull/29307 - using the [new `slices` package](https://go.dev/doc/go1.21#slices) and - using the [new `cmp.Ordered`](https://go.dev/doc/go1.21#cmp) instead of exp `constraints.Ordered`	2024-03-25 07:50:18 +01:00
rjl493456442	6490d9897a	cmd, triedb: implement history inspection (#29267 ) This pull request introduces a database tool for inspecting the state history. It can be used for either account history or storage slot history, within a specific block range. The state output format can be chosen either with - the "rlp-encoded" values (those inserted into the merkle trie) - the "rlp-decoded" value (the raw state value) The latter one needs --raw flag.	2024-03-22 20:12:10 +08:00
Darioush Jalali	f46fe62c5d	triedb/hashdb: Avoid setting db.cleans on Close (#29309 )	2024-03-22 19:38:24 +08:00
rjl493456442	15eb9773f9	triedb/pathdb: improve tests (#29278 )	2024-03-19 10:50:08 +08:00
rjl493456442	7b81cf6362	core/state, trie/triedb/pathdb: remove storage incomplete flag (#28940 ) As SELF-DESTRUCT opcode is disabled in the cancun fork(unless the account is created within the same transaction, nothing to delete in this case). The account will only be deleted in the following cases: - The account is created within the same transaction. In this case the original storage was empty. - The account is empty(zero nonce, zero balance, zero code) and is touched within the transaction. Fortunately this kind of accounts are not-existent on ethereum-mainnet. All in all, after cancun, we are pretty sure there is no large contract deletion and we don't need this mechanism for oom protection.	2024-03-05 14:31:55 +01:00
psogv0308	35cebc1687	triedb/pathdb: changed the test code to check for verifying state (#29150 ) Co-authored-by: this-is-iron <iron@superblock.co>	2024-03-04 11:03:53 +01:00
Péter Szilágyi	865e1e9f57	cmd/utils, core/rawdb, triedb/pathdb: flip hash to path scheme (#29108 ) * cmd/utils, core/rawdb, triedb/pathdb: flip hash to path scheme * graphql: run tests in hash mode as the chain maker needs it	2024-02-29 12:40:59 +02:00
yzb	db4cf69166	all: replace fmt.Errorf() with errors.New() if no param required (#29126 ) replace-fmt-errorf Co-authored-by: yzb@example.cn <yzb@example.cn>	2024-02-29 11:56:46 +02:00
rjl493456442	5bae14f9df	triedb/pathdb: fix panic in recoverable (#29107 ) * triedb/pathdb: fix panic in recoverable * triedb/pathdb: add todo * triedb/pathdb: rename * triedb/pathdb: rename	2024-02-28 14:40:28 +02:00
rjl493456442	49623bd469	core, triedb/pathdb: calculate the size for batch pre-allocation (#29106 ) * core, triedb/pathdb: calculate the size for batch pre-allocation * triedb/pathdb: address comment	2024-02-28 14:23:52 +02:00
rjl493456442	fe91d476ba	all: remove the dependency from trie to triedb (#28824 ) This change removes the dependency from trie package to triedb package.	2024-02-13 14:49:53 +01:00

43 Commits