core/state: add code to state reader #30808

s1na · 2024-11-25T21:08:45Z

Contract code is technically also a part of the state. This PR adds code to the state reader interface. We also add code size because code size is separately cached in the cachingDB (although I think the overhead of computing the code size on the consumer end should not be too bad).

The motivation is #30441 where we are overriding the state reader interface to hook into state read events (account load, storage load and code load). You can see how that is done here: s1na@a22b30b#diff-abb7c3ba4a7ea95d325d17a5c93b0ebde2c5b43e67f2bc3eb0f76a0c5fc7a8cfR335-R337

s1na · 2024-11-25T21:10:15Z

core/state/reader.go

+	// ContractCode returns the code associated with a particular account.
+	//
+	// - Returns an empty code if it does not exist
+	// - It can return an error to indicate code doesn't exist


This breaks consistency to the rest of the interface. I have done this to avoid any semantics change in this PR.

I am a bit puzzled why the statedb is recording "not found" errors for code but not for storage or account.

It says "empty code if it does not exist". Pls clarify.
It says "can return an error to indicate code doesn't exist". Well, does it, or not?

Sorry clarified. It will return an error if code is not found.

core/state/reader.go

s1na · 2024-11-26T09:50:35Z

core/state/reader.go

+	for _, reader := range r.readers {
+		// Skip state reader as it doesn't provide contract code and
+		// always returns an error.
+		if _, ok := reader.(*trieReader); ok {


This is a hack: snapshot reader can never return code. So we skip it here and only focus on trie reader.

If the state reader returns empty result and no error the multi reader can confuse it for empty code.

If the state reader returns error it can cause multi reader to always return an error and confuse consumer of the Reader.

holiman · 2024-11-26T10:09:07Z

core/state/reader.go

+	// - It returns an error to indicate code doesn't exist
+	// - The returned code is safe to modify after the call


I want the docs to be really clear on semantics here. Here, you say that an error is returned if code doesn't exist. So, does it return errors if you ask for the contract code of anything which has no code? Or only if the account does not exist in the trie?

I mean, there are more points that could be clarified too:
How about asking for the code of a contract which is currently marked as selfdestructed? How about during initcode execution, or just after creation?

Also, the implementations (snapshot) seem to return error just because they are unable to provide the code. That seems like a spec-flaw. The API doesn't allow the implementor to signal "I'm unable to do this", thus forcing it to signal "the code doesn't exist", which is not necessarily the case. An API-wart, IMO

It's not beautiful, but you could e.g. add another error, so the underlying code can signal NotSupportedErr or something

I want the docs to be really clear on semantics here. Here, you say that an error is returned if code doesn't exist. So, does it return errors if you ask for the contract code of anything which has no code? Or only if the account does not exist in the trie?

I'm not sure how to formulate this better. What happens is we query disk by code hash. Address is not used at all. I believe it is kept in the (cachingDB) interface because of verkle.

So the semantic is return value only if code with that hash is found and it has size > 0.

How about asking for the code of a contract which is currently marked as selfdestructed? How about during initcode execution, or just after creation?

The reader doesn't care about evm semantics as it consults disk directly.

Then we should not have address in there... ?

Also, the implementations (snapshot) seem to return error just because they are unable to provide the code. That seems like a spec-flaw. The API doesn't allow the implementor to signal "I'm unable to do this", thus forcing it to signal "the code doesn't exist", which is not necessarily the case. An API-wart, IMO

Right. I have now added a method "SupportsCodeQuery() bool" to the interface which should return true if a reader allows you to fetch code. I'm not particularly fond of the name

Then we should not have address in there... ?

Summoning @rjl493456442 as I tried to stay consistent with state.Database

The address was added for verkle. In verkle, the contract code is tight with the contract.

rjl493456442 · 2024-11-26T11:52:23Z

One thing I am not happy with this implementation is: for all reader implementations, now they are tight with code reader.

Theoretically, we can divide reader into several different parts: state reader (account reader, storage reader) and code reader. The existing readers (state reader and trie reader) are actually the implementations of account reader and storage reader.

Please give me a bit more time to figure out the proper interface definition here. Ideally, it would be:

type Reader interface {
    AccountReader
    StorageReader
    CodeReader
}

And we can somehow combine different types of reader together and assemble them as a unified Reader.

s1na · 2024-11-28T10:31:47Z

Closing in favor of #30816

core/state: pipe code loading through reader

e4a794a

s1na requested review from karalabe, holiman and rjl493456442 as code owners November 25, 2024 21:08

s1na commented Nov 25, 2024

View reviewed changes

namiloh reviewed Nov 26, 2024

View reviewed changes

core/state/reader.go Outdated Show resolved Hide resolved

s1na added 3 commits November 26, 2024 10:26

fix copy, fix comments

42d3dc8

code only on trie reader

d6c4d58

change comment

56775c5

s1na commented Nov 26, 2024

View reviewed changes

holiman reviewed Nov 26, 2024

View reviewed changes

add method supportsCodeQuery

2a3b498

rjl493456442 mentioned this pull request Nov 27, 2024

core, state: introduce code reader interface #30816

Merged

s1na closed this Nov 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

core/state: add code to state reader #30808

core/state: add code to state reader #30808

s1na commented Nov 25, 2024

s1na Nov 25, 2024

namiloh Nov 26, 2024

s1na Nov 26, 2024

s1na Nov 26, 2024

holiman Nov 26, 2024

holiman Nov 26, 2024

s1na Nov 26, 2024 •

edited

Loading

holiman Nov 26, 2024

s1na Nov 26, 2024

s1na Nov 26, 2024

rjl493456442 Nov 26, 2024

rjl493456442 commented Nov 26, 2024

s1na commented Nov 28, 2024

		// - It returns an error to indicate code doesn't exist
		// - The returned code is safe to modify after the call

core/state: add code to state reader #30808

core/state: add code to state reader #30808

Conversation

s1na commented Nov 25, 2024

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

s1na Nov 26, 2024 • edited Loading

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

Choose a reason for hiding this comment

rjl493456442 commented Nov 26, 2024

s1na commented Nov 28, 2024

s1na Nov 26, 2024 •

edited

Loading