Research Group: http://mpi-softsec.github.io
2/2
2/2
On the positive side, I found the AI reviewer *way* more elaborate in eliciting both the positive and negative points. The review is more objective, less/not opinionated. It is more constructive and for every weakness makes suggestions for improvements.
1/
On the positive side, I found the AI reviewer *way* more elaborate in eliciting both the positive and negative points. The review is more objective, less/not opinionated. It is more constructive and for every weakness makes suggestions for improvements.
1/
📝: arxiv.org/abs/2507.00057
🦋 : bsky.app/profile/did:...
We are off to a good start. While the synopsis misses the motivation (*why* this is interesting), it offers the most important points. Good abstract-length summary.
1/
However, it does make errors, and I wouldn't trust it as an actual (co)-reviewer.
12/12
However, it does make errors, and I wouldn't trust it as an actual (co)-reviewer.
12/12
11/
11/
10/
10/
Our theorem expresses what (and how efficiently) we can learn about detecting non-zero incoherence given the alg. output: "If after n(δ,ε) samples we detect no disagreement, then incoherence is at most ε with prob. at least 1-δ".
9/
Our theorem expresses what (and how efficiently) we can learn about detecting non-zero incoherence given the alg. output: "If after n(δ,ε) samples we detect no disagreement, then incoherence is at most ε with prob. at least 1-δ".
9/
Our incoherence-based detection reports indeed no false positives: A non-zero incoherence implies a non-zero error, even empirically. To cite the AI reviewer: "If two sampled programs disagree on an input, at least one of them is wrong".
8/
Our incoherence-based detection reports indeed no false positives: A non-zero incoherence implies a non-zero error, even empirically. To cite the AI reviewer: "If two sampled programs disagree on an input, at least one of them is wrong".
8/
7/
7/
Nothing reject-worthy: Not a key equation but a remark, and the error is just a typo; e.g., 1-\bar{E}(S,1,1) fixes it.
But YES, the AI reviewer found a bug in our Equation (12). Wow!!
6/
Nothing reject-worthy: Not a key equation but a remark, and the error is just a typo; e.g., 1-\bar{E}(S,1,1) fixes it.
But YES, the AI reviewer found a bug in our Equation (12). Wow!!
6/
5/
5/
Interestingly, one item (highlighted in blue) is never mentioned in our paper, but something we are now actively pursuing.
4/
Interestingly, one item (highlighted in blue) is never mentioned in our paper, but something we are now actively pursuing.
4/
Apart from a minor error (no executable semantics needed; only ability to execute), the first strength looks good.
3/
Apart from a minor error (no executable semantics needed; only ability to execute), the first strength looks good.
3/
The summary of review definitely hits the nail on the head. We can see motivation and main contributions. Nice!
2/
The summary of review definitely hits the nail on the head. We can see motivation and main contributions. Nice!
2/
We define *incoherence* as the probability that any two generated programs implement different functions and prove that incoherence is an upper bound on incorrectness.
We define *incoherence* as the probability that any two generated programs implement different functions and prove that incoherence is an upper bound on incorrectness.