Hyper-Connections (HC) make the usual single shortcut in neural networks wider by creating several parallel streams and letting the model mix them, but this can become unstable when stacked deep.
AT2PO is a new way to train AI agents that work in several turns, like asking the web a question, reading the result, and trying again.