3 min readfrom Machine Learning

Some new updates to Papers with Code [P]

Some new updates to Papers with Code [P]
Some new updates to Papers with Code [P]

Hi folks,

Niels here from the open-source team at Hugging Face. I continue working on a revival of paperswithcode.co as we're back to the "age of research" per Ilya Sutskever! Hence, it's important to discover each other's research and build on each other's work, so we can collectively build the next Transformer. Below, I'll go over each of the new features that were recently added.

## Support for SOTA badges

Yes, that's right, totally like the old website. You can see that GLM-5.2, for instance, is obviously the hottest blog post today, achieves SOTA on PostTrainBench, and performs well on many other benchmarks. It is displayed whenever a paper gets a score within the top 3 of a given benchmark.

Note that these are displayed on any paper feed, including https://paperswithcode.co/tasks/video-classification, for example.

https://preview.redd.it/wawma8paeu8h1.png?width=2418&format=png&auto=webp&s=0ba3b6a0eaef231b7f3ca468cc3db4120f1b9e4d

## New trending score

The papers are now ranked based on a new trending metric. This is a combination of the GitHub star velocity and the trending score of the linked Hugging Face artifacts (models, datasets, and Spaces). Previously, this only took into account GitHub star velocity.

Thanks to this, papers like IndexCache are now trending, which is a core technique behind the trending GLM-5.2 model.

https://preview.redd.it/b6g04w2ogu8h1.png?width=2380&format=png&auto=webp&s=13d59bbadd5f8e8295deac2ee6e1e0e3dbc0f40f

## Support for external evals

Second, I've added support for "external" evals. This is a feature the legacy PwC website didn't actually have. Oftentimes, a paper has way more evals than the ones introduced in the paper itself. You can now view these third-party evals. Some examples:

https://preview.redd.it/mfnfdzxpeu8h1.png?width=1914&format=png&auto=webp&s=2b909ecf7c6e3fc088fd0a46fbc56f6859dfaf17

## More tasks, benchmarks and evals

I'm adding more benchmarks and adding evals of more papers. This happens gradually, based on the legacy PwC data available on the hub.

Some new benchmarks include:

- ImageNet - 10% of the data

https://preview.redd.it/wr55g27ofu8h1.png?width=2880&format=png&auto=webp&s=e6e5ef7e3a36cd5aa6d2841b149194239f4ad1e0

- 3D semantic segmentation:

https://preview.redd.it/zxgobrnqfu8h1.png?width=2880&format=png&auto=webp&s=6ee2935981825d5d7825709294ddb84a4b7a3ac9

- object counting:

https://preview.redd.it/uhv4wbrsfu8h1.png?width=2880&format=png&auto=webp&s=183decb144d9779e41bf12ca58fbaab66cd29cbf

and a lot more. Browse all of them at https://paperswithcode.co/tasks

## New domain

Papers with Code is now also available from paperswithco.de :)

Let me know what is missing, bug/feature requests, and whether you want to contribute!

Kind regards,

Niels

submitted by /u/NielsRogge
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#generative AI for data analysis
#Excel alternatives for data analysis
#natural language processing for spreadsheets
#conversational data analysis
#data analysis tools
#big data management in spreadsheets
#real-time data collaboration
#financial modeling with spreadsheets
#intelligent data visualization
#no-code spreadsheet solutions
#data visualization tools
#enterprise data management
#big data performance
#data cleaning solutions
#rows.com
#cloud-based spreadsheet applications
#Papers with Code
#research
#benchmarks
#SOTA