@ogrisel@sigmoid.social
@ogrisel@sigmoid.social avatar

ogrisel

@ogrisel@sigmoid.social

Machine Learning Engineer at :probabl., scikit-learn core contributor. #Python, #Pydata, #MachineLearning & #DeepLearning.

This profile is from a federated server and may be incomplete. Browse more on the original instance.

ogrisel, to random French
@ogrisel@sigmoid.social avatar
ogrisel, (edited )
@ogrisel@sigmoid.social avatar

@magsol not necessarily directly useful from a practical point of view, e.g. for logistic regression, in particular with correlated features and rare categorical values, second order methods are probably better while for deep transformers trained on datasets with millions of data points, stochastic solvers with momentum like Adam are probably more efficient. Yet it's interesting to see that cyclic learning rate schedules with very large learning rates can be theoretically justified.

arthurzenika, to security French
@arthurzenika@pouet.chapril.org avatar

Hier, en "pause tech" chez mon client, j'ai présenté quelques solutions matérielles pour faire de l'authentification multi facteurs (2FA/MFA/TOTP). J'ai parlé de yubikey, solokeys, titan keys. Et aussi des solutions logicielles: Authenticator, FreeOTP, LastPass, etc.

Coté applications qui permette l'usage de cette bonne pratique de sécurité, j'ai découvert https://www.dongleauth.com/

Vous utilisez quoi vous ?

#security #2fa #mfa

ogrisel,
@ogrisel@sigmoid.social avatar

@arthurzenika Un combo FreeOTP + yubikeys. J ai pas testé les autres.

ogrisel, (edited ) to random
@ogrisel@sigmoid.social avatar

Jérémie has just released threadpoolctl 3.2.0:

https://pypi.org/project/threadpoolctl/

This is a small Python library to inspect and change the size of the threadpools used by libraries dynamically linked to a Python program (e.g. OpenBLAS, MKL, OpenMP runtimes...).

It is quite useful to debug oversubscription problems in the #SciPy / #PyData ecosystem.

This new version makes it possible to register a custom controller for your own native library. See the changelog for details:

https://github.com/joblib/threadpoolctl/blob/master/CHANGES.md

ogrisel, (edited ) to ArtificialIntelligence French
@ogrisel@sigmoid.social avatar

LongNet: Scaling Transformers to 1,000,000,000 Tokens

https://arxiv.org/abs/2307.02486

#deeplearning #transformers

ogrisel,
@ogrisel@sigmoid.social avatar

@val It's not clear which model size is used to produce the right hand side figure nor if they kept a fixed context size to produce the left hand side figure.

ogrisel,
@ogrisel@sigmoid.social avatar

I tried to edit the above post to add the missing ALT text to the second screenshot but sigmoid.social & elk do not take the change into account, so here it is:

Figure 7: Left: Test loss of LONGNET with an increasing model size. The scaling curve follows a similar law to the vanilla Transformers. Right: Test loss of LONGNET using different context windows. A longer context window yields better language modeling.

ogrisel, (edited ) to random
@ogrisel@sigmoid.social avatar

joblib 1.3.0 is out in the wild!

joblib is a library that provides an generic way to call into thread-based, process-based and distributed parallelism (via external backends) + a way to cache expensive computation in repeated function calls on disk.

https://joblib.readthedocs.io

This new release provides several major new features, inclusing a return_as="generator" argument to the Parallelclass to make it possible to aggregate parallel results when ready (preserving the submission order).

1/4

ogrisel, (edited )
@ogrisel@sigmoid.social avatar

In the future this will also be extended to return_as="unordered_generator" to optionally make it possible to aggregate results as soon as ready.

This release also includes a new parallel_config context manager as an extension to parallel_backend to make it possible to configure all the arguments of the Parallel class and not just the backend using a context manager idiom.

Detailed changelog:
https://github.com/joblib/joblib/blob/master/CHANGES.rst#release-130----20230628

2/4

ogrisel, (edited )
@ogrisel@sigmoid.social avatar

As a side benefit of this refactoring, the traceback of an exception raised in sequential mode (n_jobs=1) is now flatter.

3/4

ogrisel, (edited )
@ogrisel@sigmoid.social avatar

And thanks to everybody involved in making this happen, and Thomas as the release manager in particular.

4/4

ogrisel, (edited )
@ogrisel@sigmoid.social avatar

@rupdecat I think that at the time I preferred the side-effect free design of cloudpickle if I remember correctly.

  • All
  • Subscribed
  • Moderated
  • Favorites
  • JUstTest
  • mdbf
  • ngwrru68w68
  • cubers
  • magazineikmin
  • thenastyranch
  • rosin
  • khanakhh
  • InstantRegret
  • Youngstown
  • slotface
  • Durango
  • kavyap
  • DreamBathrooms
  • megavids
  • tacticalgear
  • osvaldo12
  • normalnudes
  • tester
  • cisconetworking
  • everett
  • GTA5RPClips
  • ethstaker
  • anitta
  • Leos
  • provamag3
  • modclub
  • lostlight
  • All magazines