Machine Learning

51

2

what are you reading this week ? (feddit.nl)

submitted 1 year ago by grannyweatherwax@feddit.nl to c/machinelearning@lemmy.ml

0 comments fedilink

Hello Machine Learning Community,

The intention of this post is to replicate a similar tradition from R/machinelearning and to trigger engagement. This post will be created weekly.

What are you reading this week and any thoughts to share?

52

1

[Solved] PyTorch Lightning is bottlenecked by the CPU (lemmy.ml)

submitted 1 year ago* (last edited 1 year ago) by kernelPanic@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

When I train my PyTorch Lightning model on two GPUs on jupyter lab with strategy="ddp_notebook", only two CPUs are used and their usages are 100%. How can I overcome this CPU bottleneck?

Edit: I tested with PyTorchProfiler and it was because of old ssds used on the server

53

1

MetaGPT: Meta Programming for Multi-Agent Collaborative Framework (arxiv.org)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

54

1

Huawei Cloud Pangu-Weather Model Now Available on European Weather Agency Website (www.huawei.com)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

55

1

A diffusion model to colorize black and white images (github.com)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

56

1

Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors (guochengqian.github.io)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

57

1

Containers for machine learning (github.com)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

58

1

what are you reading this week ? (feddit.nl)

submitted 1 year ago by grannyweatherwax@feddit.nl to c/machinelearning@lemmy.ml

0 comments fedilink

Hello Machine Learning Community,

The intention of this post is to replicate a similar tradition from R/machinelearning and to trigger engagement. This post will be created weekly.

What are you reading this week and any thoughts to share?

59

2

ChatGPT is David Copperfield (lemmy.ml)

submitted 1 year ago by igalmarino@lemmy.ml to c/machinelearning@lemmy.ml

2 comments fedilink

60

1

RT-2: New model translates vision and language into action (www.deepmind.com)

submitted 1 year ago by fox@lemm.ee to c/machinelearning@lemmy.ml

0 comments fedilink

61

1

Structure-informed Language Models Are Protein Designers (arxiv.org)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

62

1

what are you reading this week ? (feddit.nl)

submitted 1 year ago by grannyweatherwax@feddit.nl to c/machinelearning@lemmy.ml

0 comments fedilink

Hello Machine Learning Community,

The intention of this post is to replicate a similar tradition from R/machinelearning and to trigger engagement. This post will be created weekly.

What are you reading this week and any thoughts to share?

63

2

[Paper] Learning to Generate Better Than Your LLM (arxiv.org)

submitted 1 year ago by EthicalAI@beehaw.org to c/machinelearning@lemmy.ml

0 comments fedilink

I was looking through papers that combine LLMs and RL and this was pretty fascinating and the citations are perfect for continuing my search.

64

1

GitHub - aerdem4/lofo-importance: Leave One Feature Out Importance (github.com)

submitted 1 year ago by igalmarino@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

65

1

Almost All Research on the Mind is in English. That May Be a Problem (www.wired.com)

submitted 1 year ago by ZephyrXero@lemmy.world to c/machinelearning@lemmy.ml

0 comments fedilink

66

1

what are you reading this week ? (feddit.nl)

submitted 1 year ago by grannyweatherwax@feddit.nl to c/machinelearning@lemmy.ml

0 comments fedilink

Hello Machine Learning Community,

The intention of this post is to replicate a similar tradition from R/machinelearning and to trigger engagement. This post will be created weekly.

What are you reading this week and any thought to share on it ?

67

1

In-Memory Computing and Analog Chips for AI (archive.ph)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

68

1

GPT-4's details are leaked (pastebin.com)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

69

1

what do you all think about a weekly "what are you reading ?" post ? (feddit.nl)

submitted 1 year ago by grannyweatherwax@feddit.nl to c/machinelearning@lemmy.ml

0 comments fedilink

I'd love to know what others are reading, why they think it's awesome (or not). In general, get an exposure to other sub genres of ML. Most of the papers I read are in the computer vision domain cause of work so I'd appreciate reading more about others.

So...

Are you all interested in such a post ?
If yes, which day of the week ?

70

1

Gaussian processes from scratch (peterroelants.github.io)

submitted 1 year ago by igalmarino@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

71

1

Great series by Andrej Karpathy on machine learning and training (www.youtube.com)

submitted 1 year ago by TommyCat@lemmy.world to c/machinelearning@lemmy.ml

0 comments fedilink

Great series on machine learning. Posting for anyone interested in more of the details on the AI's and LLM's and how they're built/trained.

72

1

SDXL Model Report (github.com)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

TLDR of Stability-AI's Paper:

Summary: The document discusses the advancements and limitations of the Stable Diffusion (SDXL) model for text-to-image synthesis. SDXL shows significant improvements in synthesized image quality, prompt adherence, and composition. However, it also has limitations such as challenges in synthesizing intricate structures like human hands, achieving perfect photorealism, addressing biases, mitigating concept bleeding, and improving text rendering. The document also compares SDXL with Midjourney v5.1, where SDXL shows a slight preference in terms of prompt adherence. The document concludes with suggestions for future improvements.

Key Takeaways:

SDXL outperforms or is statistically equal to Midjourney V5.1 in 7 out of 10 categories.
SDXL does not achieve better FID scores than the previous SD versions. This suggests the need for additional quantitative performance scores, specifically for text-to-image foundation models.
SDXL outperforms Midjourney V5.1 in all but two categories in the user preference comparison.
The model may encounter challenges when synthesizing intricate structures, such as human hands.
The model does not attain perfect photorealism. Certain nuances, such as subtle lighting effects or minute texture variations, may still be absent or less faithfully represented in the generated images.
The model’s training process heavily relies on large-scale datasets, which can inadvertently introduce social and racial biases.
The model may exhibit a phenomenon known as “concept bleeding” where distinct visual elements unintentionally merge or overlap.
The model encounters difficulties when rendering long, legible text.
Future work should investigate ways to provide a single stage of equal or better quality, improve text synthesis, enable scaling to much larger transformer-dominated architectures, decrease the compute needed for inference, and increase sampling speed.

73

1

AI makes non-invasive mind-reading possible by turning thoughts into text (amp.theguardian.com)

submitted 1 year ago by yogthos@lemmy.ml to c/machinelearning@lemmy.ml

0 comments fedilink

74

1

Why Are There No Consumer Server GPUs? (social.tath.link)

submitted 1 year ago by maxerature@social.tath.link to c/machinelearning@lemmy.ml

0 comments fedilink

I work with machine learning tasks daily, both as an ML researcher and as a hobby. The difference between what I can do at work and at home is significant - an A40 at work can do far more than the 3080 I have at home. This obviously makes sense, given the massively increased price point.

However, what I find odd is how there are no consumer level server GPUs targeted towards ML on the market. The A40 is not just a scaled up consumer GPU, and with machine learning growing as a hobby, consumer and enthusiast-level server GPUs are a surprising market gap.

75

1

IBM Watson machine learning stopped development at PyTorch 1.7 (lemmy.world)

submitted 1 year ago by felixquinihildebet@lemmy.world to c/machinelearning@lemmy.ml

0 comments fedilink

https://public.dhe.ibm.com/ibmdl/export/pub/software/server/ibm-ai/conda/#/

On the face of it, the ability to run models larger than GPU memory would seem to be extremely valuable. Why did they give up? Not everyone has an 80GB GPU.

Was the performance too slow?