osmos::feed

Any recent examples of RL applications you really liked?

submitted by /u/paswut [link] [comments]

Input/output relationships

Hello community, assume that we have N elements, each with a set of features (Xi1, Xi2), and that the task of the DQN is to select one of these elements (so we have 3 outputs). Suppose that the input vector is [X11, X21, X12, X22]. How can the DQN relate each input feature of an element to its corresponding output? For example, how can it understand that X11 and X12 are features of element 1 even though they are far apart in the input? I hope the description is clear. submitted by /u/GuavaAgreeable208 [link] [comments]

Mentor/Expert in RL

I am an undergrad and currently finishing a thesis. I took on a project that uses RL for continuous control of a robot with a 6D pose estimator. I looked far and wide, but RL robotics might just be too unsaturated in our country. I tried to look for structured ways of learning this, such as OpenAI's Spinning Up in Deep RL, and for theoretical background in Sutton & Barto's book. I am really eager to finish this project by next year but I don't have mentors. Even the professors in our university have yet to adopt RL robotics. I saw from a past post that it's fine to ask for mentors here, so please excuse me. I apologize if I wasn't able to frame the questions well. I WANT TO ACHIEVE THESE: - Get a good grasp of RL fundamentals, especially in continuous action space control. - F…

Reproducing results of "SOLVING THE OSHI-ZUMO GAME M. Buro 2004"

Hi, I wanted to do Self-Play or Double Oracle on the Oshi-Zumo game (I omit some details, but it’s a two-player, zero-sum game. Each player has a certain number of coins, and they simultaneously bet a certain number. The player with the highest bet moves the coin towards the enemy. When the coin ‘falls’ on the side of any player, that player loses). First I figured I might as well get a Nash Equilibrium policy by actually solving the game with Linear Programming rather than using function approximation. I followed the paper SOLVING THE OSHI-ZUMO GAME by M. Buro 2004 and I used the OpenSpiel Python library. I managed (after some long debugging) to solve the [50,3,1]-Oshi-Zumo game and found the exact same result for the policy of the Nash Equilibrium at position (50,50,0) (beginning of…

Need help!! RL project

I'm currently working on a final project for university using the Q-learning algorithm. Is there anyone here who's quite proficient and can help me? submitted by /u/amulli21 [link] [comments]

KAN + RL?

KANs are good at continual learning; is it possible to use them to make on-policy methods more robust? submitted by /u/Professional_Card176 [link] [comments]

[P] Simplified PyTorch Implementation of AlphaFold 3

submitted by /u/csozboz [link] [comments]

[D] Fine-Tuning LLaVA: Duration and Configuration?

Hi everyone, I'm planning to fine-tune the LLaVA model and am curious if anyone here has experience with this. Specifically, I'm looking to understand: The size of your dataset (number of images and annotations). How long the fine-tuning process took. Your hardware setup (GPUs, CPUs, RAM). Any specific configuration settings you used. Thanks in advance submitted by /u/NbaWM2394 [link] [comments]

[R] Embeddings dimensions and MMD loss function

Hi, The ScorePerformer paper (see: https://www.researchgate.net/publication/377272701_ScorePerformer_Expressive_Piano_Performance_Rendering_With_Fine-Grained_Control) mentions that the output of the score content embedding Cs contains L rows, while all other sequences contain N rows (with N probably meaning the number of notes). I don't see why this would be L, and they also don't say what L means. They also mention the MMD loss, where it's not clear what dimensions the z and z' symbols have. I thought matrices were written capitalized (so the style embedding should then be Z), but what are these z symbols (vectors, matrices, ...)? Also, what is the || between p and q, and what type of norm is used in the Gaussian kernel? Thanks in advance submitted by /u/Emile_J_11 [link] [comments]

[D] What role do you think machine learning will play in fields like computational biology and bioinformatics in the coming years?

I believe that computational biology and bioinformatics are going to be adopting ML work more and more, and I’m quite excited to see what advancements are made. I think it is going to open up a whole new world in terms of matching diseases to current medications that could potentially be used off label. What other things should we be on the lookout for? Who are some researchers working in this world? submitted by /u/RawCS [link] [comments]

[D] Are LLM observability tools really used in startups and companies?

There are many LLM observability and monitoring tools launching every week. Are they actually used by real startups and companies? These tools seem to do one or a combination of the following: - monitor LLM inputs and outputs for prompt injection, adversarial attacks, profanity, off-topic content, etc. - monitor LLM metrics over time such as cost, latency, readability, output length, custom metrics (tone, mood, etc.), and drift - prompt management: a/b testing, versioning, gold standard set What have you observed? Do real companies that have their own LLM-powered features or products use these tools? submitted by /u/WolvesOfAllStreets [link] [comments]

[P] Title: I created a Neural Network to quickly detect spoken vowels 20 times per second

Quick disclaimer: I am aware that there is an international standard for labeling the different recognized speech sounds (phonemes), but I wanted ASCII or extended ASCII for programming simplification, so I use a different nomenclature. Besides, it's easier for me to recognize and read. Please forgive me. So I have often wondered about the real rules that govern the speech that people use. For instance, using something similar to a "glottal stop" to end words like "don't" and "that". The "t" is not pronounced. Or how "r" is almost always used as a vowel (in American English). My favorite examples are "fur", "fir", and "-fer". All three are pronounced identically and the typical "i,u,e" vowels are not pronounced at all. It's just pronounced "fr". One day I was looking at a spectrograph …

[R] Visual Guide to the K-Means Clustering Algorithm. 👥

TL;DR: K-Means clustering groups data points into clusters based on their similarities, making it useful for applications like customer segmentation, image segmentation, and document clustering. K-Means Clustering Visual Guide submitted by /u/ml_a_day [link] [comments]
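The grouping step summarised in this post can be sketched in a few lines of plain Python. This is only an illustrative version of Lloyd's algorithm, not the code from the linked guide: the function name `kmeans`, the toy `points`, and the choice to seed the centroids with the first `k` points (instead of random or k-means++ initialisation) are all assumptions made to keep the example short and deterministic.

```python
def kmeans(points, k, iters=10):
    """Lloyd's algorithm on a list of equal-length tuples."""
    # Assumption for determinism: seed the centroids with the first k points.
    centroids = list(points[:k])
    labels = [0] * len(points)
    for _ in range(iters):
        # Assignment step: each point joins its nearest centroid (squared Euclidean distance).
        labels = [
            min(range(k), key=lambda c: sum((a - b) ** 2 for a, b in zip(p, centroids[c])))
            for p in points
        ]
        # Update step: each centroid moves to the mean of its members.
        for c in range(k):
            members = [p for p, lab in zip(points, labels) if lab == c]
            if members:  # keep the old centroid if a cluster is empty
                centroids[c] = tuple(sum(col) / len(members) for col in zip(*members))
    return centroids, labels


# Hypothetical toy data: two well-separated groups, interleaved.
points = [(1.0, 1.0), (8.0, 8.0), (1.2, 0.8), (8.2, 7.9), (0.9, 1.1), (7.8, 8.1)]
centroids, labels = kmeans(points, k=2)
print(labels)
```

On real data the result depends on the initial centroids, so it is common to run the algorithm several times and keep the clustering with the lowest within-cluster distance.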

[D] SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion

Happy to share my latest Medium article about Time Series Forecasting: "SOFTS: Efficient Multivariate Time Series Forecasting with Series-Core Fusion". It is about SOFTS, an innovative MLP-based model that utilizes the novel STar Aggregate-Dispatch (STAD) module to centralize channel interactions, achieving superior forecasting performance with linear complexity. Unlike traditional methods that struggle with the trade-off between robustness and complexity, SOFTS efficiently captures channel correlations, paving the way for scalable and accurate predictions across various fields like finance, traffic management, and healthcare. https://medium.com/towards-artificial-intelligence/softs-efficient-multivariate-time-series-forecasting-with-series-core-fusion-0ac40d2adcd2 submitted by /u/rezayazdanfar [link] [comments]

[D] Does DSPy actually change the LM weights?

I always thought it's essentially glorified and structured prompt engineering (very useful still IMO), but it also claims in the docs that it fine-tunes and changes LM weights, and then absolutely refuses to elaborate on this in any of the sections in their docs. I don't even understand how it can change the actual parameters of the LM, especially if we're using third party API calls for the LMs. By LM weights, I assume it means the weights of the last layers of the transformer model. When they describe optimizers, they say "DSPy introduces new optimizers, which are LM-driven algorithms that can tune the prompts and/or the weights of your LM calls, given a metric you want to maximize." Am I misunderstanding what they mean by LM weights? I'm sorry if this is a stupid question, but I just can't seem to find any information about this. Thanks in advance! submitted by /u/chessnudes [link] [comments]

[P] Text to Openpose and Weird RNN bugs

I want to create an AI that generates OpenPose poses from a textual description. For example, if the input is "a man running", the output would be like the image I provided. Is there any model architecture you would recommend? My data conditions are canvas_width: 900px canvas_height: 300px frames: 5 (5 person) expected output I am trying to train an RNN for this task. I use a sentence transformer to embed the text and then pass the embedding to the RNN, and the loss looks like the image below from sentence_transformers import SentenceTransformer sentence_model = SentenceTransformer("all-MiniLM-L6-v2") text = "a man running" text_input = torch.tensor(sentence_model.encode(text), dtype=torch.float) loss image with num_layers=3 My RNN setting embedding_dim = 384 hidden_dim = 512 num_layers = 3 output_dim = 180 num_epochs = 100 learni…

[D] How did OpenAI go from doing exciting research to a big-tech-like company?

I was recently revisiting OpenAI’s paper on Dota 2 (OpenAI Five), and it’s so impressive what they did there from both an engineering and a research standpoint. Creating a distributed system of 50k CPUs for the rollouts and 1k GPUs for training, while taking between 8k and 80k actions from 16k observations per 0.25s: how crazy is that?? They also were doing “surgeries” on the RL model to recover weights as their reward function, observation space, and even architecture changed over the couple of months of training. Last but not least, they beat the OG team (world champions at the time) and deployed the agent to play live with other players online. Fast forward a couple of years, and they are predicting the next token in a sequence. Don’t get me wrong, the capabilities of GPT-4 and its omni version are a truly amazing feat of engineering and research (and probably much more useful), but they don’t seem to be as interesting (from the research perspective) as some of their previous work. So now I am wondering: how did the engineers and researchers transition throughout the years? Was it mostly due to their financial situation and need to become profitable, or is there a deeper reason for their transition? submitted by /u/UnluckyNeck3925 [link] [comments]

[D] Computer vision in ICML

Hi, this is my first year attending ICML. Based on past conferences, I was wondering how much content on computer vision typically appears at this conference, if any? submitted by /u/hilabar [link] [comments]

Multimodal AI from First Principles - Most fundamental approaches [D]

Sharing a video I made on some of the most critical and fundamental building blocks to train Multimodal models for the past decade or so… hope you enjoy if the topic interests you! submitted by /u/AvvYaa [link] [comments]

[P] TensorRT C++ codebase for ONNX models: Dynamic batching, All models, Single file models

https://github.com/PrinceP/tensorrt-cpp-for-onnx/tree/main Created a repo with a C++ codebase for TensorRT using ONNX models. Currently YOLOV9 and YOLOV8 [Detect, Segment, Classify, OBB, POSE] are coded. Other models are in progress. submitted by /u/Grapefruit-Narrow [link] [comments]

[D] Simple Questions Thread

Please post your questions here instead of creating a new thread. Encourage others who create new posts for questions to post here instead! Thread will stay alive until next one so keep posting after the date in the title. Thanks to everyone for answering questions in the previous thread! submitted by /u/AutoModerator [link] [comments]

[D] How to definitely say if my Dataset is Gaussian

I'm following some tutorials on linear regression, and as I was building my notebook I started working on outlier detection. Among the techniques described for outlier detection, one involves calculating the standard deviation, but for this I need to know whether my columns follow a Gaussian distribution. I'm aware that there are different techniques like: Histograms, KDE Plot, Q-Q Plot, Kolmogorov-Smirnov Test, Shapiro-Wilk Test, D'Agostino and Pearson's Test. And I bet there are a few more as well. So what is the best one to use? I guess histograms just give a clue but do not give a definite answer. What is the standard practice to identify whether a dataset is Gaussian or not? submitted by /u/CaterpillarPrevious2 [link] [comments]
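As one way to turn the Q-Q plot from the list above into a number, the sketch below correlates the sorted column with standard-normal quantiles, using only the Python standard library. It is a rough diagnostic under stated assumptions, not a formal test and not a claim about standard practice: the function name `qq_correlation` and both synthetic columns are made up for illustration. For the formal tests named in the post, SciPy provides `scipy.stats.shapiro`, `scipy.stats.normaltest` (D'Agostino and Pearson), and `scipy.stats.kstest`.

```python
import random
from statistics import NormalDist, mean


def qq_correlation(sample):
    """Pearson correlation between sorted data and standard-normal quantiles."""
    xs = sorted(sample)
    n = len(xs)
    std_normal = NormalDist()
    # Plotting positions (i + 0.5) / n stay strictly inside (0, 1).
    qs = [std_normal.inv_cdf((i + 0.5) / n) for i in range(n)]
    mx, mq = mean(xs), mean(qs)
    cov = sum((x - mx) * (q - mq) for x, q in zip(xs, qs))
    var_x = sum((x - mx) ** 2 for x in xs)
    var_q = sum((q - mq) ** 2 for q in qs)
    return cov / (var_x * var_q) ** 0.5


# Hypothetical columns: one drawn from a Gaussian, one from a skewed distribution.
rng = random.Random(0)
gaussian_col = [rng.gauss(10.0, 2.0) for _ in range(500)]
skewed_col = [rng.expovariate(1.0) for _ in range(500)]

r_gauss = qq_correlation(gaussian_col)
r_skew = qq_correlation(skewed_col)
print(f"gaussian: {r_gauss:.4f}  skewed: {r_skew:.4f}")
```

A value very close to 1 means the points of the Q-Q plot lie near a straight line, which is consistent with a Gaussian column; a clearly lower value points to skew or heavy tails. No single check is definitive, so it is usually combined with a plot and, if needed, one of the formal tests.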

[D] Culture of Recycling Old Conference Submissions in ML

I work on statistical ML. I notice that many people (including myself and those that I review) often recycle their submissions for ML conferences. E.g., if their papers got rejected by ICML, they submit to NeurIPS, and later to ICLR (or UAI/AISTATS which are also top in my field). If they did not get into ICML/NeurIPS/ICLR after 2~3 times, they would submit them to AAAI/IJCAI/TMLR/ICDM, journals like T-NNLS/T-KDD/NN/Neurocomputing, or domain-specific venues like LoG/CoLLAs/AABI. After all these, if the paper still did not get accepted, they then simply put it on arXiv. I believe this might also be the case for CV/NLP. As a reviewer, I often encounter conference submissions where the authors resubmit without really taking into account the previous reviews provided. Sometimes they do incorporate the reviews when resubmitting--but sometimes the work may just not be at the level of Tier 1 conferences, yet they keep resubmitting and hoping that it gets accepted by chance. I think that this is consuming a lot of reviewers' time from the community to keep reviewing the same submissions (especially given that NeurIPS hits 20k submission id; I expect to see many resubmissions). This is perhaps also one of the reasons TMLR was born (to emphasize correctness instead of novelty). I do understand arguments like "the quality of research is more important than the publication venues" or "OpenAI often simply just put their papers like GPT-X on arXiv these days". However, students or junior researchers also need publications in their career, including myself. What do folks think about it? submitted by /u/zy415 [link] [comments]

[D] How Do You Efficiently Conduct Ablation Studies in Machine Learning?

When conducting ablation studies for a model that can be pretrained and fine-tuned, do you perform a full grid search for each ablated version during both pretraining and fine-tuning? Or do you have strategies to make this process more efficient? Thank you for your insights. submitted by /u/Few-Pomegranate4369 [link] [comments]

[P] N-way-attention

I have been playing with the concept of attending to more than two tokens in transformer models. For example, instead of having one query and one key, having two keys and one query, and for every query summing over every pair of previous tokens. It makes the algorithm even slower (O(n**3) instead of O(n**2)), but I think it is a fun concept. Some results were surprising to me, like how good it is at finding the longest increasing subsequence. I want to share it: https://github.com/Gusanidas/n-way-attention/tree/main And to ask if anyone knows of papers that treat the concept, or mention it. submitted by /u/Gusanidas [link] [comments]
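To make the "one query, two keys, sum over every pair of previous tokens" idea concrete, here is a minimal single-head sketch in plain Python. It is not taken from the linked repository: the trilinear score, the softmax over pairs, and averaging the two value vectors of each pair are assumptions chosen for illustration, and the name `three_way_attention` is hypothetical.

```python
import math


def three_way_attention(q, k1, k2, v):
    """Causal attention where each query scores pairs (j, l) of tokens up to its own position.

    q, k1, k2, v are lists of n vectors (lists of floats) with the same dimension.
    """
    n, d = len(q), len(q[0])
    out = []
    for i in range(n):
        # All ordered pairs of positions at or before i: O(n^2) pairs per query, O(n^3) overall.
        pairs = [(j, l) for j in range(i + 1) for l in range(i + 1)]
        # Assumed trilinear score: sum over c of q_i[c] * k1_j[c] * k2_l[c].
        scores = [sum(q[i][c] * k1[j][c] * k2[l][c] for c in range(d)) for j, l in pairs]
        # Numerically stable softmax over the pairs.
        m = max(scores)
        exps = [math.exp(s - m) for s in scores]
        z = sum(exps)
        o = [0.0] * d
        for (j, l), e in zip(pairs, exps):
            w = e / z
            for c in range(d):
                # Assumed pair value: the average of the two tokens' value vectors.
                o[c] += w * (v[j][c] + v[l][c]) / 2
        out.append(o)
    return out


# With an all-zero query every pair gets the same weight, so each output is a plain average.
x = [[1.0], [3.0]]
zeros = [[0.0], [0.0]]
out = three_way_attention(zeros, x, x, x)
print(out)
```

The nested loop over pairs is where the extra factor of n comes from compared with ordinary attention, which scores single positions instead of pairs.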

[D] What Is The Current State of LLM Ops

Curious about how people are putting their RAG and other LLM-powered applications into production today. How do you define LLM Ops? What is the process like in your team/company, what combination of tools are you using today to implement or automate those processes, and what are some of the gap areas? I'm especially interested in what people are doing around the issue of efficiently scaling larger models across nodes in production settings. Do you apply any GPU virtualization/fractionalization, and what are some good resources for these? submitted by /u/gamerx88 [link] [comments]

Intersection of ML & Distributed Systems [D]

What are some existing problems at the intersection of Distributed Systems and ML? I have a decent background in both, and I want to work on projects that employ distributed computing to solve problems in ML. What are some good resources to look at? Or how to start? submitted by /u/tcuser12 [link] [comments]

[P] Catfusion: Diffusion model for generating cat images

I've been working on this project for a while now. It can only generate nightmare fuel images that don't even look like cats, but I'm trying to make it better. Here's the repo: https://github.com/Null-byte-00/Catfusion and here's the Jupyter notebook: https://nbviewer.org/github/Null-byte-00/Catfusion/blob/main/catfusion.ipynb submitted by /u/Soroush_ra [link] [comments]

[D] Why don’t we see zero-shot TruthfulQA performance listed in papers?

My intuition was that it’s one of the most important metrics, but we normally see multi-shot performance. For example, the Phi-3 paper reported 10-shot performance. submitted by /u/Bytesfortruth [link] [comments]

What's the likelihood of free & open source AI video models catching up or being on par with stuff like SORA in quality in the near future?

Could we possibly see Facebook/Meta release something free & open source that's on par with the models shown off so far, like SORA, whose quality is ahead of most? They seem like one of the only companies releasing open source models that have the resources to compete with OAI & the like. submitted by /u/CaptainAnonymous92 [link] [comments]

Will AI Become New Infinite Scroll?

Last week, OpenAI demonstrated GPT-4o. AI voice assistants are getting super real, with features like laughter and sighs, very much like the movie Her (2013). It shows how far we’ve come: the present is catching up with the future and science fiction is quickly becoming reality. This is awesome for things like learning and tutoring, but we also have to be careful. What happens if we are spending too much time with AI? submitted by /u/jurgo123 [link] [comments]

Against Computers (infinite play)

submitted by /u/LeatherJury4 [link] [comments]

New AI tool that changes the face, body or style (cartoon, superhero, situation, etc) of anyone.

submitted by /u/jacobgc75 [link] [comments]

Better Help using AI to write articles? Random article based on a Vocaloid song completely out of context.

submitted by /u/cornho1eo99 [link] [comments]

Why neural networks struggle with the Game of Life

submitted by /u/nickb [link] [comments]

Kolmogorov-Arnold Networks (KANs) Explained:

A new neural network architecture called KANs was recently released; KANs are capable of capturing more complex non-linearity than conventional neural networks. Find the maths and how KANs work in this new tutorial: https://youtu.be/LpUP9-VOlG0?si=sNk8vUeYNX3vxVPf submitted by /u/mehul_gupta1997 [link] [comments]

Introducing OpenAI Japan

We are excited to announce our first office in Asia and we’re releasing a GPT-4 custom model optimized for the Japanese language. ( 2 min )

Introducing improvements to the fine-tuning API and expanding our custom models program

We’re adding new features to help developers have more control over fine-tuning and announcing new ways to build custom models with OpenAI. ( 4 min )

Implementing Gradient Descent in PyTorch

The gradient descent algorithm is one of the most popular techniques for training deep neural networks. It has many applications in fields such as computer vision, speech recognition, and natural language processing. While the idea of gradient descent has been around for decades, it’s only recently that it’s been applied to applications related to deep […] The post Implementing Gradient Descent in PyTorch appeared first on MachineLearningMastery.com. ( 25 min )
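As a rough illustration of the update rule this excerpt refers to, the sketch below runs gradient descent on a mean squared error loss for a one-variable linear model. It is written in plain Python rather than PyTorch so that it runs without dependencies, and it is not the code from the linked post: the gradients are derived by hand (in PyTorch, autograd would compute them), and the function name, learning rate, step count, and toy data are all assumptions.

```python
def gradient_descent(xs, ys, lr=0.05, steps=2000):
    """Fit y ~ w * x + b by minimising mean squared error with plain gradient descent."""
    w, b = 0.0, 0.0
    n = len(xs)
    for _ in range(steps):
        # Gradients of L = mean((w * x + b - y) ** 2) with respect to w and b.
        grad_w = sum(2 * (w * x + b - y) * x for x, y in zip(xs, ys)) / n
        grad_b = sum(2 * (w * x + b - y) for x, y in zip(xs, ys)) / n
        # Step against the gradient, scaled by the learning rate.
        w -= lr * grad_w
        b -= lr * grad_b
    return w, b


# Hypothetical noise-free data generated from y = 2x + 1.
xs = [0.0, 1.0, 2.0, 3.0, 4.0]
ys = [2 * x + 1 for x in xs]
w, b = gradient_descent(xs, ys)
print(w, b)
```

The learning rate matters: too large a value makes the updates diverge, while too small a value makes convergence slow, so it usually has to be tuned for the data at hand.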

Training a Linear Regression Model in PyTorch

Linear regression is a simple yet powerful technique for predicting the values of variables based on other variables. It is often used for modeling relationships between two or more continuous variables, such as the relationship between income and age, or the relationship between weight and height. Likewise, linear regression can be used to predict continuous […] The post Training a Linear Regression Model in PyTorch appeared first on MachineLearningMastery.com. ( 24 min )

Making Linear Predictions in PyTorch

Linear regression is a statistical technique for estimating the relationship between two variables. A simple example of linear regression is to predict the height of someone based on the square root of the person’s weight (that’s what BMI is based on). To do this, we need to find the slope and intercept of the line. […] The post Making Linear Predictions in PyTorch appeared first on MachineLearningMastery.com. ( 21 min )
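For a single input variable, the slope and intercept mentioned in this excerpt have a closed-form least-squares solution. The sketch below shows it in plain Python; it is an illustration under stated assumptions, not the linked post's PyTorch code, and the helper names `fit_line` and `predict` as well as the toy numbers are hypothetical.

```python
def fit_line(xs, ys):
    """Ordinary least squares for one input: returns (slope, intercept)."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    # slope = covariance(x, y) / variance(x)
    slope = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys)) / sum(
        (x - mean_x) ** 2 for x in xs
    )
    # The fitted line passes through the point of means.
    intercept = mean_y - slope * mean_x
    return slope, intercept


def predict(x, slope, intercept):
    """Linear prediction y_hat = slope * x + intercept."""
    return slope * x + intercept


# Hypothetical data lying exactly on y = 2x + 1.
xs = [1.0, 2.0, 3.0, 4.0]
ys = [3.0, 5.0, 7.0, 9.0]
slope, intercept = fit_line(xs, ys)
y_hat = predict(5.0, slope, intercept)
print(slope, intercept, y_hat)
```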

Loading and Providing Datasets in PyTorch

Structuring the data pipeline in a way that it can be effortlessly linked to your deep learning model is an important aspect of any deep learning-based system. PyTorch packs everything to do just that. While in the previous tutorial, we used simple datasets, we’ll need to work with larger datasets in real world scenarios in […] The post Loading and Providing Datasets in PyTorch appeared first on MachineLearningMastery.com. ( 20 min )

Using Dataset Classes in PyTorch

In machine learning and deep learning problems, a lot of effort goes into preparing the data. Data is usually messy and needs to be preprocessed before it can be used for training a model. If the data is not prepared correctly, the model won’t be able to generalize well. Some of the common steps required […] The post Using Dataset Classes in PyTorch appeared first on MachineLearningMastery.com. ( 21 min )

Calculating Derivatives in PyTorch

Derivatives are one of the most fundamental concepts in calculus. They describe how changes in the variable inputs affect the function outputs. The objective of this article is to provide a high-level introduction to calculating derivatives in PyTorch for those who are new to the framework. PyTorch offers a convenient way to calculate derivatives for […] The post Calculating Derivatives in PyTorch appeared first on Machine Learning Mastery. ( 20 min )

Two-Dimensional Tensors in Pytorch

Two-dimensional tensors are analogous to two-dimensional matrices. Like a two-dimensional matrix, a two-dimensional tensor also has $n$ number of rows and columns. Let’s take a gray-scale image as an example, which is a two-dimensional matrix of numeric values, commonly known as pixels. Ranging from ‘0’ to ‘255’, each number represents a pixel intensity value. Here, […] The post Two-Dimensional Tensors in Pytorch appeared first on Machine Learning Mastery. ( 21 min )

One-Dimensional Tensors in Pytorch

PyTorch is an open-source deep learning framework based on Python language. It allows you to build, train, and deploy deep learning models, offering a lot of versatility and efficiency. PyTorch is primarily focused on tensor operations while a tensor can be a number, matrix, or a multi-dimensional array. In this tutorial, we will perform some […] The post One-Dimensional Tensors in Pytorch appeared first on Machine Learning Mastery. ( 22 min )

365 Data Science courses free until November 21

Sponsored Post The unlimited access initiative presents a risk-free way to break into data science. The online educational platform 365 Data Science launches the #21DaysFREE campaign and provides 100% free unlimited access to all content for three weeks. From November 1 to 21, you can take courses from renowned instructors and earn […] The post 365 Data Science courses free until November 21 appeared first on Machine Learning Mastery. ( 15 min )

Attend the Data Science Symposium 2022, November 8 in Cincinnati

Sponsored Post Attend the Data Science Symposium 2022 on November 8 The Center for Business Analytics at the University of Cincinnati will present its annual Data Science Symposium 2022 on November 8. This all day in-person event will have three featured speakers and two tech talk tracks with four concurrent presentations in each track. The […] The post Attend the Data Science Symposium 2022, November 8 in Cincinnati appeared first on Machine Learning Mastery. ( 10 min )

My family's unlikely homeschooling journey

My husband Jeremy and I never intended to homeschool, and yet we have now, unexpectedly, committed to homeschooling long-term. Prior to the pandemic, we both worked full-time in careers that we loved and found meaningful, and we sent our daughter to a full-day Montessori school. Although I struggled with significant health issues, I felt unbelievably lucky and fulfilled in both my family life and my professional life. The pandemic upended my careful balance. Every family is different, with different needs, circumstances, and constraints, and what works for one may not work for others. My intention here is primarily to share the journey of my own (very privileged) family. Our unplanned introduction to homeschooling For the first year of the pandemic, most schools in California, where … ( 7 min )

The Jupyter+git problem is now solved

Jupyter notebooks don’t work with git by default. With nbdev2, the Jupyter+git problem has been totally solved. It provides a set of hooks which provide clean git diffs, solve most git conflicts automatically, and ensure that any remaining conflicts can be resolved entirely within the standard Jupyter notebook environment. To get started, follow the directions on Git-friendly Jupyter. Contents The Jupyter+git problem The solution The nbdev2 git merge driver The nbdev2 Jupyter save hook Background The result Postscript: other Jupyter+git tools ReviewNB An alternative solution: Jupytext nbdime The Jupyter+git problem Jupyter notebooks are a powerful tool for scientists, engineers, technical writers, students, teachers, and more. They provide an ideal notebook environment for interact… ( 7 min )