Loading
Feedback

TruthMaker 104

Name

Tjad Clark

Organization

Yobisense (Pty) Ltd

Location

Cape Town, ZA

Badges

0
0
2

Activity

Nov
Dec
Jan
Feb
Mar
Apr
May
Jun
Jul
Aug
Sep
Oct
Nov
Mon
Wed
Fri

Ratings Progression

Loading...

Challenge Categories

Loading...

Challenges Entered

Latest submissions

No submissions made in this challenge.

Sample-efficient reinforcement learning in Minecraft

Latest submissions

No submissions made in this challenge.

Multi Agent Reinforcement Learning on Trains.

Latest submissions

No submissions made in this challenge.

A new benchmark for Artificial Intelligence (AI) research in Reinforcement Learning

Latest submissions

See All
graded 9081
graded 9077
submitted 9068
Gold 0
Silver 0
Bronze 2
Trustable
May 16, 2020
Newtonian
May 16, 2020

Badges

  • Has filled their profile page
    May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020

  • May 16, 2020
  • Kudos! You've won a bronze badge in this challenge. Keep up the great work!
    Challenge: Unity Obstacle Tower Challenge
    May 16, 2020
Participant Rating
Participant Rating
TruthMaker has not joined any teams yet...

Snake Species Identification Challenge

Qualification for the end of round1

Over 1 year ago

Woops! Apologies, missed the tag.

Qualification for the end of round1

Over 1 year ago

Unity Obstacle Tower Challenge

GitHub repo gone?

Over 1 year ago

Not sure if I’m responding in time … both repos are accessible to me at present.

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

@hanschoi86, thanks for the added clarity. And Yikes! Out of interest, was looking at your guys “standard” vs “itemized” deductions is quite scary! Some empathy shed. What really works in the complexity is that different states may apply different taxes :open_mouth:

Yes, the “accounting rule” I speak of applies to “businesses” (self-employment qualifies here, which is the audience I was aiming at).
I imagined that nobody would be competing as a company (you’d hence be acting as self-employed) - think being an individual or team of individuals was part of the competition rules.

Wikipedia affirms that the “accounting rule” applies to “corporations” (self-employed inclusive)

Reading in, If you’d like to pursuit this route, you need to be eligible for schedule C (or C-EZ) and file under those schedule terms. See IRS.

Found this interesting, thanks for the conversation & input

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

@wywarren I believe they scheduled for 15 May 2019 - see this link

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

@ross_wightman ah ok. I dispute tax law, and I encourage protesting against it. I do comply with the law non the less, doesn’t mean we can’t protest though - otherwise we’ll just continue to be taken advantage of. Gotta speak up for ourselves in today’s cruel world.

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

@isy @ross_wightman Please bear in mind that it is your government who dictates the laws of your country, if you wanted to be upset with anyone regarding this tax dilemma, protest with your government. Tax is entirely unnecessary, and civilisation survived without income tax until less than 100 years ago. (American history, income tax was introduced to recuperate after a civil war, that civil war has long passed.)

I do agree that decoupling of the technical challenge from the taxes could have been provisioned.

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

As mentioned above, every country’s tax is governed differently, look at your tax law, or enquire with a tax practitioner. I just qualify as a businessman.

My tax tip comes from basic accounting. Buying bread for $1 and selling it for $1, means you have 0 profit. As far as I’m aware, generally tax is applied to profit. Similarly, if you “won” $1100, and spent $1100 in the same event in trying to obtain more profit, you ultimately make 0 profit, and hence no tax implications. (unless winnings increased)

As for the length of time that the paperwork for the tax takes, it’s essentially 1 or 2 records in your ledger ? perhaps 5-10min max. Whilst this is in context of a business, it would apply similarly to a service provided by an individual. Question you’re asking is, “Did I make a profit?”. So just keep your paper trail - receipts from google cloud.

Here’s a more complicated scenario:
The entry to this competition costs $500. I ultimately win $1100 from this competition. An accountant doing their job properly would only have you taxed for $600 ($1100 - $500 ), as the entry fee was an expense in order for you to gain the ability to obtain the $1100. If you didn’t win the $1100 however, you would have generated a loss of $500 and you would not be taxed at all - but you’d be smart to keep your accounting up to date.

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

Tax Tip: Deplete the $1100 within Round 2, then it wont be taxable - as an expense in toward generating income.

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

Ah, interesting. I’m a team of 1, and have only seen 1 other email like that of the leader.

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

@Miffyli I agree it may make sense that one is able to “reject” their round 1 prize. I however also see that it makes sense that all the required forms are still submitted for eligibility checking purposes - as in to make sure that you are eligible to receive a any prizes in the first place.

On the other hand, I’m of the view that not being eligible for a prize due to tax prohibitions should not subject the competitor to not being allowed to compete in following rounds, that seems a bit senseless (as if tax hadn’t disabled/disadvantaged anyone enough). I’d strongly agree with the idea of round 2 eligibility only be based on the competitive ability of the competitor, and if for any other reason beyond their competitive ability they could not receive the prize, it would be forfeited and handed down to the next leading competitor. Nothing wrong with that ?

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

Just for clarity - and I stand to be corrected-, the FW8BEN form to be completed is only dealing with international tax, which I’ve read to be at 30% by default. Hence, we only receive 70% of the prize if the default FW8BEN is applied (look at section 9 and 10 of the form). And so if you have a default FW8BEN configuration, you can be taxed within your own country on only 70% of the prize value (as that’s what you will officially receive). (please do correct me if I’m incorrect)

Question about the “Round 1 of Unity’s Obstacle Tower Challenge” email

Over 1 year ago

@ipv6 Please understand that tax is quite complicated (especially international taxes).
The tax forms (fw8ben) is required for them to release your prize to you, and this form has very little to do with your national tax. Understanding this may clear up many questions you’ve raised. That said, you will need to consult with a tax practitioner from your country to understand whether tax needs to be paid on the prize or not. As an example, here in South Africa, if we are granted a prize that is in relation to our field of work, SARS (the equivalent of IRS in South Africa) expect us to declare the value of the prize as part of our income tax. So if you work in the field of machine learning research, it is likely that you have to declare this within South Africa. On the other hand, if you generally don’t do this kind of work, it would not be considered income tax. (I am guessing the reasons for this is to tax sportsmen where their income usually consists of competition money. It’s quite unfortunate.)

1 + 2 have been answered above.

3.) Will have to await a competition official’s response ( @arthurj )
4.) I’m also keen to know about this, our identity documents are never tightly coupled to our domicilium
4.b) The email is directed at your team leader only
5.) I’ve submitted my documents on Friday - perhaps too late for a response before the weekend. And I’ve not yet received any response yet today. I’d say this warrants more time to be granted under reasonable circumstances.

Again, I’m not an official, just a competitor looking for fair competition procedures to be followed.

Let's talk sparse / dense rewards

Over 1 year ago

@arthurj catching up on my understanding of the problems that have been observed in sparse reward systems, the objective of this competition becomes clearer to me. I was initially pulled into this competition thinking from a reward engineering perspective - classical RL. I realise I may have been mistaken.

I would like to have clarity on technical limits within this competition. There are known solutions (or solution attempts) for generalising in sparse reward systems where by the agents generate their own rewards. It’s clear to me now that these kinds of algorithms are more sought after in this competition event, however, does this mean that manually engineering rewards for OT are not part of the allowed solutions? Or is it expected that the agent only self learns particular goals/rewards - for example previously I mentioned imitation learning as a possibility for the AI to learn, and although the agent may learn to generalise on the OT, it may not be a generic solution that can be applied to other problems (each environment would require imitation - and who knows, the top of this tower may not be reachable by humans (looking forward to seeing round 2))

A bit more thought in context of your robot, I’d essentially try to cut down on training time by breaking the back-flip into smaller rewardable steps for the agent, perhaps best thought of as a form of curriculum learning. First learn to jump high enough, and get a grip on gravity, and learn to rotate in a particular direction. This is assuming the robot is equipped with an accelerometer - as I doubt the robot would ever reach a back-flip goal in a reasonable/feasible time with only CV - I’m not saying it can’t happen, but the amount of effort/resource cost doesn’t appeal as an effective solution. Hence, I’d definitely exploit an accelerometer - and attach one if it didn’t have one. I mean, there is a lot I could do with an accelerometer, and our world consists more than just a visual perspective - we have access to physics and other interesting perceptions, and creating new ways of perception is all part of the challenge in most learning (including CV).

Libcuda.so.1 not available

Over 1 year ago

Glad you got sorted!

Let's talk sparse / dense rewards

Over 1 year ago

Yes, thanks a lot for your insights. The idea around robots not having availability of a dense reward function like in the back-flip example is particularly interesting.

Libcuda.so.1 not available

Over 1 year ago

Likely your docker environment. Are you certain it is installed in docker correctly?

Have you used docker run locally?

Let's talk sparse / dense rewards

Over 1 year ago

Hi Community

I’ve been contemplating the idea around sparse / dense rewards, and have been reading into @arthurj’s OT paper. I suppose the paper inspired my curiosity more than my agent’s so far… (ha ha)

Blockquote
Reward Function The Obstacle Tower is designed to be a sparse-reward environment. The environment comes with two possible configurations: a sparse and dense reward configuration. In the sparse reward configuration, a positive reward of +1 is provided only upon the agent reaching the exit stairs of a floor of the tower. In the dense reward version a positive reward of +0.1 is provided for opening doors, solving puzzles, or picking up keys. In many cases even the dense reward version of the Obstacle Tower will likely resemble the sparsity seen in previously sparse rewarding benchmarks, such as Montezuma’s Revenge (Bellemare et al. 2013). Given the sparse-reward nature of this task, we encourage researchers to develop novel intrinsic reward-based systems, such as curiosity (Pathak et al. 2017), empowerment (Mohamed and Rezende 2015), or other signals to augment the external reward signal provided by the environment.

I am wondering why this benchmark is aimed at being a sparsely rewarding environment. I ask this as my understanding right now is more that the sparse vs dense reward is actually more of a trade-off for learning time, and only that really. Sometimes we have the luxury to implement dense rewards that cut down the learning time, other times not.
I would actually find the reward engineering quite challenging to solve some of these tasks mentioned in the paper. Or rather, the articulation (implementation) of such a reward policy. I’ve been looking into RL for a number of years now, but I can’t say that I’ve been closely tied with the community up until this challenge, so I may be missing a bit on the intention as to why the tower has initially been setup in the direction of sparse rewards.
Could the tower not have been setup without a reward function, or rather that the reward function could be seen as more of a default?

When measuring the performance between agents, yes it makes sense that they can be measured on the same reward. However, then I stumble upon the thoughts in context of this competition, are we allowed to augment the reward function that our agent/neural net receives. As I understand it, just about anything goes - i.e I can build whatever neural net ensemble taking in any params that I feed it along with the data provided by the unity gym env. I could in fact go as far as setting up an imitation learning environment, where the rewards are not used at all. Are there any technical limits ?

Bouncing block bug

Over 1 year ago

@arthurj Awesome, thanks for the hard work! :smiley:

Agent dies often for no reson?

Over 1 year ago

I was reviewing my agent and just noticed this too. My agent is jumping and changing camera view constantly, then it falls through the floor and terminates the episode.

Not sure, but it may be in connection with another bug. Seems like the tile heights aren’t placed correctly. Bouncing block bug

If you look carefully on as the agent crosses the floor, there are some uneven bits - not sure if this was intended-, which will trigger the agent to seem as if it is jumping for a split second.

Bouncing block bug

Over 1 year ago

It’s quite rare, but hinders progress.

Bouncing block bug

Over 1 year ago

Found a strange placement bug of purple block, check it out.

Running Ubuntu 18.04.2 LTS

Obstacle Tower 1.3

Tags and Submissions

Over 1 year ago

I received a suggestion to start a submission tag with a bang,

!SUBMISSION-omit-env-build

Round 1 Qualification

Over 1 year ago

Hi @hanschoi86

If I interpreted correctly, you have to reach floor 5 on average.

They have made it such that if you have the abliity to train the algorithm correctly (without changing the algorithm itself), you should be able to obtain an agent that gets to the 5th floor after about 3 days of training.

At the moment more than 20 users have achieved this and hence the latter of the statement you referenced can be ignored.

Tags and Submissions

Over 1 year ago

@mohanty, cool. On that note, it may be useful for some form of mechanism to distinguish between the environment (docker) being built and the raw data. I tend to push just the data changes of my model, could save on cpu usage and build queue times when the server is busy.

Perhaps competition providers (or the platform guidelines) could structure a dedicated data folder, that when pushed to no building is done.

Realise it’s easier said than done :wink:

Best fortunes,

Tags and Submissions

Over 1 year ago

Hi @mohanty

It would be great if the tag used for submission was included somewhere within the submission - so as to know which build / issue belongs to which tag.

Also, it may be advisable to have @devops only process the last submitted tag. I push with a --tags and all tags were pushed. No idea what’s going on regarding issues that were created
This could end catastrophically for unsuspecting users, where they thought their last submission was their last tag, but in fact may not be so (I may be wrong).

And otherwise, it’s been interesting getting familiar with the platform, great work!

Best fortunes

Random

A machine learning lecturer

Over 1 year ago

A machine learning lecturer is checking up on whether her students have been doing their reinforcement-learning homework.

ML lecturer: You sit here with a look of horror on your face and empty handed, what happened to your homework assignment?
ML student: It got eaten by the TV.
ML lecturer: Progress.

TruthMaker has not provided any information yet.